AI app builder comparison 2026: Lovable, v0, Bolt, Cursor, Replit, Claude Code, Codex, Windsurf

The AI builder space sorted itself in 2026. Eight tools matter for most teams shipping software with an agent. They split cleanly into three groups: app generators that produce a working app from a prompt (Lovable, v0, Bolt, Replit), agent IDEs that pair-program inside an editor (Cursor, Windsurf), and agent CLIs that run headless against your repo (Claude Code, Codex). Knowing which group you need is most of the decision.

The short verdict

App generators if you don't have code yet. Start with v0 for UI-led work, Lovable for CRUD apps, Bolt for stack-flexible scaffolding, Replit for non-JS or always-on services.
Agent IDEs for ongoing development with a visible diff. Cursor for VS Code familiarity, Windsurf for stronger plan-then-execute flow.
Agent CLIs for headless work, scripted runs, CI loops, and any task you want to compose with shell tools. Claude Code for local-tree work, Codex for GitHub-PR-driven work.

Most experienced teams in 2026 use one tool from each group: a generator for v1, an IDE for daily work, and a CLI for headless tasks.

The full table

Tool	Group	Best for	Default stack	Deploy	Pricing (mid-2026)
Lovable	Generator	CRUD apps, SaaS, internal tools	React + Supabase	Lovable / Vercel	$20+/mo
v0	Generator	UI, marketing, design-led work	Next.js + shadcn	Vercel	$20+/mo
Bolt	Generator	Multi-framework prototypes, full-stack scaffolds	Pick (Next.js, Astro, SvelteKit, Vite)	Netlify / Vercel	Token-based
Replit Agent	Generator	Python, Go, services, bots	Multi-language	Replit Deployments	$20/mo Core
Cursor	Agent IDE	Daily coding, multi-file edits, code review	VS Code-based	n/a	$20/mo Pro
Windsurf	Agent IDE	Plan-then-execute, multi-step refactors	Codeium IDE	n/a	$15/mo Pro
Claude Code	Agent CLI	Headless runs, plan-driven tasks, scripted automation	CLI in your terminal	n/a	Usage-based
Codex	Agent CLI	GitHub-PR workflow, async team review	ChatGPT + GitHub	n/a	Bundled with ChatGPT

Which generator to pick (4-way)

Question	Answer is...	Pick
Does it need a database and auth on day one?	Yes	Lovable
Is the product mostly screens and marketing surfaces?	Yes	v0
Do you want stack flexibility (not just Next.js)?	Yes	Bolt
Is the project Python, Go, a bot, or always-on?	Yes	Replit

If two answers come back yes, the question above breaks the tie (database tilts to Lovable; UI tilts to v0; stack flexibility tilts to Bolt; non-JS tilts to Replit).

Which agent IDE to pick (2-way)

Dimension	Cursor	Windsurf
Editor DNA	VS Code fork; nearly all extensions work	Codeium's own IDE; fewer extensions
Agent flow	Inline + Composer	Cascade (plan-first)
Community	Largest in 2026	Growing
Best for	VS Code muscle memory, plugin reliance	Plan-driven multi-step refactors

Pick by editor preference more than by agent quality. Both run frontier models; the harness shapes the daily experience.

Which agent CLI to pick (2-way)

Dimension	Claude Code	Codex
Surface	Terminal CLI	ChatGPT app + GitHub
Output shape	Edits in your working tree	Pull requests
Scripting	Native	Limited
Best for	Headless runs, local control, CI hooks	Async PR review culture

Pick by where you want the agent to live. Local tree: Claude Code. GitHub PRs: Codex.

What changed in 2026 from 2025

A few shifts matter for picking today:

Lovable hit feature-parity with Bolt on stack flexibility for the React side, but stayed committed to Supabase. Lovable still wins for CRUD; Bolt still wins for non-Supabase work.
v0 added Notebooks for iterating components in long chains, which closed the multi-component gap with Bolt for design-heavy projects.
Cursor and Windsurf converged on agent quality. The community gap is bigger than the agent gap in mid-2026.
Claude Code's plan mode became a first-class feature, narrowing the gap with Codex for structured task work.
Codex got tighter GitHub integration including review-comment replies, which made it a real teammate for async teams.

The takeaway: the gaps closed faster than expected, and the right pick is now more about workflow fit than tool capability.

A real example: same project, four tool choices

Filled example

A real, ready-to-customize version

Project: OnboardWeave, a B2B SaaS for managing customer-onboarding checklists. Companies create a checklist template, assign it to each new customer, and track completion across email reminders and a customer-facing portal.

The four ways to ship it:

1. Lovable (v1 in a weekend, ~$25/mo) Lovable scaffolds Supabase tables (organizations, templates, checklists, tasks, customers), wires auth + RLS, and produces a working dashboard plus a customer-facing portal. The first usable version lands in ~6 hours of prompting.

2. v0 + manual Supabase wiring (v1 in 2 weekends, ~$30/mo) v0 produces a polished dashboard and customer-portal UI. You wire Supabase by hand using the v0 components as the front end. Better-looking than Lovable's output; slower to first working version.

3. Bolt with Next.js + Drizzle + Neon (v1 in a weekend, ~$30/mo) Bolt scaffolds the full Next.js project including the Drizzle schema, the routes, and the deploy to Vercel. The agent makes more stack choices than Lovable did; you adjust them as needed.

4. Cursor + a hand-written PRD (v1 in 2 weekends, ~$25/mo) You write a detailed PRD and let Cursor implement it task by task. Highest control, slowest to v1, best long-term codebase quality. Fits a team that plans to iterate on this product for years.

The right pick depends on what happens next.

If OnboardWeave is a side project that needs to demo well by next Friday: Lovable.
If it's a real startup that will live 5 years: Cursor (or Lovable for v1, then Cursor for everything else).
If you already know your stack and want stack flexibility: Bolt.
If the product needs a polished marketing site before the actual app: v0 for the marketing, then Lovable for the app.

Acceptance metrics that apply to all four approaches:

10 pilot companies create a checklist template in under 15 minutes from signup
Customer portal load time under 2 seconds on a 4G connection
Email reminder send-rate over 99% across a 30-day window

How to think about cost across the eight

For a single developer:

One generator subscription ($20/mo) + one agent IDE subscription ($20/mo) = $40/mo. This is the floor for serious work.
Add Claude Code or Codex as needed. Claude Code's usage-based pricing tracks how much headless work you do. Codex is bundled with ChatGPT Pro.
Most working solos spend $40 to $80/mo across 2 to 3 tools. Heavy headless users hit $100 to $150/mo.

For teams (3 to 10 developers):

Standardize on one agent IDE across the team (Cursor or Windsurf). Per-seat pricing scales linearly.
The generator is usually a personal preference per developer; teams rarely standardize there.
The CLI is the team's automation layer - pick Claude Code if local-tree work dominates, Codex if you're GitHub-first.

How to pick when you have 30 minutes

Pick a generator. Use the 4-way table above. Spend 20 minutes scaffolding a real project.
Pick an agent IDE. Open Cursor and Windsurf, run the same task in both. Whichever feels less frustrating wins.
Pick a CLI. Open Claude Code and run a small plan. If your team is GitHub-PR-driven, try Codex too.
Lock in for 30 days. Tools evolve fast; the right pick can shift. Don't commit beyond 30 days on any one.
Write a PRD format that works across all three groups. It should have a TL;DR, a file tree (for IDEs/CLIs), acceptance criteria with numbers, and a "for generator: stack defaults" appendix.
Plan exits. Every tool exports to GitHub. Never let work live in a single vendor for more than 30 days without an export checkpoint.

FAQ

Should I really use three tools at once?

For serious work, yes. The three groups solve different problems. A generator is the fastest path to v1. An agent IDE is the best surface for ongoing daily work. A CLI is the only fit for headless and CI tasks. Trying to do all three with a single tool means accepting it'll be mediocre at two of the three.

Which combination do most working developers settle on?

The most common mid-2026 stack is Lovable + Cursor + Claude Code. Generator for v1, IDE for daily, CLI for headless. The second most common is v0 + Cursor + Codex for teams that live in Vercel + GitHub. Both stacks work well; the differences come down to preference.

What about Aider, Plandex, or other agent CLIs?

They're valid alternatives to Claude Code. Aider is closer to Claude Code in spirit (terminal CLI, plan + edit loop). Plandex is more plan-driven and structured. For most teams, sticking with the two big-vendor CLIs (Claude Code, Codex) reduces decision fatigue without sacrificing capability. Aider and Plandex are worth a look if you specifically want their flavor.

How does Cursor + Composer compare to Claude Code + plan mode?

Both let the agent edit across many files with an upfront plan. The interaction is different. Composer surfaces diffs in tabs; Claude Code's plan mode writes files and reports. For multi-file work you want to review tightly, Composer is safer. For multi-file work you trust the agent on, Claude Code is faster.

Will any of these be obsolete in a year?

Some, probably. The category has been consolidating, and a couple of names on this list may be acquired or rebranded by mid-2027. That's why the 30-day commitment rule matters: pick tools, ship work, and stay loose enough to swap when the market moves. The output (your code in GitHub) is the asset; the tools are interchangeable.