Deploy-ready: fully static app, client-side debate engine, zero backend needed by emregucerr · Pull Request #2 · emregucerr/ai-intelligence-squared

emregucerr · 2026-04-09T05:30:52Z

Summary

Makes the entire app fully static so it can be deployed for free on Vercel (or any static host) with zero backend, zero serverless functions, and zero timeout issues.

The Problem

The original /api/debate route ran the entire debate as a server-side function. A full debate makes 38 sequential/parallel LLM API calls to OpenRouter, taking 1-3 minutes. Vercel's free tier has a 60-second serverless function timeout — debates would frequently fail mid-way.

The Solution

Move the debate orchestration entirely to the client. The API key was already user-provided and stored in localStorage, so there's no security reason to proxy through a server. OpenRouter supports CORS and is designed for direct browser calls.

What Changed

New: src/lib/debate-engine.ts — Client-side debate engine

Identical debate logic (phases, prompts, scoring) extracted into a standalone module
Calls OpenRouter directly from the browser via fetch()
No timeout limits — debates run as long as they need
Abort support via AbortSignal
Typed event callbacks for real-time UI updates

Modified: src/app/arena/page.tsx

Imports and calls runDebate() from the client-side engine
Removed SSE stream parsing (no longer needed — events come via direct callbacks)
Cleaner type-safe score rendering (proper DebateScore type instead of Record<string, unknown>)

Deleted: src/app/api/debate/route.ts

The 570-line serverless API route is no longer needed

Build Output (Before → After)

Before:                          After:
┌ ○ /                            ┌ ○ /
├ ○ /_not-found                  ├ ○ /_not-found
├ ƒ /api/debate    ← DYNAMIC    ├ ○ /arena         ← NOW STATIC
├ ○ /arena                       ├ ● /debate/[id]
├ ● /debate/[id]                 └ ● /model/[id]
└ ● /model/[id]

All routes are now ○ (Static) or ● (SSG). Zero serverless functions.

Lint & Build

npm run lint — 0 errors, 0 warnings
npm run build — all 15 pages generate successfully

Also fixed (from first commit)

react-hooks/immutability error in DebateReplay.tsx
9 unused import/variable warnings across 5 files

…, audience, debate engine, scoring, tests Co-authored-by: Emre Gucer <emregucerr@users.noreply.github.com>

…question gen Co-authored-by: Emre Gucer <emregucerr@users.noreply.github.com>

Co-authored-by: Emre Gucer <emregucerr@users.noreply.github.com>

… visual assets Co-authored-by: Emre Gucer <emregucerr@users.noreply.github.com>

Co-authored-by: Emre Gucer <emregucerr@users.noreply.github.com>

… shift table Co-authored-by: Emre Gucer <emregucerr@users.noreply.github.com>

…ate leaderboard and judge analysis Co-authored-by: Emre Gucer <emregucerr@users.noreply.github.com>

… started guide Co-authored-by: Emre Gucer <emregucerr@users.noreply.github.com>

…bate history Co-authored-by: Emre Gucer <emregucerr@users.noreply.github.com>

Co-authored-by: Emre Gucer <emregucerr@users.noreply.github.com>

…for new API key Co-authored-by: Emre Gucer <emregucerr@users.noreply.github.com>

…ress) Co-authored-by: Emre Gucer <emregucerr@users.noreply.github.com>

Co-authored-by: Emre Gucer <emregucerr@users.noreply.github.com>

…oard FINAL RESULTS (45 real head-to-head debates): #1 Claude Opus 4.6 (Thinking) ELO: 1590 #2 Grok 4.20 (Reasoning) ELO: 1577 #3 Grok 4.20 Multi-Agent ELO: 1560 #4 Grok 4.20 ELO: 1546 #5 GPT-5.2 Chat ELO: 1546 #6 Claude Opus 4.6 ELO: 1508 #7 Gemini 3 Flash ELO: 1459 #8 Gemini 3 Pro ELO: 1430 #9 GPT-5.4 (High) ELO: 1407 #10 Gemini 3.1 Pro Preview ELO: 1377 Stats: 1,038 API calls, 9.2M input tokens, 1.2M output tokens, 0 errors Co-authored-by: Emre Gucer <emregucerr@users.noreply.github.com>

…PI route - Add maxDuration=60 to /api/debate route for Vercel serverless function timeout - Fix react-hooks/immutability error: replace mutable currentPhase with index-based check - Remove unused imports across 5 files (PERSONA_MAP, Brain, TrendingDown, etc.) - Remove unused variable (winnerName in RecentDebates) - All lint checks and builds pass cleanly Co-authored-by: Emre Gucer <emregucerr@users.noreply.github.com>

cursor

Cursor Bugbot has reviewed your changes and found 1 potential issue.

^{❌ Bugbot Autofix is OFF. To automatically fix reported issues with cloud agents, enable autofix in the Cursor dashboard.}

^{Reviewed by Cursor Bugbot for commit c56bf4e. Configure here.}

cursor · 2026-04-09T05:33:47Z

+                        await asyncio.sleep(delay)
+                        continue
+                    stats.errors += 1
+                    raise


Retry sleep holds semaphores, blocking all concurrency

Medium Severity

All four asyncio.sleep(delay) calls during retry backoff (lines 133, 143, 171, 181) execute inside the async with global_sem and async with model_sem context managers. This means during exponential backoff (up to 32 seconds), both semaphore slots are held, blocking other concurrent requests from proceeding. With only 10 global slots and 2 per-model slots, a single rate-limited request can starve the entire benchmark's concurrency for the duration of the sleep.

^{Reviewed by Cursor Bugbot for commit c56bf4e. Configure here.}

* Phase 0+1: Benchmark engine scaffolding - models, prompts, API client, audience, debate engine, scoring, tests Co-authored-by: Emre Gucer <emregucerr@users.noreply.github.com> * Fix: min max_tokens for OpenAI models (>=16), handle null content in question gen Co-authored-by: Emre Gucer <emregucerr@users.noreply.github.com> * Add benchmark runner.py for full 45-matchup benchmark Co-authored-by: Emre Gucer <emregucerr@users.noreply.github.com> * Phase 2+3: Web app with leaderboard, arena, SSE debate API, generated visual assets Co-authored-by: Emre Gucer <emregucerr@users.noreply.github.com> * Fix: remove nested git repo in web/, add all web files properly Co-authored-by: Emre Gucer <emregucerr@users.noreply.github.com> * Add results sync script, fix data loading, update synced data Co-authored-by: Emre Gucer <emregucerr@users.noreply.github.com> * Add RecentDebates sidebar, sync 2 debate results Co-authored-by: Emre Gucer <emregucerr@users.noreply.github.com> * Add debate replay pages with full transcript, vote analysis, and vote shift table Co-authored-by: Emre Gucer <emregucerr@users.noreply.github.com> * Generate full 45-debate sample results (2 real + 43 simulated), populate leaderboard and judge analysis Co-authored-by: Emre Gucer <emregucerr@users.noreply.github.com> * Add comprehensive README with architecture docs, model table, getting started guide Co-authored-by: Emre Gucer <emregucerr@users.noreply.github.com> * Add model detail pages: ELO, win rate, h2h records, judge profile, debate history Co-authored-by: Emre Gucer <emregucerr@users.noreply.github.com> * Add fallback for empty cross-exam questions before full benchmark run Co-authored-by: Emre Gucer <emregucerr@users.noreply.github.com> * Batch judge calls (3 at a time) to avoid credit pre-auth spikes Co-authored-by: Emre Gucer <emregucerr@users.noreply.github.com> * Fix: always load existing debates to skip them, dotenv override=True for new API key Co-authored-by: Emre Gucer <emregucerr@users.noreply.github.com> * Sync 3 real benchmark debates (debates 1-3 complete, debate 4 in progress) Co-authored-by: Emre Gucer <emregucerr@users.noreply.github.com> * Sync 5 real debates, benchmark running steadily (~$1/debate) Co-authored-by: Emre Gucer <emregucerr@users.noreply.github.com> * Sync 7 real debates Co-authored-by: Emre Gucer <emregucerr@users.noreply.github.com> * Sync 9 real debates (10th in progress) Co-authored-by: Emre Gucer <emregucerr@users.noreply.github.com> * Sync 11 real debates Co-authored-by: Emre Gucer <emregucerr@users.noreply.github.com> * Sync 15 real debates ($12.58 credits remaining) Co-authored-by: Emre Gucer <emregucerr@users.noreply.github.com> * Sync 17 real debates Co-authored-by: Emre Gucer <emregucerr@users.noreply.github.com> * Sync 20 real debates - past halfway mark Co-authored-by: Emre Gucer <emregucerr@users.noreply.github.com> * Sync 22 real debates (~50%) Co-authored-by: Emre Gucer <emregucerr@users.noreply.github.com> * Parallelize benchmark: run 3 debates concurrently in batches Co-authored-by: Emre Gucer <emregucerr@users.noreply.github.com> * Sync 29 real debates (parallel execution working, ~3x speedup) Co-authored-by: Emre Gucer <emregucerr@users.noreply.github.com> * Sync 32 real debates (auto top-up replenished credits) Co-authored-by: Emre Gucer <emregucerr@users.noreply.github.com> * 🏆 Complete benchmark: all 45 real debates finished, final ELO leaderboard FINAL RESULTS (45 real head-to-head debates): #1 Claude Opus 4.6 (Thinking) ELO: 1590 #2 Grok 4.20 (Reasoning) ELO: 1577 #3 Grok 4.20 Multi-Agent ELO: 1560 #4 Grok 4.20 ELO: 1546 #5 GPT-5.2 Chat ELO: 1546 #6 Claude Opus 4.6 ELO: 1508 #7 Gemini 3 Flash ELO: 1459 #8 Gemini 3 Pro ELO: 1430 #9 GPT-5.4 (High) ELO: 1407 #10 Gemini 3.1 Pro Preview ELO: 1377 Stats: 1,038 API calls, 9.2M input tokens, 1.2M output tokens, 0 errors Co-authored-by: Emre Gucer <emregucerr@users.noreply.github.com> --------- Co-authored-by: Cursor Agent <cursoragent@cursor.com> Co-authored-by: Emre Gucer <emregucerr@users.noreply.github.com>

The live arena now calls OpenRouter directly from the browser instead of proxying through a serverless API route. This eliminates: - Serverless function timeout limits (was 60s, debates take 1-3 minutes) - The need for any backend/serverless infrastructure - Server-side API key handling (key never leaves the browser) The entire app is now fully static (all routes are ○ Static or ● SSG). It can be deployed on any free static hosting: Vercel, Netlify, Cloudflare Pages, or even GitHub Pages. Changes: - New: src/lib/debate-engine.ts — client-side debate orchestration - Modified: src/app/arena/page.tsx — uses debate-engine instead of /api/debate - Deleted: src/app/api/debate/route.ts — no longer needed Co-authored-by: Emre Gucer <emregucerr@users.noreply.github.com>

cursoragent and others added 28 commits April 8, 2026 23:01

Phase 0+1: Benchmark engine scaffolding - models, prompts, API client…

c9e0324

…, audience, debate engine, scoring, tests Co-authored-by: Emre Gucer <emregucerr@users.noreply.github.com>

Fix: min max_tokens for OpenAI models (>=16), handle null content in …

dee4070

…question gen Co-authored-by: Emre Gucer <emregucerr@users.noreply.github.com>

Add benchmark runner.py for full 45-matchup benchmark

6af06de

Co-authored-by: Emre Gucer <emregucerr@users.noreply.github.com>

Phase 2+3: Web app with leaderboard, arena, SSE debate API, generated…

b6a77c0

… visual assets Co-authored-by: Emre Gucer <emregucerr@users.noreply.github.com>

Fix: remove nested git repo in web/, add all web files properly

3ea9bea

Co-authored-by: Emre Gucer <emregucerr@users.noreply.github.com>

Add results sync script, fix data loading, update synced data

f4b6e24

Co-authored-by: Emre Gucer <emregucerr@users.noreply.github.com>

Add RecentDebates sidebar, sync 2 debate results

09e5dd2

Co-authored-by: Emre Gucer <emregucerr@users.noreply.github.com>

Add debate replay pages with full transcript, vote analysis, and vote…

00d0b83

… shift table Co-authored-by: Emre Gucer <emregucerr@users.noreply.github.com>

Generate full 45-debate sample results (2 real + 43 simulated), popul…

eb5ace1

…ate leaderboard and judge analysis Co-authored-by: Emre Gucer <emregucerr@users.noreply.github.com>

Add comprehensive README with architecture docs, model table, getting…

4249d15

… started guide Co-authored-by: Emre Gucer <emregucerr@users.noreply.github.com>

Add model detail pages: ELO, win rate, h2h records, judge profile, de…

cfe495a

…bate history Co-authored-by: Emre Gucer <emregucerr@users.noreply.github.com>

Add fallback for empty cross-exam questions before full benchmark run

c369e2f

Co-authored-by: Emre Gucer <emregucerr@users.noreply.github.com>

Batch judge calls (3 at a time) to avoid credit pre-auth spikes

feefb4e

Co-authored-by: Emre Gucer <emregucerr@users.noreply.github.com>

Fix: always load existing debates to skip them, dotenv override=True …

97c0ab6

…for new API key Co-authored-by: Emre Gucer <emregucerr@users.noreply.github.com>

Sync 3 real benchmark debates (debates 1-3 complete, debate 4 in prog…

d271ef8

…ress) Co-authored-by: Emre Gucer <emregucerr@users.noreply.github.com>

Sync 5 real debates, benchmark running steadily (~$1/debate)

6efd4e6

Co-authored-by: Emre Gucer <emregucerr@users.noreply.github.com>

Sync 7 real debates

da87422

Co-authored-by: Emre Gucer <emregucerr@users.noreply.github.com>

Sync 9 real debates (10th in progress)

18a34bd

Co-authored-by: Emre Gucer <emregucerr@users.noreply.github.com>

Sync 11 real debates

64b1194

Co-authored-by: Emre Gucer <emregucerr@users.noreply.github.com>

Sync 15 real debates ($12.58 credits remaining)

734d381

Co-authored-by: Emre Gucer <emregucerr@users.noreply.github.com>

Sync 17 real debates

1e8b6c9

Co-authored-by: Emre Gucer <emregucerr@users.noreply.github.com>

Sync 20 real debates - past halfway mark

a42ef84

Co-authored-by: Emre Gucer <emregucerr@users.noreply.github.com>

Sync 22 real debates (~50%)

329d897

Co-authored-by: Emre Gucer <emregucerr@users.noreply.github.com>

Parallelize benchmark: run 3 debates concurrently in batches

302d46a

Co-authored-by: Emre Gucer <emregucerr@users.noreply.github.com>

Sync 29 real debates (parallel execution working, ~3x speedup)

c93cfaa

Co-authored-by: Emre Gucer <emregucerr@users.noreply.github.com>

Sync 32 real debates (auto top-up replenished credits)

03d0a97

Co-authored-by: Emre Gucer <emregucerr@users.noreply.github.com>

cursor Bot reviewed Apr 9, 2026

View reviewed changes

cursor Bot changed the title ~~Prepare for free Vercel deployment: fix lint, add API timeout~~ Deploy-ready: fully static app, client-side debate engine, zero backend needed Apr 9, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Deploy-ready: fully static app, client-side debate engine, zero backend needed#2

Deploy-ready: fully static app, client-side debate engine, zero backend needed#2
emregucerr wants to merge 29 commits into
mainfrom
cursor/deploy-vercel-e1df

emregucerr commented Apr 9, 2026 •

edited by cursor Bot

Loading

Uh oh!

cursor Bot left a comment

Uh oh!

cursor Bot Apr 9, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

emregucerr commented Apr 9, 2026 • edited by cursor Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

The Problem

The Solution

What Changed

Build Output (Before → After)

Lint & Build

Also fixed (from first commit)

Uh oh!

cursor Bot left a comment

Choose a reason for hiding this comment

Uh oh!

cursor Bot Apr 9, 2026

Choose a reason for hiding this comment

Retry sleep holds semaphores, blocking all concurrency

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

emregucerr commented Apr 9, 2026 •

edited by cursor Bot

Loading