`>_ CodeKraft`

Your code has bugs. CodeKraft doesn't just find them — it teaches you why.

An AI-powered Python debugging pipeline that delivers mentor-style hints in under 300ms, combining static analysis, a fine-tuned CodeBERT transformer, and optional GPT-3.5 Socratic guidance.

The Problem

Every CS student knows the pain: you write Python code, it breaks, and the error message is cryptic garbage. You paste it into ChatGPT and get a full solution — but you didn't learn anything.

CodeKraft takes a different approach. Instead of handing you the answer, it gives you a hint — like a teaching assistant sitting next to you at 2am.

How It Works — 4 Layers, Under 300ms

                     Your Buggy Python Code
                              |
                              v
               =============================
               |   LAYER 1: Static Analyzer  |   < 5ms
               |   Python AST parsing        |
               |   7 pattern detectors       |
               |   Zero dependencies         |
               =============================
                              |
                              v
               =============================
               |   LAYER 2: CodeBERT         |   ~250ms
               |   125M parameter transformer|
               |   Fine-tuned on QuixBugs    |
               |   Error type classification |
               =============================
                              |
                              v
               =============================
               |   LAYER 3: Rule Engine      |   < 1ms
               |   Pre-computed mentor hints |
               |   Curated Socratic prompts  |
               |   Instant dictionary lookup |
               =============================
                              |
                     RESPONSE SENT (< 300ms)
                     User sees feedback NOW
                              |
                              v  (async, bonus)
               =============================
               |   LAYER 4: GPT-3.5 Enricher|   ~800ms
               |   Socratic follow-up        |
               |   Context-aware refinement  |
               |   Never blocks the user     |
               =============================

Why 4 layers? Because calling an LLM for every request adds 800ms+ of latency. Our first 3 layers give you an answer instantly. The LLM is just the cherry on top.

Evaluation Metrics

Metric	Score	What It Tells You
CodeBLEU	52.3%	Structural + semantic alignment with ground-truth classifications
Token-level F1	75.4%	Precision-recall balance across Python error patterns
Inference Latency	~0.25s	Per-snippet speed on NVIDIA Tesla T4 — real-time ready

The fine-tuned CodeBERT model was evaluated on the QuixBugs benchmark dataset using standard code intelligence metrics. A CodeBLEU of 52.3% demonstrates strong structural understanding beyond surface-level token matching. The Token-level F1 of 75.4% confirms the model reliably identifies error patterns with high precision and recall. At 0.25 seconds per snippet on a Tesla T4, the system is fast enough for live IDE integration.

See It In Action

Request

curl -X POST https://codekraft.vercel.app/api/analyze \
  -H "Content-Type: application/json" \
  -d '{"code": "def greet(name):\n    print(\"Hello \" + nme)"}'

Response (< 300ms)

{
  "parseable": true,
  "findings": [
    {
      "rule_id": "SA001",
      "category": "NameError",
      "severity": "error",
      "line": 2,
      "message": "Name 'nme' is used but never defined in this scope.",
      "suggestion": "Check the spelling of 'nme' -- did you mean a similar variable?",
      "confidence": 0.92
    }
  ],
  "error_category": "NameError",
  "classification_confidence": 0.87,
  "mentor_hint": "A variable name is being used that doesn't exist in this scope. This usually means a typo.",
  "follow_up": "Compare each variable name character-by-character with where it was defined.",
  "common_fix": "Check spelling of variable names and ensure they are defined before use.",
  "latency": {
    "total_ms": 287.4,
    "static_analysis_ms": 2.1,
    "classifier_ms": 284.8,
    "rule_engine_ms": 0.5
  }
}

Notice: no code solution is given. Just a hint. The student still has to think.

API Reference

Endpoint	Method	Purpose	Latency
`/api/analyze`	`POST`	3-layer instant analysis	< 300ms
`/api/enrich`	`POST`	GPT-3.5 Socratic refinement	~800ms
`/api/health`	`GET`	Service status + config	instant
`/api/metrics`	`GET`	Rolling performance dashboard	instant
`/api/categories`	`GET`	All supported error types	instant

Project Structure

CodeKraft/
|
|-- CodeKraft_Final.ipynb          # Model training (Colab)
|
|-- api/
|   '-- index.py                   # FastAPI orchestrator
|
|-- lib/
|   |-- static_analyzer.py         # L1: AST pattern detection (7 detectors)
|   |-- classifier.py              # L2: CodeBERT via HF Inference API
|   |-- rule_engine.py             # L3: Pre-computed mentor hints
|   |-- llm_enricher.py            # L4: Async GPT enrichment
|   '-- metrics.py                 # Latency tracking + aggregation
|
|-- frontend/
|   |-- src/
|   |   |-- App.jsx                # Terminal-themed React UI
|   |   |-- App.css                # Dark terminal styling
|   |   |-- index.css              # CSS variables + theme
|   |   '-- main.jsx               # Entry point
|   |-- index.html
|   |-- package.json
|   '-- vite.config.js
|
|-- vercel.json                    # Serverless deployment config
|-- requirements.txt               # Python deps
|-- .env.example                   # API key template
'-- .gitignore

Model Deep Dive

Architecture

Component	Detail
Base Model	`microsoft/codebert-base` (RoBERTa-based, 125M params)
Pre-training Data	6.4M bimodal code-text pairs from GitHub
Fine-tuning Task	Sequence Classification (error type)
Dataset	QuixBugs benchmark (classic single-line bugs)
Classification Head	Linear layer over `[CLS]` token

Training Configuration

Hyperparameter	Value
Epochs	4
Batch Size	4 (train & eval)
Learning Rate	5e-5 (AdamW)
Scheduler	Linear decay
Max Sequence Length	512 tokens
Train/Test Split	80/20
Evaluation Strategy	Every epoch
Model Selection	Best validation loss

Error Categories Detected

NameError          WrongOperator       OffByOneError
WrongComparator    WrongVariable       WrongBaseCase
MissingReturn      WrongInitialization IndexError
TypeError          SyntaxError         LogicError
InfiniteLoop       WrongMethodCall     ... and more

Deploy Your Own

Prerequisites

Hugging Face account (free)
OpenAI API key (optional, for Layer 4)
Vercel account (free)

Quick Deploy

1. Upload fine-tuned model to Hugging Face (after Colab training):

model.push_to_hub("your-username/codekraft-codebert")
tokenizer.push_to_hub("your-username/codekraft-codebert")

2. Push to GitHub:

git add . && git commit -m "CodeKraft v2" && git push origin main

3. Import into Vercel → vercel.com → Add New Project → Import repo → Framework: Other → Deploy

4. Add secrets (Vercel → Settings → Environment Variables):

Variable	Value	Required?
`HF_API_KEY`	Your Hugging Face token	Yes
`HF_MODEL`	`microsoft/codebert-base`	Yes
`OPENAI_API_KEY`	Your OpenAI key	No (Layer 4 only)
`ALLOWED_ORIGINS`	`*`	Yes

5. Redeploy and you're live.

Local Development

# Backend
pip install -r requirements.txt
cp .env.example .env              # add your API keys
uvicorn api.index:app --reload --port 8000

# Frontend (separate terminal)
cd frontend && npm install && npm run dev

Tech Stack

Layer	Technology	Role
ML Model	CodeBERT (microsoft/codebert-base)	Error classification
API	FastAPI + Uvicorn	Async Python backend
Frontend	React 18 + Vite	Terminal-themed UI
Inference	Hugging Face Inference API	Hosted model serving
Enrichment	OpenAI GPT-3.5-turbo	Socratic hint generation
Hosting	Vercel	Serverless deployment
Analysis	Python AST module	Zero-dependency static analysis

Why CodeKraft?

Feature	ChatGPT	Linters	CodeKraft
Finds the bug	Yes	Yes	Yes
Explains why	Sometimes	No	Always
Gives hints, not answers	No	No	Yes
Real-time (< 300ms)	No (~2s)	Yes	Yes
Learns your patterns	No	No	Yes
Works offline (L1-L3)	No	Yes	Yes

Built with caffeine and curiosity by Karan Patel

CodeKraft doesn't write your code. It makes you a better coder.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

`>_ CodeKraft`

Your code has bugs. CodeKraft doesn't just find them — it teaches you why.

The Problem

How It Works — 4 Layers, Under 300ms

Evaluation Metrics

See It In Action

Request

Response (< 300ms)

API Reference

Project Structure

Model Deep Dive

Architecture

Training Configuration

Error Categories Detected

Deploy Your Own

Prerequisites

Quick Deploy

Local Development

Tech Stack

Why CodeKraft?

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
api		api
frontend		frontend
lib		lib
.env.example		.env.example
.gitignore		.gitignore
CodeKraft_DD_NN_Final.ipynb		CodeKraft_DD_NN_Final.ipynb
README.md		README.md
requirements.txt		requirements.txt
runtime.txt		runtime.txt
vercel.json		vercel.json

Folders and files

Latest commit

History

Repository files navigation

>_ CodeKraft

Your code has bugs. CodeKraft doesn't just find them — it teaches you why.

The Problem

How It Works — 4 Layers, Under 300ms

Evaluation Metrics

See It In Action

Request

Response (< 300ms)

API Reference

Project Structure

Model Deep Dive

Architecture

Training Configuration

Error Categories Detected

Deploy Your Own

Prerequisites

Quick Deploy

Local Development

Tech Stack

Why CodeKraft?

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

`>_ CodeKraft`

Packages