🤖 AI Test Case Generator

Generate structured Playwright or Gherkin test cases from any URL or feature description — powered by Claude and OpenAI.

Try it in 60 seconds

No API key. No setup script. Pick your terminal and run the commands below.

Start from your home folder: a fresh terminal normally opens there. Never clone into C:\WINDOWS\System32 or other protected directories.

Windows (Command Prompt)

cd %USERPROFILE%
git clone https://github.com/FaraazSuffla/ai-test-case-generator.git
cd ai-test-case-generator
py -m pip install click rich python-dotenv beautifulsoup4 requests
py generate_tests.py --demo --describe "User login" --format playwright

Windows (PowerShell)

cd ~
git clone https://github.com/FaraazSuffla/ai-test-case-generator.git
cd ai-test-case-generator
py -m pip install click rich python-dotenv beautifulsoup4 requests
py generate_tests.py --demo --describe "User login" --format playwright

Mac / Linux

cd ~
git clone https://github.com/FaraazSuffla/ai-test-case-generator.git
cd ai-test-case-generator
pip3 install click rich python-dotenv beautifulsoup4 requests
python3 generate_tests.py --demo --describe "User login" --format playwright

You'll see 18 Playwright tests written to output/. Ready for your own app? See Full AI Mode below.

For full AI mode you'll also need the LLM SDKs and browser binaries. Run these once after the above (use py -m pip on Windows or pip3 on Mac/Linux in place of pip):
pip install anthropic openai playwright
playwright install chromium

Quick Start

Prerequisites

Python 3.10 or higher — check with py --version (Windows) or python3 --version (Mac/Linux)
git, for cloning the repository

Step 1 — Clone the repo

Open a fresh terminal in your home or projects folder, then run:

git clone https://github.com/FaraazSuffla/ai-test-case-generator.git
cd ai-test-case-generator

Step 2 — Run setup

This creates a virtual environment and installs all dependencies. Pick your terminal:

Windows (Command Prompt)

setup.bat

Windows (PowerShell)

.\setup.bat

PowerShell requires .\ before script names. CMD does not.

Mac / Linux

./setup.sh

Tip: If you get ModuleNotFoundError, run pip install -r requirements.txt manually.

Windows users: If you see a UnicodeEncodeError, enable Python's UTF-8 mode once per session, then retry. In Command Prompt run set PYTHONUTF8=1; in PowerShell run $env:PYTHONUTF8="1".
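
To see why the UnicodeEncodeError tip above helps, here is a minimal sketch of one common cause: a legacy Windows code page such as cp1252 cannot represent some characters that UTF-8 can. The helper name can_encode is hypothetical and is not part of this project.

```python
import sys


def can_encode(text: str, encoding: str) -> bool:
    """Return True if `text` can be written using `encoding`."""
    try:
        text.encode(encoding)
        return True
    except UnicodeEncodeError:
        return False


if __name__ == "__main__":
    check_mark = "\u2705"  # an emoji that cp1252 cannot represent
    # utf8_mode is 1 when PYTHONUTF8=1 (or -X utf8) is in effect
    print("UTF-8 mode enabled:", bool(sys.flags.utf8_mode))
    print("stdout encoding:", sys.stdout.encoding)
    print("cp1252 can encode the emoji:", can_encode(check_mark, "cp1252"))
    print("utf-8 can encode the emoji:", can_encode(check_mark, "utf-8"))
```

If the first line prints False on Windows and the error persists, setting PYTHONUTF8 as described above is the likely fix.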

Once setup is complete, see Shortcut Wrappers to skip typing py generate_tests.py every time.

Two Modes

Mode	What it does	API key needed?
`--demo`	Uses built-in templates against a real login page	❌ No
Full	AI-generates tests for any URL or description	✅ Yes

Demo Mode

No API key required. Runs against Practice Test Automation — a real login page with known credentials (student / Password123) and real selectors, so the generated tests are actually runnable.

Windows (Command Prompt)

py generate_tests.py --demo --url https://practicetestautomation.com/practice-test-login/ --format playwright
py generate_tests.py --demo --url https://practicetestautomation.com/practice-test-login/ --format gherkin
py generate_tests.py --demo --url https://practicetestautomation.com/practice-test-login/ --format playwright --report
py generate_tests.py --demo --describe "User registration" --format gherkin

Windows (PowerShell)

py generate_tests.py --demo --url https://practicetestautomation.com/practice-test-login/ --format playwright
py generate_tests.py --demo --url https://practicetestautomation.com/practice-test-login/ --format gherkin
py generate_tests.py --demo --url https://practicetestautomation.com/practice-test-login/ --format playwright --report
py generate_tests.py --demo --describe "User registration" --format gherkin

Mac / Linux

python3 generate_tests.py --demo --url https://practicetestautomation.com/practice-test-login/ --format playwright
python3 generate_tests.py --demo --url https://practicetestautomation.com/practice-test-login/ --format gherkin
python3 generate_tests.py --demo --url https://practicetestautomation.com/practice-test-login/ --format playwright --report
python3 generate_tests.py --demo --describe "User registration" --format gherkin

Demo mode produces 18 Playwright tests or 16 Gherkin scenarios across 4 categories per run.

Full AI Mode

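Set your API key first, then point the tool at any URL or description. The examples below set ANTHROPIC_API_KEY, which covers the default provider; the --provider openai example needs an OpenAI key as well (see .env.example for the variable names the tool expects).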

Tip: Run py generate_tests.py --check (Windows) or python3 generate_tests.py --check (Mac/Linux) first to verify your environment is ready.

Windows (Command Prompt)

set ANTHROPIC_API_KEY=your-key
py generate_tests.py --url https://your-app.com/login --format playwright
py generate_tests.py --url https://your-app.com/login --format gherkin --provider openai
py generate_tests.py --url https://your-app.com/login --format playwright --analyze
py generate_tests.py --describe "Shopping cart with coupon codes" --format playwright

Windows (PowerShell)

$env:ANTHROPIC_API_KEY="your-key"
py generate_tests.py --url https://your-app.com/login --format playwright
py generate_tests.py --url https://your-app.com/login --format gherkin --provider openai
py generate_tests.py --url https://your-app.com/login --format playwright --analyze
py generate_tests.py --describe "Shopping cart with coupon codes" --format playwright

Mac / Linux

export ANTHROPIC_API_KEY="your-key"
python3 generate_tests.py --url https://your-app.com/login --format playwright
python3 generate_tests.py --url https://your-app.com/login --format gherkin --provider openai
python3 generate_tests.py --url https://your-app.com/login --format playwright --analyze
python3 generate_tests.py --describe "Shopping cart with coupon codes" --format playwright
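
Before running the commands above, it can help to confirm that the key for your chosen provider is actually set. The sketch below is an assumption-laden illustration: ANTHROPIC_API_KEY comes from this README, while OPENAI_API_KEY is the variable the OpenAI SDK reads by default and is not confirmed as the name this tool uses. The function missing_key_message is hypothetical.

```python
import os
from typing import Mapping, Optional

# Assumed mapping. Only ANTHROPIC_API_KEY is named in this README;
# OPENAI_API_KEY is the OpenAI SDK default, not confirmed for this tool.
ENV_VAR_BY_PROVIDER = {
    "anthropic": "ANTHROPIC_API_KEY",
    "openai": "OPENAI_API_KEY",
}


def missing_key_message(provider: str,
                        env: Optional[Mapping[str, str]] = None) -> Optional[str]:
    """Return an error message if the provider's key is unset, else None."""
    env = os.environ if env is None else env
    var = ENV_VAR_BY_PROVIDER.get(provider)
    if var is None:
        return f"Unknown provider: {provider!r}"
    if not env.get(var, "").strip():
        return f"{var} is not set; set it before using --provider {provider}"
    return None


if __name__ == "__main__":
    for name in ENV_VAR_BY_PROVIDER:
        print(name, "->", missing_key_message(name) or "key found")
```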

All CLI Flags

Flag	Description	Default
`--url`	URL to generate tests for	—
`--describe`	Feature description to generate from	—
`--format`	`playwright` or `gherkin`	`playwright`
`--output-dir`	Directory to write generated test files into	`output`
`--provider`	`anthropic` or `openai`	`anthropic`
`--model`	Override the default model (`claude-sonnet-4-20250514` / `gpt-4o`)	—
`--analyze`	Extract accessibility tree for smarter tests	off
`--demo`	Use built-in templates, no API key needed	off
`--report`	Generate an HTML coverage report	off
`--open-report`	Generate report and open it in the browser immediately	off
`--run`	Generate tests then execute them immediately with pytest / behave	off
`--watch`	Re-generate whenever the target URL changes (requires `--url`)	off
`--watch-interval`	Polling interval for `--watch` mode (seconds)	`60`
`--conftest/--no-conftest`	Generate `conftest.py` with Playwright fixtures	on
`--no-retry`	Disable retry logic for API calls (useful in CI)	off
`--costs`	Show API usage and cost summary	off

Every generation run requires either --url or --describe. Utility commands such as --costs are shown in this README without them.
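
The requirement above could be enforced along these lines. This is only a sketch: it uses the standard library's argparse as a stand-in for Click (which the project lists in its tech stack), models just four of the flags from the table, and the function names are hypothetical.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Build a parser for a small subset of the flags listed in the table."""
    parser = argparse.ArgumentParser(prog="generate_tests.py")
    parser.add_argument("--url", help="URL to generate tests for")
    parser.add_argument("--describe", help="Feature description to generate from")
    parser.add_argument("--format", choices=["playwright", "gherkin"],
                        default="playwright")
    parser.add_argument("--demo", action="store_true")
    return parser


def parse(argv):
    """Parse argv and reject runs that give neither --url nor --describe."""
    parser = build_parser()
    args = parser.parse_args(argv)
    if not (args.url or args.describe):
        # parser.error prints usage to stderr and exits with status 2
        parser.error("provide --url or --describe")
    return args


if __name__ == "__main__":
    print(parse(["--demo", "--describe", "User login"]))
```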

Viewing Your Output

All generated files are saved to the output/ folder. Use --output-dir <path> to write elsewhere.

Windows (Command Prompt)

dir output\
type output\test_*.py
for %f in (output\report_*.html) do start %f

Windows (PowerShell)

dir output\
Get-Content output\test_*.py
start (Get-Item output/report_*.html).FullName

Mac / Linux

ls output/
cat output/test_*.py
open output/report_*.html        # macOS
xdg-open output/report_*.html   # Linux

Tip: Add --open-report to any command to generate and open the report automatically.
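
If you prefer one cross-platform way to inspect the output folder, a short Python script can do what the shell commands above do. This sketch assumes the test_*.py and report_*.html naming shown in those commands; the function names are hypothetical.

```python
import webbrowser
from pathlib import Path
from typing import List, Optional


def generated_tests(output_dir: str = "output") -> List[Path]:
    """Return generated Playwright test files, sorted by name."""
    return sorted(Path(output_dir).glob("test_*.py"))


def latest_report(output_dir: str = "output") -> Optional[Path]:
    """Return the most recently modified HTML report, or None if there is none."""
    reports = sorted(Path(output_dir).glob("report_*.html"),
                     key=lambda p: p.stat().st_mtime)
    return reports[-1] if reports else None


if __name__ == "__main__":
    for path in generated_tests():
        print(path)
    report = latest_report()
    if report is not None:
        # as_uri() needs an absolute path, hence resolve()
        webbrowser.open(report.resolve().as_uri())
```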

Shortcut Wrappers (optional)

After setup, use the testgen shortcut instead of typing py generate_tests.py every time.

Windows (Command Prompt)

testgen.bat --url https://example.com/login --format playwright
testgen.bat --demo --describe "login page" --format playwright

Windows (PowerShell)

.\testgen.bat --url https://example.com/login --format playwright
.\testgen.bat --demo --describe "login page" --format playwright

Mac / Linux

./testgen.sh --url https://example.com/login --format playwright
./testgen.sh --demo --describe "login page" --format playwright

Note: PowerShell requires .\ before script names. CMD does not. testgen.py is a stub — do not run it directly.

What Gets Generated

Every run produces tests across 4 categories:

Category	What's Tested	Example
✅ Happy Path	Valid inputs, expected flows	Login with correct credentials
❌ Negative	Invalid inputs, error handling	Wrong password, empty fields
🔄 Edge Cases	Security & unusual inputs	SQL injection, XSS, case sensitivity
📏 Boundary	Limits & extremes	500-char username, special characters

See example: what this tool generates vs. a typical junior test

Typical junior test:

def test_login(page):
    page.goto("https://practicetestautomation.com/practice-test-login/")
    page.fill("#username", "student")
    page.fill("#password", "Password123")
    page.click("#submit")
    assert "logged-in-successfully" in page.url

1 test. Happy path only.

What this tool generates:

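import re
from playwright.sync_api import Page, expect

BASE_URL = "https://practicetestautomation.com/practice-test-login/"

class TestLoginHappyPath:
    def test_successful_login_with_valid_credentials(self, page: Page):
        page.goto(BASE_URL)
        page.locator("#username").fill("student")
        page.locator("#password").fill("Password123")
        page.locator("#submit").click()
        # to_have_url accepts a compiled regex for partial URL matches
        expect(page).to_have_url(re.compile(r".*logged-in-successfully.*"))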

    def test_successful_login_displays_logout_button(self, page: Page):
        ...  # verifies Log out link is visible

class TestLoginNegative:
    def test_login_with_invalid_username(self, page: Page):
        ...  # verifies "Your username is invalid!" error

    def test_login_with_invalid_password(self, page: Page):
        ...  # verifies "Your password is invalid!" error

class TestLoginEdgeCases:
    def test_login_with_sql_injection_in_username(self, page: Page):
        ...  # verifies injection doesn't bypass auth

class TestLoginBoundary:
    def test_login_with_very_long_username(self, page: Page):
        ...  # sends 500-char string, verifies error

18 tests. 4 categories. Real selectors. Runnable.

Running the Generated Tests

Add --run to execute tests immediately after generation:

Windows (Command Prompt / PowerShell)

py generate_tests.py --demo --url https://practicetestautomation.com/practice-test-login/ --format playwright --run

Or run manually after generation:

py -m pip install playwright pytest
playwright install chromium
pytest output/test_practicetestautomation_com_practice_test_login_playwright.py -v

Mac / Linux

python3 generate_tests.py --demo --url https://practicetestautomation.com/practice-test-login/ --format playwright --run

Or run manually after generation:

pip3 install playwright pytest
playwright install chromium
pytest output/test_practicetestautomation_com_practice_test_login_playwright.py -v

Note on the HTML report status column: Tests show as "Pending" because the tool generates test code — it doesn't execute it. Use --run to execute and see real results in the terminal.
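
The file name passed to pytest in the manual commands above appears to be derived from the target URL and the output format. The sketch below is a guess at such a rule, reverse-engineered from that single file name; it is not taken from the project's source, it covers only the Playwright (.py) case, and output_filename is a hypothetical name.

```python
import re
from urllib.parse import urlparse


def output_filename(url: str, fmt: str = "playwright") -> str:
    """Build a test file name from a URL's host and path (assumed rule)."""
    parsed = urlparse(url)
    raw = f"{parsed.netloc}{parsed.path}".lower()
    # Collapse every run of non-alphanumeric characters into one underscore
    slug = re.sub(r"[^a-z0-9]+", "_", raw).strip("_")
    return f"test_{slug}_{fmt}.py"


if __name__ == "__main__":
    print(output_filename("https://practicetestautomation.com/practice-test-login/"))
```

If your own run produces a differently named file, list the output folder and pass that path to pytest instead.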

HTML Coverage Report

Add --report to any command to generate a standalone HTML report:

Windows (Command Prompt / PowerShell)

py generate_tests.py --demo --url https://practicetestautomation.com/practice-test-login/ --format playwright --report

Mac / Linux

python3 generate_tests.py --demo --url https://practicetestautomation.com/practice-test-login/ --format playwright --report

The report includes:

Total test count with category breakdown
Collapsible sections per category
Pass/fail status column (shows "Pending" until tests are executed)
Export to PDF button
Full generated code in a collapsible block
Dark theme, no external dependencies

Cost Tracking (Full Mode)

Every API call is logged. View your usage at any time:

Windows (Command Prompt / PowerShell)

py generate_tests.py --costs

Mac / Linux

python3 generate_tests.py --costs

Shows total requests, token counts, estimated cost, and a per-provider breakdown.
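
As a rough illustration of how such a summary can be computed from logged calls, here is a minimal sketch. The record fields (provider, input_tokens, output_tokens, cost_usd) and the sample values are invented for this example; the project's actual log format may differ.

```python
from collections import defaultdict


def summarize(entries):
    """Return (overall totals, per-provider totals) for a list of log records."""
    totals = {"requests": 0, "input_tokens": 0, "output_tokens": 0, "cost_usd": 0.0}
    by_provider = defaultdict(lambda: {"requests": 0, "cost_usd": 0.0})
    for entry in entries:
        totals["requests"] += 1
        totals["input_tokens"] += entry["input_tokens"]
        totals["output_tokens"] += entry["output_tokens"]
        totals["cost_usd"] += entry["cost_usd"]
        provider = by_provider[entry["provider"]]
        provider["requests"] += 1
        provider["cost_usd"] += entry["cost_usd"]
    return totals, dict(by_provider)


if __name__ == "__main__":
    # Made-up records, for illustration only
    sample = [
        {"provider": "anthropic", "input_tokens": 100, "output_tokens": 50, "cost_usd": 0.25},
        {"provider": "openai", "input_tokens": 200, "output_tokens": 100, "cost_usd": 0.5},
    ]
    overall, per_provider = summarize(sample)
    print(overall)
    print(per_provider)
```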

Project Structure

ai-test-case-generator/
├── generate_tests.py          # CLI entry point
├── src/
│   ├── analyzer.py            # Page analysis & accessibility tree
│   ├── generator.py           # LLM integration (Claude + OpenAI)
│   ├── conftest_generator.py  # Playwright fixture generator
│   ├── demo_templates.py      # Built-in templates for --demo mode
│   ├── report.py              # HTML coverage report generator
│   ├── cost_tracker.py        # API usage tracking
│   ├── prompts.py             # LLM prompt templates
│   └── formatters/
│       ├── playwright_fmt.py  # Saves .py test files
│       └── gherkin_fmt.py     # Saves .feature files
├── tests/                     # Unit tests for core logic
├── output/                    # Generated tests & reports land here
├── examples/                  # Sample outputs
└── requirements.txt

Recent Improvements

April 2026

Bug fixes:

Windows encoding — all file write operations (playwright_fmt, gherkin_fmt, cost_tracker) now explicitly use encoding="utf-8". Previously, running on a Windows system with a non-UTF8 default locale would cause UnicodeEncodeError when saving generated files.
Corrupt cost log — cost_tracker now handles a malformed cost_log.json gracefully (returns an empty log) instead of crashing the entire generation run.
Accessibility tree truncation — the A11Y_TREE_MAX_CHARS constant was defined in analyzer.py but the truncation slice still used a hardcoded 5000. The constant is now applied consistently.
Silent browser error — report.py was catching Exception silently on webbrowser.open. Narrowed to OSError so unexpected errors are no longer swallowed.
Built-in shadowing — report.py used format as a parameter name, shadowing Python's built-in format(). Renamed to fmt.
Redundant import — the generated conftest.py had a duplicate import os inside the base_url fixture. Removed.
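
Two of the fixes above (explicit UTF-8 writes and tolerating a corrupt cost log) follow a common pattern, sketched below. This is not the project's code: the function names are hypothetical, and representing an empty log as an empty list is an assumption.

```python
import json
from pathlib import Path


def load_cost_log(path):
    """Load the log, returning an empty list if it is missing or malformed."""
    try:
        data = json.loads(Path(path).read_text(encoding="utf-8"))
    except (FileNotFoundError, ValueError):
        # json.JSONDecodeError and UnicodeDecodeError both subclass ValueError
        return []
    return data if isinstance(data, list) else []


def save_cost_log(path, entries):
    """Write the log with an explicit encoding so the OS locale is irrelevant."""
    text = json.dumps(entries, ensure_ascii=False, indent=2)
    Path(path).write_text(text, encoding="utf-8")


if __name__ == "__main__":
    save_cost_log("cost_log_example.json", [{"note": "first call"}])
    print(load_cost_log("cost_log_example.json"))
```

Passing encoding="utf-8" on both the read and the write keeps the behavior the same on Windows systems with a non-UTF-8 default locale.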

Test coverage:

Unit tests added for cost_tracker, conftest_generator, and report — three modules that previously had no test coverage.
Test suite expanded from 108 to 176 tests, all passing.

Roadmap

Cypress support — Add Cypress as an output format alongside Playwright and Gherkin
Batch URL processing — Generate tests for multiple pages in a single run
Visual regression tests — Generate screenshot comparison tests
Custom prompt templates — Let users define their own generation prompts
Jira / Azure DevOps export — Push generated test cases directly to test management tools

Have a feature request? Open an issue or ⭐ the repo.

Tech Stack

Python 3.10+ · Anthropic SDK · OpenAI SDK · Playwright · BeautifulSoup4 · Rich · Click

License

MIT — see LICENSE.
