INHERITED FROM Helix Constitution

Base agent rules live in the Helix Constitution submodule at the parent project's constitution/AGENTS.md and the universal constitution/Constitution.md it references. READ THOSE FIRST. The base file is authoritative for any topic not covered here. Module-specific rules below extend them; they never weaken them.

Critical universal rules every CLI agent (Claude Code, Cursor, Aider, Codex, Gemini CLI) MUST honour while working in this module:

No bluffing. Every PASS carries positive evidence. Constitution §11.4.
Mutation-paired gates. Every new gate has a paired mutation proving it catches regressions. Constitution §1.1.
No guessing language (likely, probably, maybe, seems). Constitution §11.4.6.
Credentials never tracked. .env patterns git-ignored; runtime-load only. Constitution §11.4.10.
Never force-push. Force-push requires explicit per-session authorization AND a green §9.1.5 post-op gate. Constitution §9.
CONTINUATION.md kept in sync in every non-trivial commit. Constitution §12.10.
60% RAM cap. Heavy work wrapped in bounded execution scope. Constitution §12.6.

Canonical reference: https://github.com/HelixDevelopment/HelixConstitution

AGENTS.md — Project Authoritative Agent Guide

INHERITED FROM constitution/AGENTS.md

All rules in constitution/AGENTS.md (and the constitution/Constitution.md it references) apply unconditionally. This file's rules below extend them — they MUST NOT weaken any inherited rule. See parent root CLAUDE.md §6.AD for the Lava-specific incorporation context (29th §6.L cycle, 2026-05-14) and §6.AD-debt for the implementation-gap inventory. Use constitution/find_constitution.sh from the parent project root to resolve the absolute path of the submodule from any nested location.

Project Agent Guidelines

Version: 3.0.0 (Updated with full architecture audit) Date: 2026-04-30 Scope: All AI agents, human contributors, and automated processes working on the project Authority: Derived from the parent project's AGENTS.md with project-specific enhancements

Project Overview

The project is an enterprise-grade distributed AI development platform built in Go. It enables intelligent task division, work preservation, cross-platform development workflows, and multi-provider LLM integration through a unified REST API, CLI, Terminal UI, Desktop, and Mobile client architecture.

Current Status: The internal/ foundation is largely solid (auth, database, server, worker, task, workflow, tools, editor, notification, MCP, verifier are real implementations). Critical bluff and stub areas remain in select entry points and peripheral packages. All agents MUST prioritize zero-bluff implementation.

LLMsVerifier Integration Status: internal/verifier/ package is now implemented with REST API client, two-tier cache, circuit breaker health monitor, background poller, score adapter, and event publisher. BLUFF-002 (hardcoded CLI models) and BLUFF-004 (hardcoded external models) are FIXED. BLUFF-005 (scoring ignores verifier data) is FIXED in ModelManager.SelectOptimalModel().

Key Features:

Distributed Computing: SSH-based worker pools with health monitoring, auto-installation, and consensus
Multi-Provider LLM Integration: 15+ providers (OpenAI, Anthropic, Gemini, Ollama, Azure, Bedrock, Groq, Mistral, Cohere, xAI, DeepSeek, Qwen, OpenRouter, HuggingFace, Llama.cpp)
Development Workflows: Automated planning, building, testing, refactoring with real shell execution
Task Management: Intelligent task division with priorities, dependencies, checkpointing, and Redis caching
MCP Protocol: Full Model Context Protocol server over WebSocket with tool dispatch
Multi-Client Architecture: REST API (Gin), Cobra CLI, Terminal UI (tview), Desktop (Fyne), Mobile (gomobile), WebSocket
Memory Systems: In-memory, filesystem, Redis, Memcached, Cognee, ChromaDB, Qdrant, Weaviate integrations
Advanced Editor: Multi-format code editing (diff, whole-file, search/replace, line-based) with backups
Tools Ecosystem: 40+ tools across filesystem, shell, web, browser, mapping, multiedit, confirmation, notebook, git
Notifications: Multi-channel support (Slack, Email, Telegram, Discord, Yandex Messenger, Max)

Technology Stack

Core Technologies:

Language: Go 1.24.0 with toolchain go1.24.9
Module: dev.helix.code
HTTP Framework: Gin v1.11.0
Authentication: JWT v4.5.2, bcrypt + argon2
Database: PostgreSQL 15+ via pgx/v5 (optional)
Cache: Redis 7+ via go-redis/v9 (optional)
Configuration: Viper v1.21.0
CLI Framework: Cobra v1.8.0
Testing: Testify v1.11.1

UI Technologies:

Desktop: Fyne v2.7.0
Terminal UI: tview v0.42.0
Mobile: gomobile bindings

External Integrations:

Browser Automation: chromedp v0.14.2
Web Scraping: goquery v1.10.3
Tree-sitter: go-tree-sitter
Identity: Azure SDK, AWS SDK v2
Vector/Memory: Cognee, ChromaDB, Qdrant, Weaviate clients
Container Orchestration: digital.vasic.containers (vasic-digital/Containers submodule)

Working Directory & Build System

CRITICAL: All build and test commands must be run from the helix_code/ subdirectory, not the repository root.

cd <project_root>

Build Commands

Command	Purpose
`make build`	Build server binary to `bin/helixcode`
`make test`	Run `go test -v ./...`
`make test-all`	Run tests + coverage + benchmarks + docs
`make test-coverage`	Generate coverage report
`make test-benchmark`	Run Go benchmarks
`make logo-assets`	Generate logo assets (required before first build)
`make setup-deps`	Run `go mod tidy`
`make fmt`	Run `go fmt ./...`
`make lint`	Run `golangci-lint run ./...`
`make clean`	Clean build artifacts
`make dev`	Start development server
`make prod`	Cross-platform production build
`make mobile`	Build iOS + Android targets
`make aurora-os`	Build Aurora OS target
`make harmony-os`	Build Harmony OS target

Full Infrastructure Test Commands

Command	Purpose
`make test-infra-up`	Start full Docker test infrastructure
`make test-infra-down`	Stop full Docker test infrastructure
`make test-full`	ALL tests with real infrastructure (zero skips)
`make test-unit-full`	Unit tests with real services
`make test-integration-full`	Integration tests with `-tags=integration`
`make test-e2e-full`	E2E challenge tests via runner
`make test-security-full`	Security test suite
`make test-load-full`	Load tests
`make test-complete`	Sequential run of all full test types
`make coverage-full`	Coverage with full infrastructure

Containerized Builds (NO Host Dependencies)

Command	Purpose
`make container-builder-image`	Build the builder container image
`make container-build`	Build application inside container
`make container-test`	Run tests inside container
`make container-lint`	Run linter inside container
`make container-shell`	Interactive shell in builder container
`make container-dev-up`	Start containerized dev environment
`make container-dev-down`	Stop containerized dev environment
`make container-release`	Full release build in container
`./scripts/containers/build-in-container.sh`	Convenience wrapper script

The builder container includes: Go 1.24, gcc, postgresql-client, redis, docker-cli, golangci-lint, and all build tools. The only host requirement is Docker/Podman.

Standalone Test Scripts

Script	Purpose
`./run_tests.sh --unit`	Unit tests
`./run_tests.sh --integration`	Integration tests
`./run_tests.sh --e2e`	E2E tests
`./run_tests.sh --coverage`	Coverage analysis
`./run_tests.sh --security`	Security tests
`./run_all_tests.sh`	Orchestrates ALL suites sequentially
`./run_integration_tests.sh`	DB integration tests with Docker

Single Test Execution

go test -v -run TestName ./path/to/package
go test -v -tags=integration ./internal/database
cd tests/e2e/challenges && go run cmd/runner/main.go -challenge ascii-art-generator-001 -providers ollama

Architecture & Code Organization

helix_code/
├── cmd/                          # Application entry points
│   ├── server/main.go            # HTTP server entry point
│   ├── cli/main.go               # Legacy flag-based CLI client
│   ├── root.go                   # Cobra root command (`helix`)
│   ├── main_commands.go          # `helix start`, `helix auto`
│   ├── other_commands.go         # `helix server`, `helix version`, etc.
│   ├── local-llm.go              # `helix local-llm` command tree
│   ├── local-llm-advanced.go     # Advanced local-llm commands
│   ├── helix-config/main.go      # Dedicated config management CLI
│   ├── security-test/main.go     # Simulated security test runner
│   ├── security-fix/main.go      # Security fix wrapper
│   ├── security-fix-standalone/main.go  # Standalone security scanner
│   ├── performance-optimization/main.go # Performance optimizer
│   ├── performance-optimization-standalone/main.go # Standalone perf simulator
│   └── config-test/main.go       # Config hot-reload test utility
│
├── internal/                     # Internal packages (~40 packages)
│   ├── auth/                     # JWT authentication, bcrypt/argon2, sessions
│   ├── llm/                      # LLM provider implementations (15+ providers)
│   │   ├── providers/            # Per-provider HTTP clients
│   │   ├── compression/          # Context compression
│   │   └── vision/               # Vision/multimodal support
│   ├── provider/                 # Provider abstractions
│   ├── providers/                # Provider management
│   ├── worker/                   # SSH-based worker pool, health checks
│   ├── task/                     # Task queues, dependencies, checkpoints
│   ├── server/                   # Gin HTTP server, routes, middleware
│   ├── database/                 # PostgreSQL pgx pool, schema initialization
│   ├── redis/                    # go-redis wrapper with graceful degradation
│   ├── tools/                    # 40+ tool ecosystem registry
│   │   ├── filesystem/           # fs_read, fs_write, fs_edit, glob, grep
│   │   ├── shell/                # shell, shell_background with sandbox
│   │   ├── web/                  # web_fetch, web_search
│   │   ├── browser/              # browser_launch, browser_navigate, browser_screenshot
│   │   ├── multiedit/            # Transactional multi-file editing
│   │   └── git/                  # Git automation
│   ├── editor/                   # Multi-format code editing with backups
│   ├── memory/                   # Memory providers (in-mem, filesystem, Redis, etc.)
│   ├── cognee/                   # Cognee.ai memory integration
│   ├── context/                  # Hierarchical context management with TTL
│   ├── notification/             # Multi-channel notification engine
│   ├── mcp/                      # Model Context Protocol WebSocket server
│   ├── workflow/                 # Development workflow execution
│   ├── config/                   # Viper-based configuration management
│   ├── event/                    # Pub/sub event bus
│   ├── logging/                  # Structured logging wrapper
│   ├── monitoring/               # Metric collection framework
│   ├── security/                 # Security scanning (stubbed)
│   ├── session/                  # Development session management
│   ├── agent/                    # Agent orchestration
│   ├── project/                  # Project management
│   ├── rules/                    # Rules engine
│   ├── hooks/                    # Hook system
│   ├── focus/                    # Focus chain management
│   ├── template/                 # Template system
│   ├── persistence/              # State persistence
│   ├── deployment/               # Deployment management
│   ├── discovery/                # Service/model discovery
│   ├── hardware/                 # Hardware abstraction
│   ├── repomap/                  # Repository mapping
│   ├── version/                  # Version management
│   ├── fix/                      # Security fix engine
│   ├── performance/              # Performance optimization
│   ├── testutil/                 # Test utilities
│   └── mocks/                    # Shared mocks
│
├── applications/                 # Platform-specific applications
│   ├── desktop/                  # Fyne desktop app
│   ├── terminal-ui/              # tview terminal UI
│   ├── android/                  # Android app
│   ├── ios/                      # iOS app
│   ├── aurora-os/                # Aurora OS client
│   └── harmony-os/               # Harmony OS client
│
├── api/                          # OpenAPI specification
│   └── openapi.yaml              # Full REST API spec (OpenAPI 3.0.3)
│
├── config/                       # Configuration files
│   ├── config.yaml               # Primary application config
│   ├── production-config.yaml    # Enterprise production config
│   ├── minimal-config.yaml       # Minimal test config (DB/Redis disabled)
│   ├── test-config.yaml          # Test-specific config
│   ├── working-config.yaml       # Working variant
│   ├── azure_example.yaml        # Azure-specific example
│   └── model-aliases.example.yaml# Model alias examples
│
├── tests/                        # New test framework
│   ├── e2e/challenges/           # Challenge-based E2E tests
│   └── automation/               # Hardware automation tests
│
├── test/                         # Legacy/parallel test suites
│   ├── integration/              # Integration tests
│   ├── e2e/                      # Legacy E2E tests
│   ├── automation/               # Provider automation tests
│   └── load/                     # Load tests
│
├── benchmarks/                   # Performance benchmarks
├── security/                     # Security tests
├── standalone_tests/             # Standalone CLI tests
├── docker/                       # Docker assets and extended compose
├── scripts/                      # Build and deployment scripts
└── assets/                       # Logo and image assets

Verified Real Implementations

AUTH-001: Authentication System (VERIFIED REAL)

File: internal/auth/auth.go (~470 lines) Assessment: Production-ready

User registration with validation
Password hashing with bcrypt + argon2 fallback
JWT token generation and verification (JWT v4)
Session management with crypto-random tokens
Constant-time comparison for timing attack prevention
Full test coverage in internal/auth/auth_test.go (~777 lines)

DB-001: Database Layer (VERIFIED REAL)

File: internal/database/database.go Assessment: Production-ready

PostgreSQL connection pool via pgx/v5
Full schema initialization (users, workers, tasks, projects, sessions, LLM providers, MCP servers, notifications, audit logs)
DatabaseInterface for testability
Graceful degradation when host is empty

SRV-001: HTTP Server (VERIFIED REAL)

File: internal/server/server.go Assessment: Production-ready

Gin-based server with 50+ routes across /api/v1/
JWT auth middleware, CORS, security headers
WebSocket endpoint for MCP
Health check with DB + Redis validation
Graceful shutdown (30s timeout)

LLM-001: LLM Providers (VERIFIED REAL)

File: internal/llm/ (~5000+ lines across providers) Assessment: Real HTTP clients

AnthropicProvider (~752 lines): Full SSE streaming, prompt caching, extended thinking, tool calls
OpenAIProvider (~431+ lines): Full HTTP API client
ModelManager: Multi-provider orchestration, selection strategy, fallback chain
16 provider subdirectories with real HTTP implementations
Note: The internal/llm/ package is genuine. Bluff areas are at cmd/cli/main.go only.

WRK-001: Worker Pool (VERIFIED REAL)

File: internal/worker/ (~800+ lines) Assessment: Real distributed worker management

WorkerManager: Register, heartbeat, assign tasks, complete tasks
SSH config parsing, capability matching, resource tracking
Health checks with TTL

TSK-001: Task Management (VERIFIED REAL)

File: internal/task/ (~1000+ lines) Assessment: Real task lifecycle

Priority queues, dependency validation, checkpointing
Redis caching with graceful degradation
Retry logic and cleanup

WFL-001: Workflow Engine (VERIFIED REAL)

File: internal/workflow/ (~1100+ lines) Assessment: Real shell execution

Executor dispatches to real exec.CommandContext() calls
Security filtering via isDangerousCommand() (rm, dd, mkfs, fork bombs, etc.)
LLM integration with real LLMRequest
Supports Go, Node, Python, Rust project types

TOO-001: Tools Ecosystem (VERIFIED REAL)

File: internal/tools/ (~2000+ lines) Assessment: Real tool registry

8 categories: filesystem, shell, web, browser, mapping, multiedit, confirmation, notebook
Real chromedp browser automation
Transactional multi-file editing

EDT-001: Code Editor (VERIFIED REAL)

File: internal/editor/ (~600+ lines) Assessment: Real file I/O

Diff, whole-file, search/replace, line-based editors
Automatic file backup with io.Copy
EditApplier / EditValidator interfaces

NOT-001: Notification Engine (VERIFIED REAL)

File: internal/notification/ (~800+ lines) Assessment: Real HTTP/SMTP calls

Slack (webhook HTTP POST), Email (SMTP via net/smtp), Telegram (Bot API), Discord (webhook)
Yandex Messenger (OAuth API), Max (enterprise API)
Rate limiting, retry, queue, metrics

MCP-001: MCP Protocol Server (VERIFIED REAL)

File: internal/mcp/ (~400+ lines) Assessment: Real WebSocket server

gorilla/websocket concurrent session handling
JSON-RPC-like message format
Tool execution dispatch

CFG-001: Configuration Management (VERIFIED REAL)

File: internal/config/ (~1700+ lines) Assessment: Full Viper integration

Environment variable binding (HELIX_*)
Config file search (., $HOME/.helixcode, /etc/helixcode)
Validation rules, default config creation
ConfigManager for load/save/merge

QA-001: HelixQA Integration (VERIFIED REAL)

Files: internal/helixqa/, internal/server/qa_handlers.go, applications/terminal-ui/main.go Assessment: Full embedded QA engine with real session lifecycle

Engine struct manages QA sessions with map + sync.RWMutex
StartSession(), CancelSession(), GetSession(), ListSessions() with real state tracking
REST API: POST /api/v1/qa/session, GET /api/v1/qa/session/:id/status, GET /api/v1/qa/session/:id/report, GET /api/v1/qa/session/:id/screenshot/:name, DELETE /api/v1/qa/session/:id
CLI flags: --qa-run, --qa-list, --qa-report, --qa-screenshot, --qa-cancel
TUI dashboard with session table, stats panel, refresh/cancel actions
Screenshot pipeline: 8 platform engines (Linux, Web, iOS, Android, CLI, TUI, macOS, Windows)
Tests: internal/helixqa/wrapper_test.go, internal/server/qa_handlers_test.go, pkg/screenshot/*_test.go

Verified Bluff & Stub Areas (MUST FIX)

BLUFF-001: LLM Generation is Simulated in Legacy CLI (CRITICAL) — FIXED

File: cmd/cli/main.go lines ~236-284 Evidence: Previously returned fmt.Sprintf("Generated response for: %s...", prompt) without calling any provider. Fix: handleGenerate() now constructs a real llm.LLMRequest with user messages and calls provider.Generate() / provider.GenerateStream(). Errors are propagated to the user if the provider is unavailable. Verification: go build -tags nogui ./cmd/cli/ compiles; provider call is real (returns error if Ollama/etc. is not running). Fix Priority: P0 — RESOLVED

BLUFF-002: Model Listing is Hardcoded in Legacy CLI (CRITICAL) — FIXED

File: cmd/cli/main.go lines ~101-128 Evidence: Previously only 3 hardcoded models. No dynamic discovery. Fix: Replaced with verifier-aware handleListModels() that queries LLMsVerifier adapter first, falls back to provider discovery, then to constitutional FallbackModels (7 models with scores and verification status). Verification: go test -v ./internal/verifier/... passes; go build ./cmd/cli/... compiles. Fix Priority: P0 — RESOLVED

BLUFF-003: Command Execution is Simulated in Legacy CLI (HIGH) — FIXED

File: cmd/cli/main.go lines ~310-324 Evidence: Previously printed the command and slept for 1 second without executing anything. Fix: handleCommand() uses exec.CommandContext(ctx, "sh", "-c", command) with real os.Stdout/os.Stderr redirection. Exit codes are reported. Verification: go build -tags nogui ./cmd/cli/ compiles. Fix Priority: P0 — RESOLVED

STUB-001: Security Scanning is Simulated

File: internal/security/security.go (~132 lines) Evidence: ScanFeature() contains explicit "Simulate security scanning logic" comment. Always returns Success=true, Score=95 with empty issues. Fix Priority: P1

STUB-002: Memory Redis/Memcached Providers Store Locally

File: internal/memory/ (~1800+ lines) Evidence: RedisMemoryProvider and MemcachedMemoryProvider store data in local maps with comments like "Redis client would be used in production." Connection config is parsed but not used. Fix Priority: P2

STUB-003: Security-Test Entry Point is Entirely Simulated

File: cmd/security-test/main.go Evidence: Hardcoded list of 12 simulated security tests. simulateSecurityScan() returns pre-canned issue lists per category. Fix Priority: P2

STUB-004: Several `helix` Subcommands are Print-Only

File: cmd/other_commands.go Evidence: server, generate, test, worker, notify commands are stubbed (print placeholder messages). Fix Priority: P2

STUB-005: Several `helix-config` Subcommands are Placeholders

File: cmd/helix-config/main.go Evidence: Many template/history/schema subcommands print placeholder messages. Fix Priority: P3

BLUFF-004: LLMsVerifier Integration is Stubbed or Bypassed (CRITICAL)

File Pattern: internal/verifier/*.go containing empty structs, // TODO, or methods that return hardcoded data instead of calling the verifier. Evidence:

VerificationService methods return hardcoded VerificationResult{OverallScore: 8.5} instead of querying the verifier database
ModelDiscoveryService returns an empty slice instead of calling provider APIs
The verifier client returns fallback models without attempting a real HTTP call Fix Priority: P0 - Immediate Verification Command:

make test-verifier-integration
# This MUST pass with real verifier data, not mocked scores

BLUFF-005: Provider Discovery Uses Hardcoded Env Var Names (HIGH)

File Pattern: internal/verifier/startup.go or provider adapter files containing hardcoded strings like "OPENAI_API_KEY" without checking SupportedProviders[provider].EnvVars. Fix Priority: P1 - High

BLUFF-006: Model Capabilities Are Hardcoded (HIGH)

File Pattern: internal/llm/*.go containing SupportsToolUse: true as a struct literal for specific models, or Provider.GetCapabilities() returning a static slice. Fix Priority: P1 - High Constitutional Impact: Violates CONST-041 (MCP/LSP/ACP/Embedding/RAG/Skills/Plugins Integration Mandate).

BLUFF-007: Test Claims Integration But Uses Mocked Verifier (CRITICAL)

File Pattern: *_test.go files with testify/mock or testMode: true in non-unit test files. Fix Priority: P0 - Immediate Constitutional Impact: Violates CONST-038 (Model Provider Anti-Bluff Guarantee) and CONST-035 (Zero-Bluff Testing).

BLUFF-008: Scoring Weights Do Not Sum to 1.0 (MEDIUM)

File Pattern: configs/verifier.yaml or internal/verifier/config.go where scoring weights are misconfigured. Fix Priority: P2 - Medium

BLUFF-009: `/metrics` Endpoint Returns Hardcoded Zeros (CRITICAL) — FIXED

File: internal/server/handlers.go lines ~834-855 Evidence: All dynamic metrics (goroutines, memory, database connections) were hardcoded to 0. Fix: getMetrics() now calls runtime.ReadMemStats(), runtime.NumGoroutine(), and s.db.Pool.Stat() to return real values. Fix Priority: P0 — RESOLVED

BLUFF-010: Multi-Edit Conflict Detection is a No-Op (HIGH) — FIXED

File: internal/tools/multiedit/transaction.go lines ~352-369 Evidence: detectFileConflict() always returned nil, nil with comment "For now, we'll assume no conflicts." Fix: Implemented real conflict detection — reads the file from disk, computes SHA-256, and compares against the Checksum field. Returns ConflictModified or ConflictDeleted when appropriate. Fix Priority: P1 — RESOLVED

Configuration Management

Primary Configuration

Main config at config/config.yaml:

server:
  address: "0.0.0.0"
  port: 8080
  read_timeout: 30
  write_timeout: 30
  idle_timeout: 300
  shutdown_timeout: 30

database:
  host: ""          # Empty string disables PostgreSQL
  port: 5432
  user: "helix"
  password: "${HELIX_DATABASE_PASSWORD}"
  dbname: "helixcode_prod"
  sslmode: "disable"

redis:
  host: "redis"
  port: 6379
  password: "${HELIX_REDIS_PASSWORD}"
  db: 0
  enabled: true

auth:
  jwt_secret: "${HELIX_AUTH_JWT_SECRET}"
  token_expiry: 86400
  session_expiry: 604800
  bcrypt_cost: 12

workers:
  health_check_interval: 30
  health_ttl: 120
  max_concurrent_tasks: 10

tasks:
  max_retries: 3
  checkpoint_interval: 300
  cleanup_interval: 3600

llm:
  default_provider: "local"
  max_tokens: 4096
  temperature: 0.7
  timeout: 30
  max_retries: 3
  providers:
    <name>:
      type: <provider-type>
      endpoint: <url>
      enabled: true
      parameters:
        timeout: 30.0
        max_retries: 3
        streaming_support: true
        api_key: ""
  selection:
    strategy: "performance"
    fallback_enabled: true
    health_check_interval: 30

logging:
  level: "info"
  format: "text"
  output: "stdout"

notifications:
  enabled: true
  rules:
    - name: "..."
      condition: "type==error"
      channels: ["slack", "email"]
      priority: urgent
      enabled: true
  channels:
    slack: { enabled, webhook_url, channel, username, timeout }
    telegram: { enabled, bot_token, chat_id, timeout }
    email: { enabled, smtp: { server, port, username, password, tls }, recipients, timeout }
    discord: { enabled, webhook_url, timeout }

Environment Variables

Required for Production:

HELIX_DATABASE_PASSWORD
HELIX_AUTH_JWT_SECRET
HELIX_REDIS_PASSWORD

LLM Provider Keys (as needed):

OPENAI_API_KEY, ANTHROPIC_API_KEY, GEMINI_API_KEY, XAI_API_KEY, DEEPSEEK_API_KEY, GROQ_API_KEY, MISTRAL_API_KEY, COHERE_API_KEY, AZURE_OPENAI_API_KEY, AWS_ACCESS_KEY_ID / AWS_SECRET_ACCESS_KEY

Notification Integrations:

HELIX_SLACK_WEBHOOK_URL
HELIX_TELEGRAM_BOT_TOKEN, HELIX_TELEGRAM_CHAT_ID
HELIX_EMAIL_SMTP_SERVER, HELIX_EMAIL_USERNAME, HELIX_EMAIL_PASSWORD
HELIX_DISCORD_WEBHOOK_URL

Testing Strategy

Test Categories

Unit tests: Mocks allowed, *_test.go, -short flag
Contract tests: Real API schemas, no mocks
Component tests: Real subsystems wired together
Integration tests: Full app with real dependencies (-tags=integration)
E2E challenges: Complete user workflows against real LLM APIs
Security tests: OWASP compliance
Performance tests: Benchmarks
Automation tests: Provider/hardware automation (-tags=automation)
Load tests: Stress testing

Anti-Bluff Testing Rules

Unit tests: Mocks OK
ALL other tests: Real infrastructure ONLY
Every PASS guarantees Quality + Completion + Usability
Challenges fail on simulated/stubbed behavior
No bare t.Skip() without SKIP-OK: #<ticket> marker

Docker Test Infrastructure

docker-compose.test.yml: PostgreSQL 16, Redis 7, Memcached, Cognee, ChromaDB, Qdrant, Ollama, Prometheus, Grafana
docker-compose.full-test.yml: Complete stack with mock-LLM server, Selenium, ChromeDP, SSH server + 3 workers, Cognee, Weaviate, mock-Slack, multicast router

Challenge Framework (`tests/e2e/challenges/`)

The most rigorous test system validates the project by having it generate real projects and testing them:

Challenge Definitions: JSON specs (ASCII art generator, CLI task manager, JSON validator, notes API, tic-tac-toe TUI, URL shortener)
Execution Flow: Load spec → Call real LLM API → Parse generated code → Compile → Test → Runtime validation
Validation Layers: Directory structure, code quality, compilation, testing, functionality, runtime validation with diverse data
Test Matrix: Supports CLI, TUI, REST, WebSocket interfaces across 15+ providers and worker pool distributions

Test Scripts Summary

# Basic
cd <project_root> && make test

# Full infrastructure (recommended for validation)
make test-infra-up
make test-complete
make test-infra-down

# Individual categories
make test-unit-full
make test-integration-full
make test-e2e-full
make test-security-full
make test-load-full

# Legacy scripts
./run_tests.sh --all
./run_all_tests.sh
./run_integration_tests.sh

Docker Deployment

Production (`docker-compose.yml`)

Services: helixcode-server (8080, 2222), postgres:15, redis:7, nginx (80, 443), prometheus (9090), grafana (3000)

Quick Start

cd <project_root>
cp .env.example .env
# Edit .env with secure passwords
docker compose up -d
docker compose ps
curl http://localhost/health

Other Compose Files

File	Purpose
`docker-compose-simple.yml`	Minimal dev (postgres + redis only)
`docker-compose.test.yml`	Integration/E2E testing stack
`docker-compose.full-test.yml`	Zero-skip full test infrastructure
`docker-compose.aurora-os.yml`	Security-focused Aurora OS platform
`docker-compose.harmony-os.yml`	Distributed Harmony OS platform
`docker-compose.specialized-platforms.yml`	Combined Aurora + Harmony
`docker/docker-compose.yml`	Extended full-stack with Milvus, Elasticsearch, MLflow, Jaeger, Jupyter, Portainer

Deployment Patterns

Healthchecks on every service
Docker profiles: monitoring, distributed, with-redis, production, dev, server
Isolated bridge networks per deployment
Named persistent volumes for all stateful services
.env file for secrets

Code Style & Development Conventions

Go Conventions

Standard Go formatting: go fmt ./...
Linting: golangci-lint run ./... (timeout 10m in CI)
Vet: go vet ./...
Table-driven tests with t.Run() subtests
Build tags for integration/automation tests: //go:build integration

Project Conventions

Always work from helix_code/ subdirectory
Generate logo assets before first build: make logo-assets
Database/Redis optional: Disable by setting database.host: ""
Environment variables override config file
Use internal/ for all core packages; no pkg/ directory in active use
Error handling: explicit, no silent failures
Concurrent access: use sync.RWMutex or channel patterns

API Conventions

REST API documented in api/openapi.yaml (OpenAPI 3.0.3)
Base path: /api/v1
Authentication: Bearer JWT via Authorization header
Health endpoint: GET /health (no auth required)

Security Considerations

Verified Security Features

Password hashing: bcrypt (cost 12) with argon2 fallback
JWT with constant-time comparison
CORS middleware, security headers (X-Frame-Options, CSP, HSTS)
Rate limiting support in production config
Session timeout, concurrent session limits, IP binding options
Workflow isDangerousCommand() filter blocks rm, dd, mkfs, fork bombs, etc.
Input validation in auth and server packages

Security Testing

security/security_test.go: OWASP Top 10, SAST, DAST, credential scanning, TLS enforcement, input validation (path traversal, XSS, SQL injection, command injection, SSRF)
File permission checks (0600 for configs)

Known Security Stubs

internal/security/security.go: Simulated scanning (always returns clean)
cmd/security-test/main.go: Entirely simulated security tests

Production Hardening

Use HELIX_AUTH_JWT_SECRET with high entropy
Enable PostgreSQL SSL in production
Enable Redis authentication
Configure CORS allowed_origins explicitly
Enable audit logging
Set bcrypt_cost: 14 in production

Free AI Providers

XAI (Grok): grok-3-fast-beta, grok-3-mini-fast-beta
OpenRouter: Free models from various providers
GitHub Copilot: gpt-4o, claude-3.5-sonnet (with subscription)
Qwen: 2,000 requests/day free tier

Common Issues

Build fails: Run make logo-assets then make build
Database errors: Check HELIX_DATABASE_PASSWORD
Worker SSH failures: Verify SSH key authentication
LLM timeouts: Check provider status and config
Redis connection failures: Check HELIX_REDIS_PASSWORD and redis.enabled
Test skips: Ensure SKIP-OK: #<ticket> marker is present for any intentional skips

Resources & References

Constitution: CONSTITUTION.md
CLAUDE.md: CLAUDE.md
Gap Analysis: HELIXCODE_GAP_ANALYSIS.md
Zero-Bluff Plan: HELIXCODE_ZERO_BLUFF_PLAN.md
Testing Strategy: ANTI_BLUFF_TESTING_STRATEGY.md
OpenAPI Spec: helix_code/api/openapi.yaml
Docker Guide: helix_code/DOCKER_DEPLOYMENT.md

FilesExpand file tree

AGENTS.md

Latest commit

History