Brain Agent Platform

Experimental runtime for safe tool-using agents.

A modular agent runtime for experimenting with tool-using LLM agents: ReAct-style execution, sandboxed tool calls, model routing, browser automation, memory services, and multi-service orchestration.

graph TD
    User["User / Client"]

    User --> Gateway["API Gateway"]
    Gateway --> Runtime["Agent Runtime\n(ReAct Loop)"]

    Runtime --> Sandbox["Tool Sandbox"]
    Runtime --> Router["Model Router"]
    Runtime --> Browser["Browser Automation"]
    Runtime --> Memory["Memory Service"]

    subgraph Security Layer
        Sandbox
    end

    Sandbox --> FS["File System"]
    Sandbox --> Shell["Shell"]
    Sandbox --> HTTP["HTTP"]

    Router --> Gemini["Gemini"]
    Router --> Anthropic["Anthropic"]
    Router --> OpenAI["OpenAI"]

    Memory --> VectorStore["Vector Store"]
    Memory --> History["Conversation History"]
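
The "Agent Runtime (ReAct Loop)" node in the diagram can be illustrated with a minimal sketch. This is not the project's actual runtime code: `runAgent`, `callModel`, the `tools` table, and the action shape (`{ type: "tool" | "final", ... }`) are all hypothetical names chosen for illustration. It only shows the general ReAct pattern of alternating model decisions with tool observations under a step budget.

```javascript
// Minimal ReAct-style loop (illustrative sketch, not the project's runtime).
// Hypothetical contract: callModel(history) resolves to either
//   { type: "final", answer }  or  { type: "tool", name, input }.
async function runAgent(task, { callModel, tools, maxSteps = 5 }) {
  const history = [{ role: "user", content: task }];

  for (let step = 0; step < maxSteps; step++) {
    const action = await callModel(history);

    if (action.type === "final") {
      return { answer: action.answer, steps: step + 1, history };
    }

    // Unknown tools become an observation instead of a crash, so the
    // model gets a chance to recover on the next step.
    const observation = Object.hasOwn(tools, action.name)
      ? await tools[action.name](action.input)
      : `error: unknown tool "${action.name}"`;

    history.push({ role: "assistant", content: JSON.stringify(action) });
    history.push({ role: "tool", content: String(observation) });
  }

  // The step budget bounds cost and prevents runaway loops.
  throw new Error(`step budget of ${maxSteps} exhausted`);
}
```

In a real runtime, `callModel` would go through the model router and each tool would be invoked through the sandbox rather than called directly.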

Architecture

brain-agent-platform/
├── gateway/                   # HTTP + WebSocket gateway, session routing, rate limiting
├── runtime/                   # ReAct agent loop, model routing, cost tracking
├── enterprise-runtime/        # Higher-level cognitive modules and adaptive routing experiments
├── agents/                    # Critic, decomposer, repair, and strategy modules
├── core/                      # Shared runtime primitives: budget, state, executor, etc.
├── sandbox/                   # Tool execution sandbox: filesystem, shell, git, browser-related tools
├── browser-relay/             # Playwright browser automation service
├── desktop-relay/             # Desktop control and screenshot/vision relay
├── brain-os/                  # Local OS daemon for process/window/filesystem control
├── os-control/                # OS control abstraction layer
├── vision-service/            # Optional Python vision pipeline
├── cron-service/              # Scheduled task service
├── memory-service/            # Memory management API
├── messaging-service/         # Messaging gateway experiments
├── device-relay/              # Camera/screen/location relay experiments
├── ui/                        # React web chat interface
├── shared/                    # Shared config, logging, and error utilities
├── tools/                     # Tool implementations
├── validators/                # Goal checking, quality checks, schemas
├── protocols/                 # LLM action schemas
├── tests/                     # Unit/integration/e2e tests
├── scripts/                   # Smoke tests and utilities
└── docs/                      # Runbooks, component maps, integration notes

Core services

| Service | Port | Purpose |
|---|---|---|
| Gateway | 4000 | HTTP/WebSocket entry point, session routing, rate limiting |
| Runtime | 3001 | Agent loop, model routing, tool orchestration, cost tracking |
| Browser Relay | 3003 | Playwright browser automation pool |
| Desktop Relay | 4003 | Screenshot, input, and optional vision integration |
| Brain OS Daemon | 8787 | Local process/window/filesystem control behind approval policy |
| Vision Service | 5100 | Optional object detection / vision pipeline |
| Cron Service | 4004 | Scheduled tasks |
| UI | 5173 | React web chat interface |
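
When scripting against a local setup, it can help to keep these ports in one place. The sketch below copies the port numbers from the Core services table; the `SERVICES` map, the `serviceUrl` helper, the `localhost` default, and the `/status` path in the usage lines are assumptions for illustration, not part of the project's API.

```javascript
// Port numbers are taken from the Core services table.
// The map keys and the helper itself are hypothetical.
const SERVICES = Object.freeze({
  gateway: 4000,
  runtime: 3001,
  browserRelay: 3003,
  desktopRelay: 4003,
  brainOsDaemon: 8787,
  visionService: 5100,
  cronService: 4004,
  ui: 5173,
});

// Build a base URL for a service, failing loudly on unknown names.
function serviceUrl(name, path = "/", host = "localhost") {
  if (!Object.hasOwn(SERVICES, name)) {
    throw new Error(`unknown service: ${name}`);
  }
  return new URL(path, `http://${host}:${SERVICES[name]}`).toString();
}

console.log(serviceUrl("gateway"));            // http://localhost:4000/
console.log(serviceUrl("runtime", "/status")); // http://localhost:3001/status
```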

Quick start

Requirements:

Node.js 18+
npm
Python 3.8+ for optional vision-service components

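# Each server below is expected to stay in the foreground,
# so run each long-running command in its own terminal.

# 1. Start OS daemon
cd brain-os
npm install
npm run build
BRAIN_APPROVAL_TOKEN=DEV-OK npm start

# 2. Start runtime and gateway (from the repository root)
npm install
node runtime/server.js
node gateway/server.js

# 3. Run smoke test (from the repository root, once the services are up)
BRAIN_APPROVAL_TOKEN=DEV-OK ./scripts/e2e_smoke.sh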

See docs/RUN_LOCAL.md for the full local setup.

Testing

npm test
npm run test:unit
npm run test:integration
BRAIN_APPROVAL_TOKEN=DEV-OK ./scripts/e2e_smoke.sh

The most important tests to keep strong are:

sandbox permission boundaries;
tool execution safety;
gateway/runtime integration;
model routing behavior;
end-to-end agent task execution;
failure recovery and retry paths.
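
As an illustration of what a sandbox permission-boundary check might assert, here is a small sketch of a path-containment helper. `resolveInSandbox` is a hypothetical name, not the sandbox module's real API, and the check is lexical only: it does not resolve symlinks, which a real sandbox would also need to handle.

```javascript
const path = require("node:path");

// Resolve a tool-supplied path against the sandbox root and reject anything
// that lands outside it. Hypothetical helper; lexical check only.
function resolveInSandbox(root, requested) {
  const absRoot = path.resolve(root);
  const target = path.resolve(absRoot, requested);
  const rel = path.relative(absRoot, target);

  const escapes =
    rel === ".." ||
    rel.startsWith(".." + path.sep) ||
    path.isAbsolute(rel); // e.g. a different drive on Windows

  if (escapes) {
    throw new Error(`path escapes sandbox: ${requested}`);
  }
  return target;
}
```

A unit test for this boundary would typically cover a plain relative path (allowed), `../` traversal, an absolute path outside the root, and a path that climbs out after descending (all rejected).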

Security model

The project is designed around the assumption that tool-using agents need explicit execution boundaries.

Key design goals:

require approval tokens for sensitive local actions;
isolate service responsibilities;
keep filesystem, shell, browser, and OS-control tools behind sandbox modules;
make tool execution auditable;
treat destructive operations as policy-gated actions;
separate runtime orchestration from tool execution.
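
The approval-token and auditability goals above can be sketched roughly as follows. This is an illustrative pattern, not the project's implementation: `createPolicyGate`, the action names, and the audit record shape are hypothetical. Only `createHash` and `timingSafeEqual` are real Node.js `crypto` functions.

```javascript
const { createHash, timingSafeEqual } = require("node:crypto");

// Hypothetical policy gate: sensitive actions need a matching approval token,
// and every decision (allowed or denied) is appended to an audit log.
function createPolicyGate({ approvalToken, sensitiveActions, auditLog = [] }) {
  if (typeof approvalToken !== "string" || approvalToken === "") {
    throw new Error("approvalToken must be a non-empty string");
  }
  const sensitive = new Set(sensitiveActions);
  const expected = createHash("sha256").update(approvalToken).digest();

  function tokenMatches(candidate) {
    // Hash the candidate so timingSafeEqual always compares equal-length buffers.
    const value = typeof candidate === "string" ? candidate : "";
    const actual = createHash("sha256").update(value).digest();
    return timingSafeEqual(actual, expected);
  }

  function execute(action, handler, { token } = {}) {
    const allowed = !sensitive.has(action) || tokenMatches(token);
    auditLog.push({ at: new Date().toISOString(), action, allowed });
    if (!allowed) {
      throw new Error(`approval required for "${action}"`);
    }
    return handler();
  }

  return { execute, auditLog };
}
```

Recording the denial before throwing keeps rejected attempts visible in the audit trail, which is often where the interesting events are.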

This is not presented as production-hardened security software. It is a working research/prototype platform for studying agent runtime design and safer tool execution patterns.

Current limitations

The lint, format, and docs scripts are echo-only stubs and need real implementations.
The platform is a prototype. Security boundaries are designed but not production-hardened.
Browser relay requires local Playwright installation.
No production authentication or multi-tenant support.
Cost tracking is experimental.
Enterprise runtime modules are early-stage experiments.

Roadmap

~~Add CI status badge and GitHub Actions workflow.~~ ✅
~~Add architecture diagram.~~ ✅
Add example tasks for model routing and browser automation.
Expand unit tests around sandbox and policy enforcement.

Documentation

docs/DESIGN.md — architecture decisions, trade-offs, and lessons learned
docs/RUN_LOCAL.md — local setup and runbook
docs/COMPONENT_MAP.md — component map
docs/INTEGRATION_OS_DAEMON.md — OS daemon integration notes

What this project demonstrates

Agent runtime architecture with tool execution boundaries.
Sandboxed tool use with policy-gated actions.
Multi-service orchestration (gateway, runtime, browser, desktop, memory, cron).
Local-first development with auditable tool execution.

License

MIT
