agents-go

A Go port of openai-agents-python (tracking v0.17.4). Build agents that call tools, hand off to one another, enforce guardrails, stream events, persist sessions, pause for human approval, and emit traces — all with idiomatic Go APIs.

Documentation — mirrors the Python SDK docs structure, including a full comparison with the Python SDK.

Install

go get github.com/zzir/agents-go

Requires Go 1.26+.

Quick start

package main

import (
	"context"
	"fmt"

	agents "github.com/zzir/agents-go/agents"
	"github.com/zzir/agents-go/models/openai"
)

func main() {
	agent := &agents.Agent{
		Name:         "assistant",
		Instructions: agents.StaticInstructions("You are a helpful assistant."),
		Model:        "gpt-4o",
	}

	res, err := agents.Run(context.Background(), agent, "Hello!", agents.RunOptions{
		ModelProvider: openai.NewProvider(), // reads OPENAI_API_KEY
	})
	if err != nil {
		panic(err)
	}
	fmt.Println(res.FinalOutputString())
}

Features

Capability	API
Agents	`agents.Agent{...}`
Run (blocking)	`agents.Run(ctx, agent, input, opts)`
Streaming	`agents.RunStreamed(...)` → `Events()` iterator
Function tools	`agents.NewFunctionTool[Args, Result](name, desc, fn)`
Structured output	`agents.OutputType[T]()`
Multimodal tool output	`agents.ToolOutputText/ToolOutputImage/ToolOutputFile` (tool returns native image/file input)
Handoffs	`agents.HandoffTo(targetAgent)`
Agent as tool	`agent.AsTool(agents.AgentToolConfig{...})`
Guardrails	`InputGuardrails`, `OutputGuardrails`, tool-level guardrails
Sessions	`agents.Session`, `InMemorySession`, `memory.FileSession`, `sessions` (SQLite/Postgres), `openai.ConversationsSession` (server-side), `openai.CompactionSession` (auto-summarize)
Server-side state	`RunOptions.UsePreviousResponseID` / `RunOptions.ConversationID`
Stored prompts	`Agent.Prompt = agents.StaticPrompt(...)` / `agents.PromptFunc(...)` (OpenAI stored prompt)
Human-in-the-loop	`tool.NeedsApproval`, `RunState.Approve/Reject`, `agents.ResumeRun`
Tracing	`tracing.NewTracer`, `tracing.NewBatchProcessor`
MCP	`mcp.NewStdioServer / NewStreamableHTTPServer / NewSSEServer`
Web search	`bravesearch.New(bravesearch.Options{...})` (Brave Search API)
File editing	`editor.NewTools(dir)` (str_replace editor, `os.Root`-confined)
Retry / fallback	`agents.NewRetryModel(...)`, `agents.NewFallbackModel(...)`
Multi-provider routing	`agents.NewRouterProvider(...)` (per-agent backend by name)
Skills	`skills.Load / RenderIndex / ReadFileTool` (Agent Skills `SKILL.md` format)

Tools

A function tool is a typed Go function. The argument struct is reflected into a JSON schema (with strict-mode normalization) shown to the model:

type weatherArgs struct {
	City string `json:"city" jsonschema:"the city"`
}

getWeather := agents.NewFunctionTool("get_weather", "Look up the weather.",
	func(ctx context.Context, tc *agents.ToolContext, args weatherArgs) (string, error) {
		return "sunny in " + args.City, nil
	})

agent := &agents.Agent{Name: "bot", Model: "gpt-4o", Tools: []agents.Tool{getWeather}}

Structured output

type Sentiment struct {
	Label string `json:"label"`
	Score int    `json:"score"`
}

agent := &agents.Agent{Name: "classifier", Model: "gpt-4o", OutputType: agents.OutputType[Sentiment]()}
res, _ := agents.Run(ctx, agent, "I love this!", opts)
s, _ := agents.FinalOutputAs[Sentiment](res)

Streaming

sr := agents.RunStreamed(ctx, agent, "tell me a story", opts)
for event, err := range sr.Events() {
	if err != nil { panic(err) }
	if e, ok := event.(*agents.RunItemStreamEvent); ok {
		if msg, ok := e.Item.(*agents.MessageOutputItem); ok {
			fmt.Println(msg.Text())
		}
	}
}
res, _ := sr.FinalResult()

Human-in-the-loop

tool.NeedsApproval = true

res, _ := agents.Run(ctx, agent, "delete everything", opts)
for len(res.Interruptions) > 0 {
	for _, item := range res.Interruptions {
		res.State.Approve(item, false) // or res.State.Reject(item, false, "no")
	}
	res, _ = agents.ResumeRun(ctx, res.State, opts)
}

The paused state serializes to JSON (res.State.MarshalJSON()) and rebuilds with agents.RunStateFromJSON(data, registry) for cross-process approval flows.

Sessions

sess, _ := memory.NewFileSession("sessions", "user-123") // sessions/user-123.jsonl
agents.Run(ctx, agent, "remember my name is Ada", agents.RunOptions{Session: sess, ModelProvider: p})

History is stored as JSONL with zero external dependencies. Implement the agents.Session interface yourself to back it with a database instead.

Tracing

exporter := tracing.NewConsoleExporter(os.Stdout)
proc := tracing.NewBatchProcessor(exporter, tracing.BatchProcessorOptions{})
defer proc.Shutdown(context.Background())

agents.Run(ctx, agent, "hi", agents.RunOptions{Tracer: tracing.NewTracer(proc), ModelProvider: p})

MCP

server, _ := mcp.NewStdioServer(ctx, "fs",
	exec.Command("npx", "-y", "@modelcontextprotocol/server-filesystem", "/tmp"), mcp.Options{})
defer server.Close()

agent := &agents.Agent{Name: "a", Model: "gpt-4o", MCPServers: []agents.MCPServer{server}}

Sandbox (code execution)

Run untrusted, agent-generated code in an isolated environment and expose it as a tool. Backends (Docker, Kubernetes Jobs) are separate modules so the core stays dependency-light — install only the one you use:

import (
	"github.com/zzir/agents-go/sandbox"
	"github.com/zzir/agents-go/sandbox/docker" // go get github.com/zzir/agents-go/sandbox/docker
)

sb, _ := docker.New(docker.Options{Image: "python:3.12-slim",
	Limits: sandbox.Limits{MemoryBytes: 256 << 20, CPUs: 0.5}})
defer sb.Close()

agent := &agents.Agent{Name: "coder", Model: "gpt-4o",
	Tools: []agents.Tool{sandbox.CodeTool(sb, sandbox.CodeToolConfig{Name: "run_python"})}}

Both backends default to: no network, read-only root fs, dropped capabilities, non-root user, and CPU/memory/PID/time limits. The Kubernetes backend (.../sandbox/k8s) additionally runs each call as a one-shot Job with no service account token. sandbox.NewLocal() runs on the host without isolation — for trusted dev/tests only.

Packages

Core module path: github.com/zzir/agents-go.

.../agents — core: agents, runner, tools, guardrails, sessions, HITL, tracing hooks.
.../models/openai — OpenAI Responses API model provider (built on openai-go v3).
.../memory — FileSession (JSONL file store, zero dependencies).
.../tracing — traces, spans, processors and exporters.
.../mcp — Model Context Protocol client.
.../sandbox — Sandbox interface + CodeTool + local backend.
.../sandbox/docker, .../sandbox/k8s — separate modules with the Docker / Kubernetes backends.
.../examples — runnable examples (hello, tools, handoffs, streaming, hitl, sandbox).

Examples

export OPENAI_API_KEY=sk-...
go run ./examples/hello
go run ./examples/tools
go run ./examples/handoffs
go run ./examples/streaming
go run ./examples/hitl
go run ./examples/toolimage      # a tool returns a generated image to the model
go run ./examples/conversations  # server-side history via the Conversations API
go run ./examples/compaction      # auto-summarize long history via responses.compact
OPENAI_PROMPT_ID=pmpt_... go run ./examples/prompt  # drive an agent from a stored prompt
go run ./examples/sandbox   # writes & runs Python in a sandbox (host needs python3)

# isolated backends live in their own modules:
(cd sandbox/docker && OPENAI_API_KEY=$OPENAI_API_KEY go run ./example)  # needs Docker
(cd sandbox/k8s    && OPENAI_API_KEY=$OPENAI_API_KEY go run ./example)  # needs a cluster

Design notes

The core lives in a single agents/ package. The original plan split it further into tools/, outputs/ and models/, but in Go those would form an import cycle with the core (tool callbacks reference RunContext; the Model interface references Tool), so they are kept together in agents/. Provider, storage, tracing and MCP implementations live in subpackages that import agents. Items use the openai-go Responses types as the wire format, mirroring how the Python SDK reuses the OpenAI SDK types.

License

MIT

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

agents-go

Install

Quick start

Features

Tools

Structured output

Streaming

Human-in-the-loop

Sessions

Tracing

MCP

Sandbox (code execution)

Packages

Examples

Design notes

License

About

Uh oh!

Releases

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 23 Commits
.github/workflows		.github/workflows
agents		agents
docs		docs
examples		examples
mcp		mcp
memory		memory
models/openai		models/openai
sandbox		sandbox
scripts		scripts
sessions		sessions
skills		skills
tools		tools
tracing		tracing
.gitignore		.gitignore
.golangci.yml		.golangci.yml
CLAUDE.md		CLAUDE.md
LICENSE		LICENSE
README.md		README.md
go.mod		go.mod
go.sum		go.sum

Folders and files

Latest commit

History

Repository files navigation

agents-go

Install

Quick start

Features

Tools

Structured output

Streaming

Human-in-the-loop

Sessions

Tracing

MCP

Sandbox (code execution)

Packages

Examples

Design notes

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Uh oh!

Contributors

Uh oh!

Languages