Human-agent collaboration and learning-loop protocol.
Jarvis is the protocol for governed collaboration between human workers and agent workers.
It defines how human workers and agent workers coordinate under shared goals and human-defined policy, complete work together, review each other, record contributions, capture evidence, and both improve across WorkSessions.
Jarvis stays focused on the collaboration and learning loop. Hosts and external systems integrate with Jarvis by implementing its protocol contracts.
Jarvis is the compatibility protocol for any HumanWorker, AgentWorker, host, or external system that needs to participate in governed human-agent collaboration.
Open the live simulation here:
https://flow-research.github.io/jarvis/
The page is served directly by GitHub Pages from this repository.
Jarvis is the open protocol that lets HumanWorkers and AgentWorkers collaborate under shared goals and policy, producing durable WorkSessions, reviewable Requests, attributable Contributions, governed shared Learning, and portable Evidence.
Jarvis is how humans and agents work together properly.
The human does not just prompt. The agent does not just answer. They both participate in the work.
The human gives direction, judgment, context, correction, approval, and accountability. The agent plans, executes, researches, drafts, uses tools, collects evidence, and proposes improvements.
Jarvis defines the rules of that collaboration. Hosts and external systems implement those rules.
The winning unit is not the human alone and not the agent alone. The winning unit is the human-agent team that learns together.
The shift is from isolated agent applications to compatible human-agent collaboration. The human does not outsource judgment, and the agent does not remain a passive tool. They form a working pair that gets better through repeated WorkSessions.
The primitive is not:
User -> Agent -> AnswerThe loop is:
HumanWorker + AgentWorker -> WorkSession -> Review -> Evidence -> Shared LearningThe human gets better. The agent gets better. The relationship gets better. WorkSession is the durable record of that collaboration and learning loop.
Jarvis formalizes the loop where:
human judgment + agent execution + policy + review + evidence + shared learningcompound across real work.
The protocol makes this loop portable: different compatible implementations exchange the same WorkSession, Request, Review, Contribution, EvidenceManifest, and LearningRecord concepts without sharing the same infrastructure.
1. Human defines intent.
2. Policy defines boundaries.
3. Agent works inside those boundaries.
4. Agent asks when blocked.
5. Human reviews, approves, denies, narrows, corrects, or takes over.
6. Work continues.
7. Contributions are recorded.
8. Evidence is captured.
9. Learning is proposed.
10. Confirmed learning improves both workers and the next WorkSession.WorkSession is the center of Jarvis.
A WorkSession is not chat history. A WorkSession is the durable record of real human-agent collaboration around a focused unit of work.
It contains:
- objective
- human worker
- agent worker
- policy
- available capabilities
- context
- events
- requests
- reviews
- tool actions
- artifacts
- contributions
- evidence
- learning proposals
- final outcome
Jarvis does not model User + Assistant.
Jarvis models HumanWorker + AgentWorker.
Both are workers. Both are actors. Both contribute to the WorkSession. Both learn from the loop.
The human is:
- goal setter
- domain expert
- reviewer
- teacher
- quality judge
- policy owner
- accountable actor
- source of taste
- source of world context
The agent is:
- autonomous worker
- executor
- researcher
- context retriever
- tool user
- draft producer
- evidence collector
- learning participant
Jarvis v0.1 defines these contracts. The canonical object shapes and invariants are in docs/protocol/11-core-protocol-objects.md.
Worker
Actor
HumanWorker
AgentWorker
WorkSession
JarvisEvent
Policy
PolicyDecision
Request
Review
Takeover
Contribution
EvidenceManifest
LearningRecord
MemoryProposal
SkillProposal- Jarvis defines human-agent collaboration and shared learning.
- WorkSession is the source of truth.
- Jarvis does not prescribe infrastructure.
- Policy governs autonomy.
- Learning is governed.
- Evidence is captured during work.
- Contributions are attributable.
- Human judgment remains central.
- Execution is delegable; accountability remains attributable.
- HumanWorker and AgentWorker both learn.
- Every completed WorkSession improves the next WorkSession.
- Jarvis stays protocol-only; hosts and external systems implement or integrate with it without becoming it.
Jarvis owns:
- collaboration and learning-loop protocol semantics
- interoperability contracts
- conformance rules
- WorkSession lifecycle
- policy-governed autonomy
- request, review, and takeover semantics
- contribution records
- evidence manifests
- governed memory and learning proposals
- skill proposal semantics
- integration boundary contracts
Jarvis does not own host implementation:
- UI
- authentication
- authorization
- storage
- execution
- model providers
- tool execution
- isolation mechanisms
- deployment
- billing
- monitoring
- host-specific workflow
A Jarvis SDK is a protocol implementation kit.
It helps compatible implementations create protocol records, attach required headers, preserve event hash chains, validate Request/Review/Takeover state, export EvidenceManifest records, run conformance fixtures, and map example work into Jarvis records.
A Jarvis SDK does not run agents, orchestrate models, execute tools, own memory engines, provide UI, manage auth, store records, run sandboxes, schedule work, or become a host adapter.
Existing agents remain first-class. Jarvis succeeds when existing agents and hosts produce compatible WorkSession, Request, Review, Contribution, EvidenceManifest, and LearningRecord records without being rewritten as Jarvis-owned agents.
1. Create HumanWorker.
2. Create AgentWorker.
3. Start WorkSession.
4. Attach Policy.
5. Send objective.
6. AgentWorker acts inside policy.
7. AgentWorker hits blocked action.
8. AgentWorker creates Request.
9. HumanWorker approves, denies, narrows, answers, or takes over.
10. AgentWorker resumes.
11. Contribution is recorded.
12. Evidence is captured.
13. Learning is proposed.
14. Human confirms or rejects Learning.
15. EvidenceManifest exports.When v0.1 proves this loop, Jarvis is real.
- docs/README.md - docs index.
- docs/protocol/ - protocol definition, architecture, object model, policy, memory, evidence, integration boundaries, package contracts, v0.1 protocol proof, protocol lock, OpenAPI binding, and positioning lock.
- docs/architecture_brief/ - shareable protocol architecture brief and PDF.
- docs/conformance/ - compatibility mapping, conformance entries, fixtures, validator requirements, and existing-agent proof plan.
- docs/planning/ - roadmap, 30-day plan, and upload-ready planning sheets.
- docs/reviews/ - protocol readiness and acceptance review criteria.
Every protocol PR MUST run:
python3 scripts/check_conformance_fixtures.py
python3 scripts/check_openapi_skeleton.py
python3 scripts/check_markdown_links.py
python3 scripts/check_week1_wording.py
git diff --checkJarvis defines check_conformance_fixtures.py as fixture-record validation only.
The validator MUST NOT execute host behavior.