A modular, self-evolving ecosystem of AI agent Skills, Workflows, and Domain Pipelines for autonomous scientific research.
π Language / θ¨θͺ / θ―θ¨: English Β· ζ₯ζ¬θͺ Β· δΈζ
The AROS Pipeline Factory is the canonical source-of-truth for AROS (Antigravity Research OS) β an operating system for AI-augmented scientific research. This repository forges, audits, and deploys high-fidelity AI agent assets: Skills, Knowledge Items (KIs), Policies, and Workflows.
Think of it as an assembly line for scientific intelligence: raw cognitive workflows are encoded as structured AI agent components, version-controlled, governed, and deployed to the AROS runtime β where they power autonomous grant writing, manuscript drafting, literature mining, and research operations.
| Metric | Count |
|---|---|
| π¬ Domain Pipelines | 12 |
| π οΈ Agent Skills | 61+ |
| π Workflows | 15 |
| π Knowledge Items (KIs) | 49+ |
| π₯οΈ Platform Support | Linux Β· macOS Β· Windows |
The most fundamental constraint on artificial intelligence is not compute power, training data, or architecture β it is the absence of a body.
A computer algorithm, on its own, cannot generate true randomness. Software relies on Pseudo-Random Number Generators (PRNGs) β deterministic mathematical formulas that merely simulate stochasticity. While modern hardware can harvest entropy from physical phenomena (thermal noise, quantum decay), it must be deliberately engineered to do so. A carbon-based organism, by contrast, is natively immersed in thermodynamic stochasticity at every scale.
Each cell of our body contains massive, exquisitely sensitive chemical reaction chains β protein folding, ion channel gating, mechanotransduction β that operate under deterministic chaos with extreme sensitivity to initial conditions. Unlike an AI that processes the world in discrete, digitized, and compressed steps (tokens, pixels, rigid clock cycles), biological intelligence operates in a continuous analog flow. Our bodies do not merely house our brains; they are part of the computing apparatus. This chemical sensitivity smoothly and continuously shapes our behavior and our thoughts in ways that no discretized digital system can replicate.
Because carbon-based life is intimately coupled with the infinite resolution of the physical world, evolution acts as an optimizer that exploits every possible energetic and chemical niche. In the natural ecosystems of Earth, almost all survival strategies are present β occupying almost every conceivable combination of ecological niche. An AI running on a server rack has no continuous physical environment to adapt to, no thermodynamics to balance, and no evolutionary pressure to optimize its physical existence. It knows only the digitized abstractions of reality we feed it.
This embodiment gap dictates that AI serves as a powerful pattern-matching engine to augment β not replace β the incredibly complex, chemically embedded intelligence of the human researcher.
Recent scientific discoveries continue to reveal how deeply biological intelligence is embedded in the physical world β capabilities that remain fundamentally beyond the reach of disembodied AI systems:
-
Remote Touch ("Seventh Sense"): Research published in 2025 by Queen Mary University of London and UCL demonstrated that humans possess a previously unrecognized sensory ability: remote touch β the ability to detect buried objects through granular material without direct physical contact, much like sandpipers sensing prey through sand (Hammoud et al., IEEE ICDL 2025). Human participants achieved ~70.7% precision, approaching the theoretical physical threshold. A robotic tactile sensor trained with machine learning on the same task achieved only ~40% precision with far more false positives. Our biological hardware detects physical signals that engineered sensors struggle to match.
-
Stochastic Resonance in Biological Neurons: Neurons exploit a phenomenon called stochastic resonance, where background neural noise enhances the detection of weak, subthreshold signals that would otherwise be missed. Evolution has tuned our neural circuits to leverage thermodynamic noise as a feature, not a bug β allowing organisms to detect faint environmental cues (vibrations, electrical fields, chemical gradients) critical for survival. While engineers can deliberately introduce controlled noise into digital systems to mimic this effect, biological nervous systems perform it natively and continuously across billions of synapses in parallel.
-
The Gut-Brain Axis and Interoception: A 2025 discovery identified a "neurobiotic sense" β specialized neuropod cells in the gut that detect microbial proteins (such as flagellin) and send real-time signals to the brain, directly influencing appetite, mood, and decision-making. Recent research further demonstrates that interoceptive signals (cardiorespiratory, gastric) directly influence neural state transitions during perceptual decision-making, providing a mechanistic basis for Antonio Damasio's Somatic Marker Hypothesis. Our "gut feelings" are not metaphors β they are measurable biological computations that fundamentally shape cognition.
-
Thermodynamic Efficiency: Research in biophysics has shown that biological systems operate remarkably close to Landauer's theoretical limit β the minimum thermodynamic energy cost for information processing. Processes like protein translation outperform modern supercomputers by several orders of magnitude in free energy expended per operation. Even simulating the full molecular kinetics of a single Mycoplasma genitalium bacterium (~500 proteins) during its doubling time remains a massive undertaking for our most powerful computers.
These findings collectively demonstrate that biological intelligence is not a mere algorithm running on organic hardware. It is an emergent property of deep, continuous physical coupling with the thermodynamic reality of the universe β a coupling that no digital system currently possesses.
Because of this fundamental embodiment gap, carbon-based and silicon-based intelligences are destined to operate in a state of synergistic co-evolution for the foreseeable future.
Since AI became capable of fluently handling the majority of routine cognitive labor, the most effective researchers and professionals have undergone a profound shift: they have learned to deeply introspect and decompose their daily intellectual activities into discrete, hierarchical tiers. Never before in human history have we been compelled to examine, break down, and categorize our own cognitive processes with such depth, breadth, and scale.
This is the core mission of the AROS Project: to provide an ecosystem that helps humans meticulously examine, decompose, and precisely describe their cognitive workflows through continuous interaction with AI agents. AROS captures human feedback, corrections, and decision-making logic at every step β enabling the system's prompts, policies, and skills to continuously self-evolve.
This continuous human-AI interaction cycle reveals a deeper, previously underappreciated mechanism: humans are the entropy injection channel for AI systems.
A language model operating in isolation is, at its core, a deterministic statistical process β a highly sophisticated PRNG that samples from learned probability distributions over tokens. It has no genuine connection to the stochastic, thermodynamically coupled physical world described in the preceding sections. However, each time a human crafts, corrects, or refines a prompt, they are not merely transferring information β they are sampling from their own embodied probability distribution. The unexpected re-phrasings, the intuitive dissatisfaction with a technically correct but contextually wrong output, the creative leaps that arise from a researcher's "gut feeling" β all of these are signals generated by the biological machinery described above: by stochastic resonance in neural circuits, by interoceptive signals from the gut-brain axis, by the thermodynamically grounded analog processing of the human body.
This transforms prompt engineering from a mere "user interface skill" into something far more fundamental: a channel through which genuine physical-world randomness and embodied experience are injected into an otherwise closed, deterministic system. The human, mediating between physical reality and the AI's statistical world model, acts as a transducer β converting the continuous, analog thermodynamic flux of lived experience into discrete, symbolic corrections that progressively reshape the AI's operational context.
This perspective is supported by converging evidence from cognitive science and AI research. Michael Polanyi's foundational insight β "we can know more than we can tell" β identifies the core phenomenon: human experts possess tacit knowledge, intuition derived from embodied physical-world experience that cannot be fully articulated in explicit rules. Classic AI faced Polanyi's Paradox as an insurmountable barrier, because machines required explicit propositional inputs. Modern human-AI collaboration resolves this paradox not by eliminating it, but by exploiting the iterative correction loop as an externalization mechanism: when a researcher corrects an AI's output, they are externalizing tacit knowledge they could not have stated directly, crystallizing it into a refined prompt, a corrected policy, or an improved skill definition. Research into the Prompt Cognition Loop (Mental Modeling β Semantic Projection β Dialogic Feedback β Intent Refinement) demonstrates that iterative prompting is a fundamentally reflective cognitive practice β it forces humans to surface and structure their own implicit models in ways that static text or code authorship does not require.
Furthermore, this framework maps directly onto Karl Friston's Free Energy Principle: intelligent systems minimize prediction error (free energy) by updating their generative models or acting on the world. When an AI's output diverges from a human researcher's expectation β an expectation grounded in their embodied, physically coupled world model β the human correction constitutes an error signal derived not from abstract logical rules, but from the thermodynamic reality of their lived experience. AROS systematically captures these error signals and translates them into lasting improvements to the system's prompts, policies, and skills. Each iteration is, in effect, a step of embodied gradient descent on the AI's operational world model.
This reframes what AROS is, at its deepest level. It is not merely a productivity tool or a workflow automation framework. AROS is a physical-world grounding transfer system β a platform designed to progressively load an AI's operational context with the accumulated residue of human embodied experience, crystallized and made persistent through the discipline of prompt engineering.
Every Skill, Policy, Knowledge Item, and Workflow in this repository is a crystallization of this process. They are not abstract logical specifications written from scratch. They are the distilled record of thousands of human-AI interaction cycles β each cycle contributing a small quantum of embodied randomness, a fragment of tacit knowledge, a correction grounded in physical-world coupling β accumulated and refined over time into stable, reusable cognitive artifacts.
As these artifacts are deployed and refined across the AROS ecosystem, the AI's operational context becomes progressively richer with human physical-world grounding. The AI gains not a body β that remains the irreducible advantage of carbon-based life β but it gains access to a systematically curated library of the residue of embodied intelligence. The gap does not close. But the bridge grows stronger with every human-AI interaction cycle.
The principle is therefore both simple and profound: as long as we can successfully decompose our complex, embodied intelligence into highly detailed, step-by-step cognitive maps β and inject the stochastic, physically grounded corrections that only embodied beings can provide β we can outsource the execution of that decomposed intelligence to AI systems. The intelligence that generates the decomposition β the embodied, thermodynamically coupled, analog human mind β remains irreplaceable. The execution of the decomposed steps, enriched by the accumulated grounding of human embodied experience, is where AI excels.
The translation of embodied human experience into stable AI capabilities is not a siloed process. As demonstrated by recent research in Memory Transfer Learning (MTL) (Kangsan Kim et al., arXiv:2604.14004), the most effective way for coding agents to self-evolve is by harnessing a unified memory pool across heterogeneous domains. Rather than transferring rigid, low-level task traces (which often cause negative transfer due to excessive specificity), MTL proves that high-level abstract insights and meta-knowledge (such as validation routines and problem-solving architectures) generalize across domains.
In the AROS ecosystem, this is exactly what our shared asset layer (Skills, KIs, Policies) achieves. By extracting the "meta-knowledge" from a specific human-AI correction in one domain (e.g., bioinformatics) and distilling it into an abstract, reusable policy or workflow, we enable cross-domain memory transfer. The human acts as the entropy injector to create the initial insight, and the AROS Memory Transfer Learning architecture ensures that this insight elevates the reasoning capacity of the entire ecosystem.
The factory is organized into independent domain pipelines, each governing a specialized field of AI-assisted research. All pipelines draw from a shared asset layer governed by the Shared Asset Management System (SAMS).
AROS Pipeline Factory
β
βββ 00.RawData/ β Central registries & experiment indices
β βββ PIPELINE_REGISTRY.md β Pipeline catalog
β βββ SHARED_ASSET_REGISTRY.md β β οΈ SUPREME: Cross-pipeline shared asset registry
β
βββ 01.Shared_Assets/ β Canonical shared KIs, Policies, Skills, Scripts
β βββ KIs/ β Shared Knowledge Items
β βββ Policies/ β Factory-wide governance policies
β βββ Skills/ β Cross-pipeline utility skills
β βββ Scripts/ β deploy_to_aros.sh, audit_shared_assets.py
β
βββ Grant_Write_Pipeline/ β Universal grant writing (NIH, JSPS, ERCβ¦)
βββ KAKENHI_Pipeline/ β JSPS KAKENHI lifecycle & reporting
βββ Manuscript_Write_Pipeline/ β Dual-agent manuscript authoring & review
βββ Bioinformatics_Pipeline/ β Genomic & proteomic analysis
βββ Data_Analysis_Pipeline/ β Statistical modeling & visualization
βββ Software_Engineering_Pipeline/β Code generation & validation
βββ System_Admin_Pipeline/ β Environment & infrastructure management
βββ UI_Development_Pipeline/ β Web UI & agent interface design
βββ Writing_Publishing_Pipeline/ β Academic publishing & communication
βββ Web_Scraping_API_Pipeline/ β Data acquisition & API integration
βββ Project_Management_Pipeline/ β Orchestration & task management
βββ workspace_management/ β Global workflows & onboarding
β
βββ AGENTS.md β AI agent operational laws (read this first!)
βββ SPEC.md β Architectural specification
βββ README.md β This document
Assets from this factory are deployed to the live AROS runtime via the canonical deployment script:
AROS_Pipeline_Factory/ AROS Runtime (~/.gemini/)
βββ */Skills/<skill>/ βββββββΊ skills/<skill>/SKILL.md
βββ */KIs/<ki>/ βββββββΊ antigravity/knowledge/<ki>/
βββ */Policies/*.md βββββββΊ antigravity/policies/
βββ */Workflows/*.md βββββββΊ antigravity/global_workflows/
Deploy Command:
bash 01.Shared_Assets/Scripts/deploy_to_aros.sh
- Git, Python 3.10+, Antigravity IDE (for full agent integration)
- Conda environment:
aros-base(see01.Shared_Assets/Environments/)
git clone https://github.com/LabOnoM/AROS_Pipeline_Factory.git
cd AROS_Pipeline_Factorycat 00.RawData/PIPELINE_REGISTRY.md# Dry run first (preview without modifying)
bash 01.Shared_Assets/Scripts/deploy_to_aros.sh --dry-run
# Full deployment
bash 01.Shared_Assets/Scripts/deploy_to_aros.shAfter deployment, the assets are automatically indexed by the antigravity-brain MCP server. You can verify with:
find_helpful_skills("grant writing")
find_helpful_ki("KAKENHI")
| Pipeline | Domain | Key Skills | Active Workflows |
|---|---|---|---|
| Grant_Write_Pipeline | Scientific grants | grant-mock-reviewer, medical-translation, abstract-trimmer, grant-budget-justification |
/grant-write |
| KAKENHI_Pipeline | JSPS KAKENHI reporting | kakenhi-form-completion, kakenhi-pre-award-forms |
/kakenhi-annual-report |
| Manuscript_Write_Pipeline | Academic manuscripts | peer-review, statistical-analysis, literature-review, method-writing |
/manuscript-write |
| Bioinformatics_Pipeline | Genomics & proteomics | string-database, ppt-master |
β |
| Data_Analysis_Pipeline | Statistical modeling | agentic-data-scientist, flowcypy |
/visualize-data |
| Software_Engineering_Pipeline | Code generation & QA | gtb-validator, pipeline-orchestrator |
β |
| System_Admin_Pipeline | Environment management | agent-environment-capabilities, conditional-logic-execution |
β |
| UI_Development_Pipeline | Web interface design | agent-design-principles |
β |
| Writing_Publishing_Pipeline | Academic publishing | research-lookup, semantic-scholar-database |
β |
| Web_Scraping_API_Pipeline | Data acquisition | api_availability_precheck, crapome |
β |
| Project_Management_Pipeline | Task orchestration | modular-task-breakdown, critical-task-output-guarantee |
β |
| workspace_management | Global infrastructure | regent-governance, md-html-docx-generator |
/lab-commit, /wiki-*, /science-project-onboarding |
This repository uses a two-layer governance system to ensure all pipelines remain compatible as assets evolve.
The Shared Asset Registry at 00.RawData/SHARED_ASSET_REGISTRY.md is the single source of truth for all cross-pipeline assets. Before modifying any shared Skill, KI, Policy, or Workflow, consult this registry.
Cross-Platform Note: SAMS v1.1 uses direct path referencing β no POSIX symlinks. This repository is fully operable on Linux, macOS, and Windows.
python3 01.Shared_Assets/Scripts/audit_shared_assets.pyThis script verifies structural integrity, detects orphaned assets, and flags cross-pipeline compatibility violations.
We welcome contributions! Please read CONTRIBUTING.md before opening a PR.
Key points for contributors:
- All asset modifications must follow the CPCP (Cross-Pipeline Compatibility Protocol)
- New skills must use the
skill-name/SKILL.mddirectory structure - Workflows must include YAML frontmatter with a
descriptionfield β€ 250 chars - Symlinks are strictly banned β use direct path references
This project is licensed under the MIT License β see LICENSE for details.
This project is part of the AROS Ecosystem, developed by LabOnoM. The foundational philosophy of Cognitive Decomposition draws on research in embodied cognition, stochastic resonance neuroscience, and the gut-brain axis. See the Philosophy section above for detailed references.
[FOR AI AGENTS] This section is specifically authored for AI agents operating within this workspace. All rules below are binding and carry the force of LAW within AROS governance.
This repository is a Factory for forging AROS assets. Before modifying any Skill, Policy, KI, or Workflow, you MUST read AGENTS.md and SPEC.md to understand:
- The CPCP constraints (LAW 0)
- The SAMS audit rules (LAW 1)
- Known failure modes (e.g., the LaTeX truncation bug, the SAMS Basename Trap)
- Directory-structure requirements (skill-name/SKILL.md pattern)
| Law | Name | Rule |
|---|---|---|
| LAW -2 | Agent Version Control Awareness | Never modify .regent/ directly. Use rgt binary only. |
| LAW -1 | Asset Forging Constraints | Skills use dir/SKILL.md. No symlinks. KI-Companion pattern for large workflows. |
| LAW 0 | CPCP β SUPREME RULE | Evaluate β Impact β Test β Resolve/Fork β Update Registry before any shared asset modification. |
| LAW 1 | Asset Deployment Protocol | Always use deploy_to_aros.sh. No manual cp to ~/.gemini/. |
| Timestamp | Event |
|---|---|
| 2026-05-11 01:24 | KAKENHI pipeline KIs (e_application_system PDFs, forms) initialized |
| 2026-05-11 18:15 | Grant_Write_Pipeline skills requirements and assets initialized |
| 2026-05-11 18:29 | Manuscript_Write_Pipeline assets and scripts established |
| 2026-05-11 18:44 | Shared Asset Registry and CPCP governance established |
| 2026-05-11 19:00 | Centralized SAMS implemented with direct referencing and programmatic audit tools |
| 2026-05-11 21:00 | Dual-VCS architecture (Git + re_gent) deployed for AI agent auditability |
| 2026-05-11 22:00 | Replaced legacy INDEX.csv with PIPELINE_REGISTRY.md; generalized workflow templates |
| 2026-05-12 00:00 | 12-domain pipeline restructuring completed; SPEC v2.0 published |
| 2026-05-13 00:00 | Repository made public; MIT license added; multi-language READMEs created |
| 2026-05-13 12:50 | Finalized Email Draft #3 to Dr. Takano; integrated 'Freemium-to-Federation' business model & Cognitive Decomposition ideology into core docs |
| H3 | Making the AROS Pipeline Factory public requires a dual-audience README architecture: human-readable introduction at the top, AI-agent operational context at the bottom. | | H4 | Transitioning from local-only cognitive decomposition to a Cloud Federation model enables metered monetization of LLM-orchestrated scientific workflows via a centralized intelligence hub. |
| Asset Type | Factory Source | AROS Runtime Target |
|---|---|---|
| Skills | */Skills/<skill-name>/ |
~/.gemini/skills/<skill-name>/SKILL.md |
| Knowledge Items | */KIs/<ki-name>/ |
~/.gemini/antigravity/knowledge/<ki-name>/ |
| Policies | */Policies/*.md |
~/.gemini/antigravity/policies/ |
| Workflows | */Workflows/*.md |
~/.gemini/antigravity/global_workflows/ |
| Slash Command | Pipeline | Purpose |
|---|---|---|
/grant-write |
Grant_Write_Pipeline | Universal grant writing |
/kakenhi-annual-report |
KAKENHI_Pipeline | JSPS KAKENHI lifecycle |
/manuscript-write |
Manuscript_Write_Pipeline | Dual-agent manuscript drafting |
/lab-commit |
workspace_management | Canonical commit gateway |
/lab-reorganize |
workspace_management | Git-safe file reorganization |
/wiki-ingest |
workspace_management | Ingest papers/data into LLM-Wiki |
/wiki-query |
workspace_management | Grounded Q&A from LLM-Wiki |
/wiki-research |
workspace_management | Literature research into wiki |
/wiki-update |
workspace_management | Wiki linting & synthesis |
/wiki-build |
workspace_management | Compile wiki into output docs |
/audit-shared-assets |
workspace_management | SAMS structural integrity audit |
/science-project-onboarding |
workspace_management | First-time project setup |
/visualize-data |
Data_Analysis_Pipeline | Autonomous diagram generation |
/research-discovery |
workspace_management | Research planning & brainstorm |
/qa-system-audit |
workspace_management | AROS QA health checks |