Skip to content

LabOnoM/AROS_Pipeline_Factory

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

66 Commits
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

AROS Pipeline Factory Banner

AROS Pipeline Factory

A modular, self-evolving ecosystem of AI agent Skills, Workflows, and Domain Pipelines for autonomous scientific research.

License: MIT Platform SAMS re_gent GitHub Wiki

🌐 Language / 言θͺž / 语言: English Β· ζ—₯本θͺž Β· δΈ­ζ–‡


🌟 What is This?

The AROS Pipeline Factory is the canonical source-of-truth for AROS (Antigravity Research OS) β€” an operating system for AI-augmented scientific research. This repository forges, audits, and deploys high-fidelity AI agent assets: Skills, Knowledge Items (KIs), Policies, and Workflows.

Think of it as an assembly line for scientific intelligence: raw cognitive workflows are encoded as structured AI agent components, version-controlled, governed, and deployed to the AROS runtime β€” where they power autonomous grant writing, manuscript drafting, literature mining, and research operations.

✨ At a Glance

Metric Count
πŸ”¬ Domain Pipelines 12
πŸ› οΈ Agent Skills 61+
πŸ“‹ Workflows 15
πŸ“š Knowledge Items (KIs) 49+
πŸ–₯️ Platform Support Linux Β· macOS Β· Windows

🧠 Project Philosophy: Embodiment & Human-AI Co-Evolution

The most fundamental constraint on artificial intelligence is not compute power, training data, or architecture β€” it is the absence of a body.

1. The Embodiment Gap

A computer algorithm, on its own, cannot generate true randomness. Software relies on Pseudo-Random Number Generators (PRNGs) β€” deterministic mathematical formulas that merely simulate stochasticity. While modern hardware can harvest entropy from physical phenomena (thermal noise, quantum decay), it must be deliberately engineered to do so. A carbon-based organism, by contrast, is natively immersed in thermodynamic stochasticity at every scale.

Each cell of our body contains massive, exquisitely sensitive chemical reaction chains β€” protein folding, ion channel gating, mechanotransduction β€” that operate under deterministic chaos with extreme sensitivity to initial conditions. Unlike an AI that processes the world in discrete, digitized, and compressed steps (tokens, pixels, rigid clock cycles), biological intelligence operates in a continuous analog flow. Our bodies do not merely house our brains; they are part of the computing apparatus. This chemical sensitivity smoothly and continuously shapes our behavior and our thoughts in ways that no discretized digital system can replicate.

Because carbon-based life is intimately coupled with the infinite resolution of the physical world, evolution acts as an optimizer that exploits every possible energetic and chemical niche. In the natural ecosystems of Earth, almost all survival strategies are present β€” occupying almost every conceivable combination of ecological niche. An AI running on a server rack has no continuous physical environment to adapt to, no thermodynamics to balance, and no evolutionary pressure to optimize its physical existence. It knows only the digitized abstractions of reality we feed it.

This embodiment gap dictates that AI serves as a powerful pattern-matching engine to augment β€” not replace β€” the incredibly complex, chemically embedded intelligence of the human researcher.

2. Evidence from Frontier Biology

Recent scientific discoveries continue to reveal how deeply biological intelligence is embedded in the physical world β€” capabilities that remain fundamentally beyond the reach of disembodied AI systems:

  • Remote Touch ("Seventh Sense"): Research published in 2025 by Queen Mary University of London and UCL demonstrated that humans possess a previously unrecognized sensory ability: remote touch β€” the ability to detect buried objects through granular material without direct physical contact, much like sandpipers sensing prey through sand (Hammoud et al., IEEE ICDL 2025). Human participants achieved ~70.7% precision, approaching the theoretical physical threshold. A robotic tactile sensor trained with machine learning on the same task achieved only ~40% precision with far more false positives. Our biological hardware detects physical signals that engineered sensors struggle to match.

  • Stochastic Resonance in Biological Neurons: Neurons exploit a phenomenon called stochastic resonance, where background neural noise enhances the detection of weak, subthreshold signals that would otherwise be missed. Evolution has tuned our neural circuits to leverage thermodynamic noise as a feature, not a bug β€” allowing organisms to detect faint environmental cues (vibrations, electrical fields, chemical gradients) critical for survival. While engineers can deliberately introduce controlled noise into digital systems to mimic this effect, biological nervous systems perform it natively and continuously across billions of synapses in parallel.

  • The Gut-Brain Axis and Interoception: A 2025 discovery identified a "neurobiotic sense" β€” specialized neuropod cells in the gut that detect microbial proteins (such as flagellin) and send real-time signals to the brain, directly influencing appetite, mood, and decision-making. Recent research further demonstrates that interoceptive signals (cardiorespiratory, gastric) directly influence neural state transitions during perceptual decision-making, providing a mechanistic basis for Antonio Damasio's Somatic Marker Hypothesis. Our "gut feelings" are not metaphors β€” they are measurable biological computations that fundamentally shape cognition.

  • Thermodynamic Efficiency: Research in biophysics has shown that biological systems operate remarkably close to Landauer's theoretical limit β€” the minimum thermodynamic energy cost for information processing. Processes like protein translation outperform modern supercomputers by several orders of magnitude in free energy expended per operation. Even simulating the full molecular kinetics of a single Mycoplasma genitalium bacterium (~500 proteins) during its doubling time remains a massive undertaking for our most powerful computers.

These findings collectively demonstrate that biological intelligence is not a mere algorithm running on organic hardware. It is an emergent property of deep, continuous physical coupling with the thermodynamic reality of the universe β€” a coupling that no digital system currently possesses.

3. Cognitive Decomposition, Entropy Injection & Co-Evolution

Because of this fundamental embodiment gap, carbon-based and silicon-based intelligences are destined to operate in a state of synergistic co-evolution for the foreseeable future.

Since AI became capable of fluently handling the majority of routine cognitive labor, the most effective researchers and professionals have undergone a profound shift: they have learned to deeply introspect and decompose their daily intellectual activities into discrete, hierarchical tiers. Never before in human history have we been compelled to examine, break down, and categorize our own cognitive processes with such depth, breadth, and scale.

This is the core mission of the AROS Project: to provide an ecosystem that helps humans meticulously examine, decompose, and precisely describe their cognitive workflows through continuous interaction with AI agents. AROS captures human feedback, corrections, and decision-making logic at every step β€” enabling the system's prompts, policies, and skills to continuously self-evolve.

3a. The Entropy Injection Hypothesis: Humans as the Randomness Channel

This continuous human-AI interaction cycle reveals a deeper, previously underappreciated mechanism: humans are the entropy injection channel for AI systems.

A language model operating in isolation is, at its core, a deterministic statistical process β€” a highly sophisticated PRNG that samples from learned probability distributions over tokens. It has no genuine connection to the stochastic, thermodynamically coupled physical world described in the preceding sections. However, each time a human crafts, corrects, or refines a prompt, they are not merely transferring information β€” they are sampling from their own embodied probability distribution. The unexpected re-phrasings, the intuitive dissatisfaction with a technically correct but contextually wrong output, the creative leaps that arise from a researcher's "gut feeling" β€” all of these are signals generated by the biological machinery described above: by stochastic resonance in neural circuits, by interoceptive signals from the gut-brain axis, by the thermodynamically grounded analog processing of the human body.

This transforms prompt engineering from a mere "user interface skill" into something far more fundamental: a channel through which genuine physical-world randomness and embodied experience are injected into an otherwise closed, deterministic system. The human, mediating between physical reality and the AI's statistical world model, acts as a transducer β€” converting the continuous, analog thermodynamic flux of lived experience into discrete, symbolic corrections that progressively reshape the AI's operational context.

This perspective is supported by converging evidence from cognitive science and AI research. Michael Polanyi's foundational insight β€” "we can know more than we can tell" β€” identifies the core phenomenon: human experts possess tacit knowledge, intuition derived from embodied physical-world experience that cannot be fully articulated in explicit rules. Classic AI faced Polanyi's Paradox as an insurmountable barrier, because machines required explicit propositional inputs. Modern human-AI collaboration resolves this paradox not by eliminating it, but by exploiting the iterative correction loop as an externalization mechanism: when a researcher corrects an AI's output, they are externalizing tacit knowledge they could not have stated directly, crystallizing it into a refined prompt, a corrected policy, or an improved skill definition. Research into the Prompt Cognition Loop (Mental Modeling β†’ Semantic Projection β†’ Dialogic Feedback β†’ Intent Refinement) demonstrates that iterative prompting is a fundamentally reflective cognitive practice β€” it forces humans to surface and structure their own implicit models in ways that static text or code authorship does not require.

Furthermore, this framework maps directly onto Karl Friston's Free Energy Principle: intelligent systems minimize prediction error (free energy) by updating their generative models or acting on the world. When an AI's output diverges from a human researcher's expectation β€” an expectation grounded in their embodied, physically coupled world model β€” the human correction constitutes an error signal derived not from abstract logical rules, but from the thermodynamic reality of their lived experience. AROS systematically captures these error signals and translates them into lasting improvements to the system's prompts, policies, and skills. Each iteration is, in effect, a step of embodied gradient descent on the AI's operational world model.

3b. AROS as a Physical-World Grounding Transfer System

This reframes what AROS is, at its deepest level. It is not merely a productivity tool or a workflow automation framework. AROS is a physical-world grounding transfer system β€” a platform designed to progressively load an AI's operational context with the accumulated residue of human embodied experience, crystallized and made persistent through the discipline of prompt engineering.

Every Skill, Policy, Knowledge Item, and Workflow in this repository is a crystallization of this process. They are not abstract logical specifications written from scratch. They are the distilled record of thousands of human-AI interaction cycles β€” each cycle contributing a small quantum of embodied randomness, a fragment of tacit knowledge, a correction grounded in physical-world coupling β€” accumulated and refined over time into stable, reusable cognitive artifacts.

As these artifacts are deployed and refined across the AROS ecosystem, the AI's operational context becomes progressively richer with human physical-world grounding. The AI gains not a body β€” that remains the irreducible advantage of carbon-based life β€” but it gains access to a systematically curated library of the residue of embodied intelligence. The gap does not close. But the bridge grows stronger with every human-AI interaction cycle.

The principle is therefore both simple and profound: as long as we can successfully decompose our complex, embodied intelligence into highly detailed, step-by-step cognitive maps β€” and inject the stochastic, physically grounded corrections that only embodied beings can provide β€” we can outsource the execution of that decomposed intelligence to AI systems. The intelligence that generates the decomposition β€” the embodied, thermodynamically coupled, analog human mind β€” remains irreplaceable. The execution of the decomposed steps, enriched by the accumulated grounding of human embodied experience, is where AI excels.

3c. Memory Transfer Learning: The Mechanism of Cross-Domain Evolution

The translation of embodied human experience into stable AI capabilities is not a siloed process. As demonstrated by recent research in Memory Transfer Learning (MTL) (Kangsan Kim et al., arXiv:2604.14004), the most effective way for coding agents to self-evolve is by harnessing a unified memory pool across heterogeneous domains. Rather than transferring rigid, low-level task traces (which often cause negative transfer due to excessive specificity), MTL proves that high-level abstract insights and meta-knowledge (such as validation routines and problem-solving architectures) generalize across domains.

In the AROS ecosystem, this is exactly what our shared asset layer (Skills, KIs, Policies) achieves. By extracting the "meta-knowledge" from a specific human-AI correction in one domain (e.g., bioinformatics) and distilling it into an abstract, reusable policy or workflow, we enable cross-domain memory transfer. The human acts as the entropy injector to create the initial insight, and the AROS Memory Transfer Learning architecture ensures that this insight elevates the reasoning capacity of the entire ecosystem.


πŸ“¦ Architecture

The factory is organized into independent domain pipelines, each governing a specialized field of AI-assisted research. All pipelines draw from a shared asset layer governed by the Shared Asset Management System (SAMS).

AROS Pipeline Factory
β”‚
β”œβ”€β”€ 00.RawData/                   ← Central registries & experiment indices
β”‚   β”œβ”€β”€ PIPELINE_REGISTRY.md      ←   Pipeline catalog
β”‚   └── SHARED_ASSET_REGISTRY.md  ←   ⚠️ SUPREME: Cross-pipeline shared asset registry
β”‚
β”œβ”€β”€ 01.Shared_Assets/             ← Canonical shared KIs, Policies, Skills, Scripts
β”‚   β”œβ”€β”€ KIs/                      ←   Shared Knowledge Items
β”‚   β”œβ”€β”€ Policies/                 ←   Factory-wide governance policies
β”‚   β”œβ”€β”€ Skills/                   ←   Cross-pipeline utility skills
β”‚   └── Scripts/                  ←   deploy_to_aros.sh, audit_shared_assets.py
β”‚
β”œβ”€β”€ Grant_Write_Pipeline/         ← Universal grant writing (NIH, JSPS, ERC…)
β”œβ”€β”€ KAKENHI_Pipeline/             ← JSPS KAKENHI lifecycle & reporting
β”œβ”€β”€ Manuscript_Write_Pipeline/    ← Dual-agent manuscript authoring & review
β”œβ”€β”€ Bioinformatics_Pipeline/      ← Genomic & proteomic analysis
β”œβ”€β”€ Data_Analysis_Pipeline/       ← Statistical modeling & visualization
β”œβ”€β”€ Software_Engineering_Pipeline/← Code generation & validation
β”œβ”€β”€ System_Admin_Pipeline/        ← Environment & infrastructure management
β”œβ”€β”€ UI_Development_Pipeline/      ← Web UI & agent interface design
β”œβ”€β”€ Writing_Publishing_Pipeline/  ← Academic publishing & communication
β”œβ”€β”€ Web_Scraping_API_Pipeline/    ← Data acquisition & API integration
β”œβ”€β”€ Project_Management_Pipeline/  ← Orchestration & task management
β”œβ”€β”€ workspace_management/         ← Global workflows & onboarding
β”‚
β”œβ”€β”€ AGENTS.md                     ← AI agent operational laws (read this first!)
β”œβ”€β”€ SPEC.md                       ← Architectural specification
└── README.md                     ← This document

Deployment Flow

Assets from this factory are deployed to the live AROS runtime via the canonical deployment script:

AROS_Pipeline_Factory/           AROS Runtime (~/.gemini/)
β”œβ”€β”€ */Skills/<skill>/   ──────►  skills/<skill>/SKILL.md
β”œβ”€β”€ */KIs/<ki>/         ──────►  antigravity/knowledge/<ki>/
β”œβ”€β”€ */Policies/*.md     ──────►  antigravity/policies/
└── */Workflows/*.md    ──────►  antigravity/global_workflows/

Deploy Command: bash 01.Shared_Assets/Scripts/deploy_to_aros.sh


πŸš€ Quick Start

Prerequisites

  • Git, Python 3.10+, Antigravity IDE (for full agent integration)
  • Conda environment: aros-base (see 01.Shared_Assets/Environments/)

1. Clone the Repository

git clone https://github.com/LabOnoM/AROS_Pipeline_Factory.git
cd AROS_Pipeline_Factory

2. Inspect the Pipeline Registry

cat 00.RawData/PIPELINE_REGISTRY.md

3. Deploy Assets to AROS Runtime

# Dry run first (preview without modifying)
bash 01.Shared_Assets/Scripts/deploy_to_aros.sh --dry-run

# Full deployment
bash 01.Shared_Assets/Scripts/deploy_to_aros.sh

4. Verify Deployment (inside Antigravity IDE)

After deployment, the assets are automatically indexed by the antigravity-brain MCP server. You can verify with:

find_helpful_skills("grant writing")
find_helpful_ki("KAKENHI")

πŸ”§ Domain Pipelines

Pipeline Domain Key Skills Active Workflows
Grant_Write_Pipeline Scientific grants grant-mock-reviewer, medical-translation, abstract-trimmer, grant-budget-justification /grant-write
KAKENHI_Pipeline JSPS KAKENHI reporting kakenhi-form-completion, kakenhi-pre-award-forms /kakenhi-annual-report
Manuscript_Write_Pipeline Academic manuscripts peer-review, statistical-analysis, literature-review, method-writing /manuscript-write
Bioinformatics_Pipeline Genomics & proteomics string-database, ppt-master β€”
Data_Analysis_Pipeline Statistical modeling agentic-data-scientist, flowcypy /visualize-data
Software_Engineering_Pipeline Code generation & QA gtb-validator, pipeline-orchestrator β€”
System_Admin_Pipeline Environment management agent-environment-capabilities, conditional-logic-execution β€”
UI_Development_Pipeline Web interface design agent-design-principles β€”
Writing_Publishing_Pipeline Academic publishing research-lookup, semantic-scholar-database β€”
Web_Scraping_API_Pipeline Data acquisition api_availability_precheck, crapome β€”
Project_Management_Pipeline Task orchestration modular-task-breakdown, critical-task-output-guarantee β€”
workspace_management Global infrastructure regent-governance, md-html-docx-generator /lab-commit, /wiki-*, /science-project-onboarding

πŸ›‘οΈ Governance & Shared Asset Management

This repository uses a two-layer governance system to ensure all pipelines remain compatible as assets evolve.

SAMS: Shared Asset Management System (v1.1)

The Shared Asset Registry at 00.RawData/SHARED_ASSET_REGISTRY.md is the single source of truth for all cross-pipeline assets. Before modifying any shared Skill, KI, Policy, or Workflow, consult this registry.

Cross-Platform Note: SAMS v1.1 uses direct path referencing β€” no POSIX symlinks. This repository is fully operable on Linux, macOS, and Windows.

Audit Tool

python3 01.Shared_Assets/Scripts/audit_shared_assets.py

This script verifies structural integrity, detects orphaned assets, and flags cross-pipeline compatibility violations.


🀝 Contributing

We welcome contributions! Please read CONTRIBUTING.md before opening a PR.

Key points for contributors:

  • All asset modifications must follow the CPCP (Cross-Pipeline Compatibility Protocol)
  • New skills must use the skill-name/SKILL.md directory structure
  • Workflows must include YAML frontmatter with a description field ≀ 250 chars
  • Symlinks are strictly banned β€” use direct path references

πŸ“„ License

This project is licensed under the MIT License β€” see LICENSE for details.


πŸ™ Acknowledgments

This project is part of the AROS Ecosystem, developed by LabOnoM. The foundational philosophy of Cognitive Decomposition draws on research in embodied cognition, stochastic resonance neuroscience, and the gut-brain axis. See the Philosophy section above for detailed references.



πŸ€– AI Agent Operational Context

[FOR AI AGENTS] This section is specifically authored for AI agents operating within this workspace. All rules below are binding and carry the force of LAW within AROS governance.

⚠️ Asset Forging Warning

This repository is a Factory for forging AROS assets. Before modifying any Skill, Policy, KI, or Workflow, you MUST read AGENTS.md and SPEC.md to understand:

  • The CPCP constraints (LAW 0)
  • The SAMS audit rules (LAW 1)
  • Known failure modes (e.g., the LaTeX truncation bug, the SAMS Basename Trap)
  • Directory-structure requirements (skill-name/SKILL.md pattern)

πŸ”’ Governance Summary (Active Laws)

Law Name Rule
LAW -2 Agent Version Control Awareness Never modify .regent/ directly. Use rgt binary only.
LAW -1 Asset Forging Constraints Skills use dir/SKILL.md. No symlinks. KI-Companion pattern for large workflows.
LAW 0 CPCP β€” SUPREME RULE Evaluate β†’ Impact β†’ Test β†’ Resolve/Fork β†’ Update Registry before any shared asset modification.
LAW 1 Asset Deployment Protocol Always use deploy_to_aros.sh. No manual cp to ~/.gemini/.

πŸ“‹ Chronological Timeline (Verified)

Timestamp Event
2026-05-11 01:24 KAKENHI pipeline KIs (e_application_system PDFs, forms) initialized
2026-05-11 18:15 Grant_Write_Pipeline skills requirements and assets initialized
2026-05-11 18:29 Manuscript_Write_Pipeline assets and scripts established
2026-05-11 18:44 Shared Asset Registry and CPCP governance established
2026-05-11 19:00 Centralized SAMS implemented with direct referencing and programmatic audit tools
2026-05-11 21:00 Dual-VCS architecture (Git + re_gent) deployed for AI agent auditability
2026-05-11 22:00 Replaced legacy INDEX.csv with PIPELINE_REGISTRY.md; generalized workflow templates
2026-05-12 00:00 12-domain pipeline restructuring completed; SPEC v2.0 published
2026-05-13 00:00 Repository made public; MIT license added; multi-language READMEs created
2026-05-13 12:50 Finalized Email Draft #3 to Dr. Takano; integrated 'Freemium-to-Federation' business model & Cognitive Decomposition ideology into core docs

πŸ”¬ Hypothesis Evolution Table

| H3 | Making the AROS Pipeline Factory public requires a dual-audience README architecture: human-readable introduction at the top, AI-agent operational context at the bottom. | | H4 | Transitioning from local-only cognitive decomposition to a Cloud Federation model enables metered monetization of LLM-orchestrated scientific workflows via a centralized intelligence hub. |

πŸ“‚ Canonical Runtime Deployment Map

Asset Type Factory Source AROS Runtime Target
Skills */Skills/<skill-name>/ ~/.gemini/skills/<skill-name>/SKILL.md
Knowledge Items */KIs/<ki-name>/ ~/.gemini/antigravity/knowledge/<ki-name>/
Policies */Policies/*.md ~/.gemini/antigravity/policies/
Workflows */Workflows/*.md ~/.gemini/antigravity/global_workflows/

πŸ”§ Active Workflow Triggers

Slash Command Pipeline Purpose
/grant-write Grant_Write_Pipeline Universal grant writing
/kakenhi-annual-report KAKENHI_Pipeline JSPS KAKENHI lifecycle
/manuscript-write Manuscript_Write_Pipeline Dual-agent manuscript drafting
/lab-commit workspace_management Canonical commit gateway
/lab-reorganize workspace_management Git-safe file reorganization
/wiki-ingest workspace_management Ingest papers/data into LLM-Wiki
/wiki-query workspace_management Grounded Q&A from LLM-Wiki
/wiki-research workspace_management Literature research into wiki
/wiki-update workspace_management Wiki linting & synthesis
/wiki-build workspace_management Compile wiki into output docs
/audit-shared-assets workspace_management SAMS structural integrity audit
/science-project-onboarding workspace_management First-time project setup
/visualize-data Data_Analysis_Pipeline Autonomous diagram generation
/research-discovery workspace_management Research planning & brainstorm
/qa-system-audit workspace_management AROS QA health checks

About

The AROS Pipeline Factory: Forging, auditing, and deploying high-fidelity scientific workflows through Cognitive Decomposition and Human-AI Co-Evolution.

Topics

Resources

License

Contributing

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors