Skip to content
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
33 commits
Select commit Hold shift + click to select a range
4a737af
docs: modernize enterprise terminology and enforce absolute asset lin…
squid-protocol Jun 18, 2026
94f98f5
refactor(binary-anomaly): standardize terminology and expand routing …
squid-protocol Jun 18, 2026
79d1e68
refactor(firewall): polish logging outputs and fix sandbox multiplier…
squid-protocol Jun 18, 2026
a0d977c
fix(vault-sentinel): resolve legacy _check_solar_shield method call a…
squid-protocol Jun 18, 2026
888a136
refactor(pii-hunter): standardize DevSecOps terminology and add deep …
squid-protocol Jun 18, 2026
891d51f
refactor(log-scanner): update stream-processing semantics and add rig…
squid-protocol Jun 18, 2026
616768e
chore: professionalize optical taxonomy and scanner terminology
squid-protocol Jun 24, 2026
79bcedd
refactor(core): upgrade optical engine to professional 5-tier lexical…
squid-protocol Jun 24, 2026
39e2834
style: apply global ruff formatting rules
squid-protocol Jun 24, 2026
ac5defb
docs: update language addition guide with professional dictionary tax…
squid-protocol Jun 24, 2026
bfcd09c
Refactor core architecture, security sensors, and recorders for schem…
squid-protocol Jun 25, 2026
e370198
Update test suites to match new clarity V5 schema definitions
squid-protocol Jun 25, 2026
55106b2
refactor: standardize DevSecOps and SAST terminology across core engine
squid-protocol Jun 26, 2026
6b617b1
refactor(security): standardize threat inference and SAST terminology
squid-protocol Jun 26, 2026
81af9cd
refactor(audit): update terminology for zero-trust firewall and shado…
squid-protocol Jun 26, 2026
dc6fad2
refactor(ai): standardize autonomous execution vectors in appsec guar…
squid-protocol Jun 26, 2026
93d24b1
refactor(cobol): apply enterprise architectural language to legacy ex…
squid-protocol Jun 26, 2026
3dfeffd
refactor(core): standardize DevSecOps terminology in central orchestr…
squid-protocol Jun 26, 2026
d2d5309
fix(ai_guardrails): resolve file mix-up and align test assertions wit…
squid-protocol Jun 26, 2026
8302e4b
fix(network_auditing): restore truncated run_api_audit method and upd…
squid-protocol Jun 26, 2026
8154e0a
test(cobol): align legacy extraction test suite with new enterprise t…
squid-protocol Jun 26, 2026
b2050f9
test(core): update GalaxyScope success assertion
squid-protocol Jun 26, 2026
55fd54e
fix(licensing): remove hardcoded pytest bypass to enforce strict envi…
squid-protocol Jun 26, 2026
a7fb196
refactor(cobol_to_java): elevate nomenclature to enterprise generator…
squid-protocol Jun 26, 2026
4066dbb
test: align golden images with new defensive design outputs and enfor…
squid-protocol Jun 26, 2026
3deecca
docs: formalize module documentation with enterprise terminology, arc…
squid-protocol Jun 26, 2026
c337325
security(vault_sentinel): redact clear-text secrets from stdout to pr…
squid-protocol Jun 26, 2026
266d93a
Potential fix for pull request finding 'CodeQL / Clear-text logging o…
squid-protocol Jun 26, 2026
1685d93
test(language_lens): patch temporal friction timeout to resolve CI ra…
squid-protocol Jun 27, 2026
c3aea56
Merge branch 'main' into feature/clarity_updates_5
squid-protocol Jun 27, 2026
f0d2a4e
docs(core): reframe blAST engine as heuristic structural sequencer
squid-protocol Jun 27, 2026
b364821
docs(tools): standardize engineering highlights and decoupled archite…
squid-protocol Jun 27, 2026
b299429
refactor(orchestrator): align CLI outputs with new structural termino…
squid-protocol Jun 27, 2026
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
2 changes: 2 additions & 0 deletions .gitignore
Original file line number Diff line number Diff line change
Expand Up @@ -89,3 +89,5 @@ gitgalaxy/tools/supply_chain_security/README supply_chain.md
gitgalaxy/tools/terabyte_log_scanning/README terabyte.md
*.sqlite
*.db
LM-Notebook files.md
terminology_updates.md
84 changes: 42 additions & 42 deletions README.md

Large diffs are not rendered by default.

49 changes: 26 additions & 23 deletions gitgalaxy/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -6,66 +6,69 @@

Welcome to the internal source code for the **GitGalaxy Core Engine**.

This directory contains the central orchestrator—**GalaxyScope**—alongside the core physics, optical routing, and mathematical heuristics that power the entire system. If you are a developer looking to contribute, understand the pipeline, or run the primary CLI, here is your architectural map.
This directory contains the central orchestrator—**GalaxyScope**—alongside the core structural mechanics, lexical routing, and mathematical heuristics that power the entire DevSecOps ecosystem. If you are a developer looking to contribute, understand the data pipeline, or run the primary CLI, here is your architectural map.

### 🗺️ The Developer Map (How the Pipeline Flows)

When you trigger the `galaxyscope` command, the data flows through these five physical directories:
When you trigger the `galaxyscope` command, the data flows through these physical directories:

* **`/core/` (The Frontline):** The optical routing layer. Contains the [Aperture Filter](https://squid-protocol.github.io/gitgalaxy/02-03-aperture-filter/) and [The Prism](https://squid-protocol.github.io/gitgalaxy/02-07-the-prism/), which break down source code into structural signals, separating executable logic from ghost mass (comments) and inert binaries.
* **`/physics/` (The Math):** The heuristics engine. Contains the [Signal Processor](https://squid-protocol.github.io/gitgalaxy/02-09-signal-processing/) and [Neural Auditor](https://squid-protocol.github.io/gitgalaxy/02-19-neural-auditor/), which apply GitGalaxy mathematics to score O(N) complexity, topological blast radius, and state flux without using ASTs.
* **`/recorders/` (The Exporters):** The translation layer. Converts the internal state maps into highly relational [SQLite Databases](https://squid-protocol.github.io/gitgalaxy/02-21-record-keeper/), AI-agent JSON tickets, and the final 3D WebGPU payload.
* **`/security/` (The Sentinel):** The zero-trust validation layer. Contains the [Security Lens](https://squid-protocol.github.io/gitgalaxy/02-06-security-lens/) responsible for intercepting embedded malware, hardcoded secrets, and logic bombs on the fly.
* **`/tools/` (The Spokes):** The enterprise automation layer. Contains specialized controllers for CI/CD pipelines—like the [Supply Chain Firewall](https://squid-protocol.github.io/gitgalaxy/04-03-supply-chain-firewall/) and [PII Leak Hunter](https://squid-protocol.github.io/gitgalaxy/04-06-pii-leak-hunter/)—that consume the core engine's telemetry. **These specialized tools power our official [Zero-Trust DevSecOps GitHub Action](../github-action-read-me.md).**
* **`/core/` (The Frontline):** The lexical routing layer. Contains the [Aperture Filter](https://squid-protocol.github.io/gitgalaxy/02-03-aperture-filter/) and [The Prism](https://squid-protocol.github.io/gitgalaxy/02-07-the-prism/), which break down source code into structural signals, stripping away inert binaries and separating executable logic from documentation.
* **`/metrics/` (The Math):** The heuristic and statistical engine. Contains the [Signal Processor](https://squid-protocol.github.io/gitgalaxy/02-09-signal-processing/) and [Statistical Auditor](https://squid-protocol.github.io/gitgalaxy/02-19-neural-auditor/), which apply GitGalaxy mathematics to calculate O(N) complexity, topological blast radius, and architectural drift without requiring ASTs.
* **`/security/` (The Threat Validator):** The security inference layer. Contains the [Security Lens](https://squid-protocol.github.io/gitgalaxy/02-06-security-lens/) responsible for identifying embedded malware signatures, autonomous AI execution vectors, and destructive execution patterns during Phase 1 ingestion.
* **`/recorders/` (The Exporters):** The translation layer. Converts the internal state RAM maps into highly relational [SQLite Databases](https://squid-protocol.github.io/gitgalaxy/02-21-record-keeper/), intermediate JSON representation for AI agents, and the final WebGPU visualization payload.
* **`/standards/` (The Calibration Layer):** The source of truth for the engine. Contains the polyglot lexical taxonomies and the global configuration profiles that tune the engine's strictness levels.
* **`/tools/` (The Execution Controllers):** The enterprise automation layer. Contains specialized controllers for CI/CD pipelines—like the [Supply Chain Firewall](https://squid-protocol.github.io/gitgalaxy/04-03-supply-chain-firewall/) and [PII Leak Hunter](https://squid-protocol.github.io/gitgalaxy/04-06-pii-leak-hunter/)—that independently consume the core engine's telemetry.

*(Note: The `cobol_*_controller.py` scripts at the root level act as dedicated entry points for the Mainframe Legacy Modernization suite, bypassing standard Git orchestration to process flat mainframe directories).*

---

### ⚡ Performance Showcase: NVDA (NonVisual Desktop Access)

To demonstrate the GalaxyScope orchestrator's capability on complex, cross-language system architecture, we unleashed it on **NVDA**, the open-source Windows screen reader.
To demonstrate the GalaxyScope orchestrator's capability on complex, cross-language system architecture, we executed it against **NVDA**, the open-source Windows screen reader.

Because NVDA relies heavily on bridging Python application logic with low-level C++ system hooks, it requires advanced polyglot dependency mapping. The blAST engine successfully parsed the mixed-language architecture, analyzing **236,754 lines of code** in just **5.59 seconds** (a velocity of 42,357 LOC/sec).

Crucially, during the import resolution phase, the Air-Gapped Dependency Radar successfully intercepted a structural naming collision (`fstream` vs `sstream`), proving the real-time typosquatting defenses are fully operational without relying on cloud APIs.
Crucially, during the import resolution phase, the local dependency scanner successfully intercepted a structural naming collision (`fstream` vs `sstream`), proving the real-time typosquatting defenses are fully operational without relying on cloud-based CVE APIs.

> **Note on False Positives:** Because `fstream` and `sstream` are both standard C++ libraries, this specific flag is a false positive. To prevent the engine from halting on trusted internal libraries, contributors can whitelist them by adding them to the global `approved_imports.json` registry (see [GitGalaxy Config](https://squid-protocol.github.io/gitgalaxy/06-01-gitgalaxy-config/)).
> **Enterprise Calibration (Zero-Trust Enforcement):** Because `fstream` and `sstream` are both standard C++ libraries, flagging this collision demonstrates the engine's default Zero-Trust strictness. To prevent the pipeline from failing on trusted internal or standard libraries, DevSecOps teams simply add them to the `APPROVED_IMPORTS` allowlist in the [GitGalaxy Config](https://squid-protocol.github.io/gitgalaxy/06-01-gitgalaxy-config/).

![NVDA Processing Demo](../../docs/wiki/assets/nvda_processing.gif)
![NVDA Processing Demo](https://raw.githubusercontent.com/squid-protocol/gitgalaxy/main/docs/wiki/assets/nvda_processing.gif)

```text
~~~text
[INFO] PASS_1.5: Running Air-Gapped Typosquatting & Dependency Confusion Radar...
[CRITICAL] 🚨 TYPOSQUATTING DETECTED: 'fstream' in nvdaHelper/vbufBase/storage.cpp closely matches anchor 'sstream'!
[WARNING] Intercepted 1 typosquatting attempts via repository baseline analysis.
...
[INFO] --- MISSION_SUCCESS: 849 files mapped in 5.59s ---
[INFO] --- ENGINE_TELEMETRY: Processed 236,754 lines of code at 42,357 LOC/s ---
```
~~~

---

### 🛠️ Local Development & GalaxyScope Execution

If you are modifying the internal physics or optical routing, it is highly recommended to install the package in editable mode so your CLI commands instantly reflect your local code changes.
If you are modifying the internal analysis logic or lexical routing, it is highly recommended to install the package in editable mode so your CLI commands instantly reflect your local code changes.

From the **root directory** of the repository, run:
```bash
~~~bash
pip install -e .
```
~~~

**Important:** GitGalaxy contains an embedded commercial licensing guardrail. To prevent a 5-second execution delay while testing your code locally, you must export the Community Free Tier key into your development environment before running the orchestrator:
```bash
~~~bash
export GITGALAXY_LICENSE_KEY="COMMUNITY_FREE_TIER"
```
~~~

Once installed and the key is set, you can trigger the main orchestrator globally from your terminal. This command runs the full [Data Pipeline](https://squid-protocol.github.io/gitgalaxy/02-01-pipeline-overview/) and outputs the final artifact.
```bash
~~~bash
galaxyscope /path/to/test/repo --debug
```
~~~

Before submitting a Pull Request, ensure your changes do not skew the core baseline risk equations by running the test suite:
```bash
~~~bash
python3 -m unittest discover tests/
```
~~~

---
### 🌌 Deep Dive into the Pipeline Architecture
Expand All @@ -74,4 +77,4 @@ To fully understand how GalaxyScope processes data, maps files, and applies risk
* 📖 **[GalaxyScope CLI Reference](https://squid-protocol.github.io/gitgalaxy/01-02-galaxyscope-cli-reference/)** (Flags, outputs, and behaviors)
* 📖 **[The Data Pipeline Overview](https://squid-protocol.github.io/gitgalaxy/02-01-pipeline-overview/)** (Step-by-step breakdown of the runtime)
* 📖 **[Risk Exposures & Methodology](https://squid-protocol.github.io/gitgalaxy/08-01-methodology/)** (The math behind the heuristics)
* 🪐 **[Return to the Main GitGalaxy Hub](https://github.com/squid-protocol/gitgalaxy)**
* 🪐 **[Return to the Main GitGalaxy Hub](https://github.com/squid-protocol/gitgalaxy)**
Loading
Loading