SovereignNode AI is a production-ready, air-gapped Industrial IoT predictive maintenance edge node. It is designed to operate entirely on constrained local hardware, ingesting high-frequency factory telemetry, detecting critical anomalies, and generating Retrieval-Augmented Generation (RAG) diagnostic reports—all without ever sending sensitive operational data to the cloud.
Industrial environments require absolute data sovereignty, high reliability, and rapid insight generation on highly constrained edge servers.
- Java 21 & Spring Boot 3 (Telemetry Ingestion): Chosen for the ingestion layer due to its enterprise reliability, strict typing, and seamless AMQP integration for handling massive influxes of sensor data without dropping packets.
- Python & FastAPI (AI Engine): Chosen because Python is the undisputed king of the ML ecosystem (HuggingFace, PyTorch). FastAPI provides the high-performance, asynchronous endpoints required to bridge the Java layer with the AI engine.
- React, Vite, TypeScript, Tailwind (Dashboard): Chosen for the frontend to provide a lightning-fast, reactive operator dashboard with real-time polling hooks and a dynamic user interface.
- MongoDB & RabbitMQ (Data & Messaging): Chosen to decouple the ingestion logic from database writes, ensuring zero data loss during high-load spikes while persisting telemetry securely.
The platform is divided into domain-specific microservices:
/iot-telemetry-service: The Java-based ingestion engine that consumes AMQP streams and persists telemetry to MongoDB./ai-inference-engine: The Python backend hosting the quantizedQwen-0.5BSmall Language Model (SLM) and the ChromaDB vector store for RAG grounding./sovnode-dashboard: The Vite-powered React operator UI that visualizes live sensor feeds and AI diagnostic insights.- Root Directory: Contains the master
docker-compose.yml, a mock Pythontelemetry_simulator.py, and thesetup_edgebootstrap scripts.
The entire backend runs in a cross-platform Docker Compose stack. It relies on the NVIDIA Container Toolkit for GPU passthrough but is heavily optimized to run safely on standard CPUs (~1.1 GB RAM total footprint) if hardware resources are constrained.
- Ensure Docker and Docker Compose are installed.
- Grant execution permissions and run the bootstrap script:
sudo bash ./setup_edge.sh(If the server dependencies are already installed, you can simply run docker compose up -d)
Once the Docker Compose stack is running, you can interact with the platform:
-
Start the Dashboard: Open a new terminal in the
sovnode-dashboardfolder and run the UI natively:cd sovnode-dashboard npm run devAccess the dashboard at
http://localhost:5173. -
Trigger the Simulator: Start the Python telemetry simulator to stream mock sensor data into the platform:
python telemetry_simulator.py
-
Monitor the AI: Watch the inference engine logs in real-time as it detects anomalies and generates insights:
docker compose logs -f inference-engine
Curious about how we optimized this stack to run on a 2GB VRAM edge server without crashing? Read the Architecture & Engineering Report.