NeurDB is a cutting-edge AI-powered autonomous database system that revolutionizes how organizations manage, query, and interact with their data. NeurDB seamlessly integrates artificial intelligence capabilities directly into the database layer, in all major components, enabling intelligent data processing and autonomous optimization.
Modern data systems struggle to keep up with the growing complexity of AI-driven applications. Traditional databases rely on manual tuning and external machine learning pipelines, resulting in fragmented workflows, high maintenance costs, and limited adaptability.
NeurDB redefines this paradigm by making AI a first-class citizen inside the database. It is not just a database with AI features — it is an AI-native data system that learns, adapts, and optimizes itself in real time.
Key advantages:
- AI-Native Architecture: AI and data processing are deeply fused within the database engine, enabling seamless model training, inference, and management.
- Intelligent Analytics: Built-in AI operators let users run predictive and generative analytics directly with SQL, without external pipelines.
- Autonomous Operation: Self-tuning, self-scaling, and self-healing mechanisms continuously optimize performance and resource usage.
- Unified AI & Data Platform: One system for both data management and AI lifecycle, ensuring stronger security, lower latency, and simplified workflows.
- Docker and Docker Compose
- 8GB+ RAM recommended
- (Optional) NVIDIA GPU with CUDA support for GPU acceleration
git clone https://github.com/neurdb/neurdb.git
cd neurdb
chmod -R 777 .For GPU-enabled deployment:
bash build.sh --gpuFor CPU-only deployment:
bash build.sh --cpuWait for the system to initialize. You'll see:
Please use 'control + c' to exit the logging print
...
Press CTRL+C to quit
Once running, connect to NeurDB using your preferred PostgreSQL client:
psql -h localhost -p 5432 -U postgres -d neurdbSee NeurDB in action:
NeurDB consists of three main components:
-
AI Layer (NeurIDA): This layer manages AI models and analytics tasks inside the database and supports the lifecycle of in-database machine learning models.
- Model Selection (VLDB 2024) – Automatically ranks and selects suitable models for database tasks.
- Model Slicing (VLDB 2025) – Decomposes large models into smaller slices to enable efficient execution and deployment.
- Model Construction – Supports the composition and integration of multiple models for AI analytics and transactional workloads.
-
Database Engine Layer: Built on top of an enhanced PostgreSQL engine, this layer integrates learned optimization, learned concurrency control, and runtime adaptive execution.
- NeurQO – A learned query optimizer that performs fast-adaptive query optimization through query-state abstraction and workload feedback.
- NeurEngine – Implements the planner, optimizer, and executor, and supports unified execution graph construction and CPU–GPU co-scheduling to execute both traditional database operators and AI operators.
- NeurCC (SIGMOD 2026) – A learned concurrency control framework that models concurrency control as a learnable function and dynamically adapts to workload changes.
-
Adaptive Data Access Components: NeurDB incorporates adaptive data access modules that improve indexing and caching performance under dynamic workloads.
- NeurIndex – A general learned index framework for adaptive index design and optimization under dynamic workloads, with Selix (SIGMOD 2026) as one implementation that enables DRL-based index adaptation.
- NeurCache – A workload-aware caching framework that jointly manages model weights, features, and data through a unified cache structure.
-
Storage Layer: A dual-format storage system supporting both key–value (RocksDB), heap storage, and model storage (NeurStore).
- NeurStore – Provides efficient storage and management for deep learning models.
-
Benchmarking and Evaluation NeurDB also includes benchmarking frameworks for evaluating AI-powered database systems and learned components.
- NeurBench (SIGMOD 2026) – A unified benchmark for evaluating learned database components under data and workload drift.
- NL2SQLBench (VLDB 2026) – A modular benchmark for evaluating LLM-enabled NL2SQL systems.
-
Tools and Utilities NeurDB provides additional system tools to support model interpretation and system analysis.
- CoShap (SIGMOD 2026) – A scalable method for Shapley value approximation to support model interpretation and feature contribution analysis.
NeurDB is backed by rigorous academic research. Our work has been published in top-tier venues:
-
NeurDB: An AI-powered Autonomous Data System [PDF] SCIENCE CHINA Information Sciences, 2024
-
NeurDB: On the Design and Implementation of an AI-powered Autonomous Database [PDF] CIDR 2025
-
Database Native Model Selection: Harnessing Deep Neural Networks in Database Systems [PDF] VLDB 2024
-
Powering In-Database Dynamic Model Slicing for Structured Data Analytics [PDF] VLDB, 2025
-
NeurStore: Efficient In-database Deep Learning Model Management System SIGMOD 2026
-
Modeling Concurrency Control as a Learnable Function SIGMOD 2026
-
On Self-Designing Learned Indexes SIGMOD 2026
-
NL2SQLBench: A Modular Benchmarking Framework for LLM-Enabled NL2SQL Solutions VLDB 2026
-
NeurBench: A Benchmark Suite for Learned Database Components with Drift Modeling SIGMOD 2026
-
CoShap: A Scalable Coalition Growth Approach to Shapley Value Approximation SIGMOD 2026
If you use NeurDB in your research, please cite:
@article{neurdb-scis-24,
author = {Beng Chin Ooi and Shaofeng Cai and Gang Chen and
Yanyan Shen and Kian-Lee Tan and Yuncheng Wu and
Xiaokui Xiao and Naili Xing and Cong Yue and
Lingze Zeng and Meihui Zhang and Zhanhao Zhao},
title = {NeurDB: An AI-powered Autonomous Data System},
journal = {SCIENCE CHINA Information Sciences},
year = {2024},
url = {https://www.sciengine.com/SCIS/doi/10.1007/s11432-024-4125-9},
doi = {10.1007/s11432-024-4125-9}
}

