Skip to content

knewnothing-git/gpusion-driver

Folders and files

NameName
Last commit message
Last commit date

Latest commit

ย 

History

3 Commits
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 

Repository files navigation

 โ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ•— โ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ•— โ–ˆโ–ˆโ•—   โ–ˆโ–ˆโ•—โ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ•—โ–ˆโ–ˆโ•— โ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ•— โ–ˆโ–ˆโ–ˆโ•—   โ–ˆโ–ˆโ•—
โ–ˆโ–ˆโ•”โ•โ•โ•โ•โ• โ–ˆโ–ˆโ•”โ•โ•โ–ˆโ–ˆโ•—โ–ˆโ–ˆโ•‘   โ–ˆโ–ˆโ•‘โ–ˆโ–ˆโ•”โ•โ•โ•โ•โ•โ–ˆโ–ˆโ•‘โ–ˆโ–ˆโ•”โ•โ•โ•โ–ˆโ–ˆโ•—โ–ˆโ–ˆโ–ˆโ–ˆโ•—  โ–ˆโ–ˆโ•‘
โ–ˆโ–ˆโ•‘  โ–ˆโ–ˆโ–ˆโ•—โ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ•”โ•โ–ˆโ–ˆโ•‘   โ–ˆโ–ˆโ•‘โ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ•—โ–ˆโ–ˆโ•‘โ–ˆโ–ˆโ•‘   โ–ˆโ–ˆโ•‘โ–ˆโ–ˆโ•”โ–ˆโ–ˆโ•— โ–ˆโ–ˆโ•‘
โ–ˆโ–ˆโ•‘   โ–ˆโ–ˆโ•‘โ–ˆโ–ˆโ•”โ•โ•โ•โ• โ–ˆโ–ˆโ•‘   โ–ˆโ–ˆโ•‘โ•šโ•โ•โ•โ•โ–ˆโ–ˆโ•‘โ–ˆโ–ˆโ•‘โ–ˆโ–ˆโ•‘   โ–ˆโ–ˆโ•‘โ–ˆโ–ˆโ•‘โ•šโ–ˆโ–ˆโ•—โ–ˆโ–ˆโ•‘
โ•šโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ•”โ•โ–ˆโ–ˆโ•‘     โ•šโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ•”โ•โ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ•‘โ–ˆโ–ˆโ•‘โ•šโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ•”โ•โ–ˆโ–ˆโ•‘ โ•šโ–ˆโ–ˆโ–ˆโ–ˆโ•‘
 โ•šโ•โ•โ•โ•โ•โ• โ•šโ•โ•      โ•šโ•โ•โ•โ•โ•โ• โ•šโ•โ•โ•โ•โ•โ•โ•โ•šโ•โ• โ•šโ•โ•โ•โ•โ•โ• โ•šโ•โ•  โ•šโ•โ•โ•โ•

The GPU Illusion โ€” AI Acceleration for Every Indian

Your laptop thinks it has a GPU now. It doesn't. That's the point.

License: MIT Platform: Windows Status: Early Development Made in India PRD


What is GPUsion?

GPUsion is an open-source Windows driver that tricks your CPU and operating system into believing a GPU is present โ€” then routes all AI inference workloads to a highly optimized CPU engine underneath.

Every AI app you install โ€” Ollama, Whisper, LM Studio โ€” sees a GPU in Device Manager. They run. You pay nothing extra.

No GPU required. No cloud subscription. No configuration hell.


The Problem We're Solving

India has 300 million+ laptops in the โ‚น30,000โ€“โ‚น60,000 range.
None of them have a discrete GPU.
Every serious local AI workload requires one.

What you want to run What it needs What it costs
Llama 3 8B locally 8GB VRAM โ‚น25,000+ GPU
Stable Diffusion RTX 3060 min โ‚น22,000+ GPU
Whisper (speech โ†’ text) GPU preferred 5ร— slower on CPU
Local coding assistant GPU for real-time โ‚น800โ€“2,000/mo cloud

The barrier isn't intelligence. It's access.
GPUsion removes the barrier.


How It Works

โ”Œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”
โ”‚  YOUR AI APP  (Ollama / Whisper / LM Studio / etc.) โ”‚
โ”‚         calls standard GPU APIs as normal           โ”‚
โ””โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ฌโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”˜
                       โ”‚  DirectML / Vulkan / OpenCL calls
                       โ–ผ
โ”Œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”
โ”‚            GPUSION VIRTUAL DRIVER                   โ”‚
โ”‚   Windows believes this is a real GPU adapter       โ”‚
โ”‚   "GPUsion Virtual Adapter โ€” 8GB VRAM" in Device   โ”‚
โ”‚   Manager. DXGI enumeration. WDDM 2.x compliant.   โ”‚
โ””โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ฌโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”˜
                       โ”‚  routed inference workloads
                       โ–ผ
โ”Œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”
โ”‚           INFERENCE ENGINE                          โ”‚
โ”‚   llama.cpp ยท ONNX Runtime ยท AVX2/AVX-512 SIMD     โ”‚
โ”‚   INT4/INT8 quantization ยท CPU RAM as VRAM proxy   โ”‚
โ””โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”˜

The illusion is at the driver level. The performance is real.


Quick Start

โš ๏ธ GPUsion is in early development. This is not yet ready for production use.
Star the repo and watch for our first release.

# Coming in Phase 1 โ€” Month 4
# One-click installer. No command line required.
# gpusion-setup.exe

For developers who want to build from source:

git clone https://github.com/gpusion/gpusion-driver
cd gpusion-driver

# Enable test signing mode (development only)
# Run as Administrator:
bcdedit /set testsigning on

# Build (requires Windows Driver Kit)
./build.ps1

# Install driver
./install.ps1

Repositories

Repo Description Status
gpusion-driver WDDM virtual GPU kernel driver โ€” the core illusion ๐Ÿ”จ Active
gpusion-inference CPU inference translation layer (llama.cpp / ONNX) ๐Ÿ”จ Active
gpusion-installer One-click Windows installer for non-technical users ๐Ÿ“‹ Planned
gpusion-firmware FPGA/ASIC firmware for Phase 2 hardware dongle ๐Ÿ“‹ Phase 2
gpusion-benchmark Standardized benchmarks vs. real GPU baselines ๐Ÿ“‹ Planned

Supported AI Frameworks (Phase 1 Target)

Framework Priority Status
Ollama Critical ๐Ÿ”จ In progress
llama.cpp Critical ๐Ÿ”จ In progress
ONNX Runtime DirectML High ๐Ÿ“‹ Planned
Whisper.cpp High ๐Ÿ“‹ Planned
AUTOMATIC1111 Phase 2 ๐Ÿ”ฎ Future

Roadmap

Month 1โ€“2   โ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–‘โ–‘โ–‘โ–‘โ–‘โ–‘โ–‘โ–‘โ–‘โ–‘  Driver skeleton โ€” WDDM adapter in Device Manager
Month 2โ€“3   โ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–‘โ–‘โ–‘โ–‘โ–‘โ–‘โ–‘โ–‘  Ollama integration โ€” first LLM runs end-to-end  
Month 3โ€“4   โ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–‘โ–‘โ–‘โ–‘โ–‘โ–‘  One-click installer โ€” zero config for non-technical users
Month 4โ€“5   โ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–‘โ–‘โ–‘โ–‘  PUBLIC LAUNCH โ€” GitHub + HackerNews + Reddit
Month 5โ€“6   โ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–‘โ–‘  Benchmarks published โ€” honest numbers vs. real GPU
Month 6โ€“9   โ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆ  PHASE 2 โ€” FPGA hardware dongle POC
Month 9โ€“12  โ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆ  Hardware v1 โ€” GPUsion Stick @ โ‚น2,499
Month 12โ€“24 โ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆ  PHASE 3 โ€” Custom ASIC, Made in India

Performance Expectations (Honest)

GPUsion Phase 1 (software) is not a GPU replacement. It is a GPU emulator that makes local AI possible on hardware that previously couldn't run it at all.

Model Real RTX 3060 GPUsion Phase 1 GPUsion Phase 2 (FPGA)
Llama 3 8B (INT4) ~50 tok/s ~6โ€“10 tok/s ~20โ€“30 tok/s (est.)
Whisper Base Real-time ~0.5ร— real-time ~0.9ร— real-time (est.)
Stable Diffusion ~15 s/img Not Phase 1 ~60 s/img (est.)

Slow is better than impossible.
Local is better than cloud.
Free is better than โ‚น2,000/month.


Why Open Source?

GPUsion's driver and inference layer are MIT licensed โ€” completely free, forever.

The software is the proof. The hardware dongle (Phase 2) is the product.

We believe:

  • Indian developers deserve infrastructure they can trust, audit, and improve
  • Open source creates the community moat that makes this impossible to kill
  • A working virtual GPU driver with real traction is the best pitch to any hardware partner

Contributing

GPUsion is built by one person and one AI.
It needs you to become something bigger.

Where help is most needed right now:

  • ๐Ÿ”ง Windows Driver Kit experience โ€” WDDM 2.x kernel driver development
  • ๐Ÿง  llama.cpp / ONNX Runtime โ€” inference optimization on CPU
  • ๐Ÿงช Testing โ€” Intel / AMD laptop compatibility across generations
  • ๐Ÿ“ Documentation โ€” Hindi + English setup guides for non-technical users
  • ๐ŸŒ Translations โ€” Hindi, Tamil, Telugu, Bengali READMEs

Bounties available:

  • โ‚น5,000 โ€” First working DirectML โ†’ CPU passthrough
  • โ‚น5,000 โ€” Successful Ollama model run via GPUsion driver
  • โ‚น2,500 โ€” Compatibility report for any new CPU/laptop model
  • โ‚น500 โ€” Documentation improvements

See CONTRIBUTING.md for details.


Technical Deep Dive

For the full architecture, phase specifications, risk analysis, and commercial strategy see the Product Requirements Document.

Key technical decisions:

Why WDDM and not a user-space shim?
Kernel-mode registration is required for apps that query the DirectX device list at startup. User-space intercepts miss these queries and apps refuse to run. WDDM is the only path to true OS-level illusion.

Why not just optimize CPU inference directly?
We do that too โ€” but without the virtual GPU layer, apps that check for GPU presence at startup simply refuse to launch. The illusion enables the inference, not the other way around.

Why INT4 quantization by default?
A 7B parameter model in FP16 needs ~14GB RAM. In INT4 it needs ~4GB. Most budget Indian laptops have 8GB RAM. INT4 is not a compromise โ€” it is the product.


The Name

GPU + Illusion = GPUsion

Inspired by Mohini โ€” the only female avatar of Vishnu, who created a perfect illusion to trick the Asuras from drinking Amrit.

GPUsion creates a perfect GPU illusion.
The Asuras are the hardware bottleneck.
The Amrit is local AI.


License

MIT License โ€” see LICENSE

The driver is free. The story is Indian. The mission is access.


Built in India ๐Ÿ‡ฎ๐Ÿ‡ณ for India โ€” and everyone else who deserves local AI

If this resonates, star the repo. That's how this story grows.

โญ Star ยท ๐Ÿด Fork ยท ๐Ÿ› Issue ยท ๐Ÿ’ฌ Discuss

About

Open-source virtual GPU driver for AI acceleration โ€” Made in India ๐Ÿ‡ฎ๐Ÿ‡ณ

Topics

Resources

License

Contributing

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors