Skip to content

Niteesh57/GoogleGraphLearning

Folders and files

NameName
Last commit message
Last commit date

Latest commit

ย 

History

7 Commits
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 

Repository files navigation

Gemini

๐Ÿฅฝ NeuroGraph LIVE

From Static Chatbots to Immersive Spatial Knowledge โ€” Powered by Gemini 2.5

Live Demo Backend API

Gemini Gemini Pro Cloud Run Vertex AI Docker React FastAPI MIT License


๐Ÿ“บ See It In Action

NeuroGraph LIVE transforms how students learn by converting fragmented textbook knowledge into interactive 3D mind maps with a real-time AI tutor that can see, hear, and teach โ€” all deployable in VR with just a Google Cardboard headset.

Manim Animation โ€” Live AI narrating Neural Network concepts

๐ŸŽฌ Manim Animation Engine โ€” AI generates & narrates custom educational videos in real-time

Loading Screen โ€” AI generating knowledge graph

โณ Intelligent Loading โ€” Hand gesture tips while the AI builds your graph


๐ŸŒŸ Why NeuroGraph LIVE?

The Problem

Many students suffer from "fragmented learning" โ€” collecting facts without understanding how concepts interconnect. Traditional chatbots provide linear answers, but real understanding needs spatial context.

Our Solution

A spatial knowledge navigator that creates a visual digital twin of knowledge, where:

๐Ÿ”— Connect the Dots ๐Ÿ—ฃ๏ธ Learn by Speaking ๐Ÿฅฝ Immerse Yourself
See literal links between "Neural Networks" โ†” "Gradient Descent" โ†” "Backpropagation" Talk naturally & show textbook pages to your AI tutor via camera Enter VR mode and navigate your knowledge graph with hand gestures

๐Ÿš€ Feature Showcase

1. ๐Ÿง  Interactive 3D Knowledge Graph

Concepts are semantically clustered using Vertex AI embeddings and projected into 2D/3D space using UMAP dimensionality reduction. Click any node to explore its connections.

Knowledge Graph โ€” Desktop View

Desktop View โ€” Graph with semantic clustering, text input, and file upload

Knowledge Graph โ€” Node Selection

Node Exploration โ€” Click to highlight connections with color-coded edges


2. ๐Ÿค– Multimodal Live AI Tutor (Gemini 2.5 Flash)

The core of NeuroGraph is a bidirectional streaming connection to Gemini's Multimodal Live API. The AI tutor can simultaneously:

  • ๐ŸŽค Hear you โ€” Real-time speech recognition via native audio streaming
  • ๐Ÿ‘๏ธ See you โ€” Camera frames sent as visual "heartbeats" every 2 seconds
  • ๐Ÿ“– Read your textbook โ€” Point your camera at a page and the AI extracts & maps concepts
  • ๐Ÿ—บ๏ธ Build your graph โ€” Proactively appends new nodes/edges to your existing knowledge map

Live AI Tutor Session
The AI tutor analyzes camera input, generates a Manim animation for "Neural Networks," and narrates it live.


3. ๐Ÿฅฝ VR & Mobile Immersion

VR Mode โ€” Stereoscopic View with Hand Tracking

๐Ÿ“ฑ VR Mode โ€” Stereoscopic split-screen with live hand gesture detection via camera

Mobile View โ€” Knowledge Graph

๐Ÿ“ฒ Mobile View โ€” Full responsive UI with touch interactions and concept generation

Expanded Mind Map

๐Ÿงฉ Deep Mind Maps โ€” Multi-level concept trees with AI-generated descriptions and history tracking

VR Features:

  • โœ… Google Cardboard compatible stereoscopic rendering
  • โœ… Hand gesture navigation (swipe through nodes without controllers)
  • โœ… Camera-based hand detection using the device's front camera
  • โœ… VR-optimized overlays for quizzes, videos, and mind maps

4. ๐ŸŽฌ Dynamic Manim Education Engine

Ask "Show me how this works" and the system:

  1. Gemini 2.5 Pro writes a custom Manim (Python) animation script
  2. The backend renders the animation to MP4 in real-time
  3. The video is streamed back and narrated live by the AI tutor
  4. Stored in Google Cloud Storage for future playback

5. ๐ŸŒ Multilingual Support

NeuroGraph's AI tutor powered by Gemini supports multilingual conversations:

  • ๐Ÿ—ฃ๏ธ Speak in any language โ€” the tutor understands and responds naturally
  • ๐Ÿ“ Generate concept graphs in your preferred language
  • ๐ŸŒ Break language barriers in education โ€” accessible worldwide

๐Ÿ—๏ธ System Architecture

NeuroGraph LIVE โ€” Full System Architecture on Google Cloud

๐Ÿ“‹ Architecture Breakdown (Click to expand)
Layer Technology Purpose
Edge Layer Google Cloud Load Balancer Traffic distribution & SSL termination
Compute Layer Cloud Run (Auto-scaling 0โ†’3) Serverless containers for frontend & backend
DevOps Layer Docker + Artifact Registry + gcloud CLI CI/CD pipeline & container management
AI Intelligence Gemini 2.5 Flash (Live API) Real-time bidirectional audio/vision streaming
AI Reasoning Gemini 2.5 Pro Complex Manim script generation
Vector Search Vertex AI text-embedding-004 Semantic embedding for concept clustering
Data Layer Google Cloud Storage + ChromaDB MP4 storage & vector database
Frontend React + Vite + D3.js Interactive 3D knowledge graph UI
Backend Python + FastAPI + WebSocket Multimodal Hub & Manim engine

๐Ÿ› ๏ธ Technology Stack & Google Cloud Integration

This project is built end-to-end on the Google Cloud AI ecosystem:

โ”Œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”
โ”‚  ๐Ÿง  AI MODELS                                               โ”‚
โ”‚  โ”œโ”€ Gemini 2.5 Flash (Multimodal Live API)                  โ”‚
โ”‚  โ”‚   โ””โ”€ Real-time bidi audio + vision streaming             โ”‚
โ”‚  โ”œโ”€ Gemini 2.5 Pro                                          โ”‚
โ”‚  โ”‚   โ””โ”€ High-reasoning Manim animation generation           โ”‚
โ”‚  โ””โ”€ text-embedding-004 (Vertex AI)                          โ”‚
โ”‚      โ””โ”€ Semantic vector embeddings for concept clustering   โ”‚
โ”œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ค
โ”‚  โ˜๏ธ INFRASTRUCTURE                                           โ”‚
โ”‚  โ”œโ”€ Cloud Run          โ†’ Serverless containers (frontend +  โ”‚
โ”‚  โ”‚                       backend with auto-scaling)         โ”‚
โ”‚  โ”œโ”€ Artifact Registry  โ†’ Docker image management            โ”‚
โ”‚  โ”œโ”€ Cloud Storage      โ†’ Persistent MP4 video storage       โ”‚
โ”‚  โ”œโ”€ Secret Manager     โ†’ Secure API key management          โ”‚
โ”‚  โ””โ”€ Cloud Build        โ†’ Automated container builds         โ”‚
โ”œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ค
โ”‚  ๐Ÿ–ฅ๏ธ APPLICATION                                              โ”‚
โ”‚  โ”œโ”€ Frontend: React + Vite + D3.js + WebRTC                 โ”‚
โ”‚  โ”œโ”€ Backend:  Python + FastAPI + WebSocket + Manim          โ”‚
โ”‚  โ””โ”€ Database: ChromaDB (Vector Store)                       โ”‚
โ””โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”˜

โ˜๏ธ GCP Deployment โ€” Live & Verified

Both frontend and backend are live on Google Cloud Run with full observability. Here is proof of our production deployment:

Cloud Run Services Dashboard

Cloud Run Console โ€” Services Overview

Both frontend and backend services are healthy and running in us-central1

Backend Service โ€” Logs & Metrics

Backend Cloud Run Logs

WebSocket connections to Gemini Live successfully established

Backend Cloud Run Metrics

Request count, latency, and end-to-end performance metrics

Frontend Service โ€” Logs & Metrics

Frontend Cloud Run Logs

Nginx serving the React SPA with proper routing

Frontend Cloud Run Metrics

Sub-100ms latency serving static assets via Cloud Run CDN

Gemini API Integration

Vertex AI API Keys

Vertex AI API key configured for Gemini 2.5 Flash & Pro models


๐Ÿ“ˆ Key Findings & Learnings

# Insight Impact
1 Sub-second visual-to-audio latency with Gemini 2.5 Flash Live API Makes the AI tutor feel human โ€” critical for engagement
2 Visual heartbeats (2-sec camera snapshots) build context proactively AI suggests learning paths without the user asking "What is this?"
3 Spatial mapping via UMAP of embeddings reduces cognitive load Learners visualize conceptual "distance" โ€” solving the "where was I?" problem
4 Manim + Gemini Pro combo enables on-the-fly educational animations No pre-rendered content needed โ€” every explanation is unique
5 Hand gesture detection via camera makes VR accessible No expensive controllers โ€” just a phone and Google Cardboard

๐ŸŽค Try These Live Prompts

Experience the full depth of NeuroGraph LIVE with these multimodal prompts:

1. ๐Ÿ“– Proactive Knowledge Mapping

"I'm looking at this page about Gradient Descent. Can you read this and add it to our map?"

What happens: The AI Tutor analyzes your camera feed, extracts key concepts, and calls create_mind_map. It explains the new nodes and their connections to your existing graph.

2. ๐Ÿ” Deep-Dive Mind Map Exploration

"Can you explain this mind map in full depth? Show me how 'Backpropagation' connects to 'Neural Networks' and why it matters."

What happens: The AI Tutor performs a semantic deep-dive, describing relationships and links qualitatively, helping you synthesize the entire topic.

3. ๐ŸŽฌ On-Demand Visual Learning

"This concept is abstract. Can you generate an animation explaining how the weights are updated?"

What happens: The AI triggers the Manim Engine via generate_video. You receive a custom educational animation with live narration from your tutor.

4. ๐ŸŒ Multilingual Learning

"ยฟPuedes explicarme las redes neuronales en espaรฑol?" / "เฆจเฆฟเฆ‰เฆฐเฆพเฆฒ เฆจเง‡เฆŸเฆ“เฆฏเฆผเฆพเฆฐเงเฆ• เฆธเฆฎเงเฆชเฆฐเงเฆ•เง‡ เฆฌเฆพเฆ‚เฆฒเฆพเฆฏเฆผ เฆฌเฆฒเง‹"

What happens: The AI responds fluently in the requested language, generating concept maps with localized labels.


๐Ÿ—๏ธ Reproducibility: Spin-Up Instructions

Prerequisites

  • Python 3.12+, Node.js 20+, FFmpeg (for Manim)
  • A Google Cloud project with Gemini API access

1. Clone & Setup Backend

git clone https://github.com/your-repo/neurograph-live.git
cd neurograph-live

# Backend
cd app
pip install -r requirements.txt
# Create .env with your GEMINI_API_KEY
echo "GEMINI_API_KEY=your_key_here" > .env
uvicorn main:app --reload

2. Setup Frontend

cd frontend
npm install
npm run dev

3. ๐Ÿš€ One-Click GCP Deployment

We provide a fully automated deployment script that handles everything:

chmod +x deploy-gcp.sh
./deploy-gcp.sh

The script automatically:

  1. โœ… Enables required Google APIs (Cloud Run, Cloud Build, Artifact Registry)
  2. โœ… Creates an Artifact Registry Docker repository
  3. โœ… Builds & deploys the Backend to Cloud Run
  4. โœ… Injects the Backend URL into VITE_API_URL
  5. โœ… Builds & deploys the Frontend to Cloud Run
  6. โœ… Returns your live Frontend URL ๐ŸŽ‰

๐Ÿ“ Project Structure

neurograph-live/
โ”œโ”€โ”€ app/                          # ๐Ÿ Python Backend (FastAPI)
โ”‚   โ”œโ”€โ”€ main.py                   #   WebSocket server & API routes
โ”‚   โ”œโ”€โ”€ gemini_live_agent.py      #   Gemini 2.5 Flash Live integration
โ”‚   โ”œโ”€โ”€ manim_generator.py        #   Dynamic Manim animation engine
โ”‚   โ”œโ”€โ”€ requirements.txt          #   Python dependencies
โ”‚   โ”œโ”€โ”€ Dockerfile                #   Backend container
โ”‚   โ””โ”€โ”€ .gcloudignore
โ”œโ”€โ”€ frontend/                     # โš›๏ธ React Frontend (Vite)
โ”‚   โ”œโ”€โ”€ src/
โ”‚   โ”‚   โ”œโ”€โ”€ App.jsx               #   Main application
โ”‚   โ”‚   โ”œโ”€โ”€ components/           #   UI components
โ”‚   โ”‚   โ””โ”€โ”€ index.css             #   Styles
โ”‚   โ”œโ”€โ”€ Dockerfile                #   Frontend container
โ”‚   โ””โ”€โ”€ package.json
โ”œโ”€โ”€ Img/                          # ๐Ÿ–ผ๏ธ Screenshots & assets
โ”œโ”€โ”€ GCP/                          # โ˜๏ธ GCP deployment proof
โ”œโ”€โ”€ deploy-gcp.sh                 # ๐Ÿš€ One-click deployment script
โ”œโ”€โ”€ docker-compose.yml            # ๐Ÿณ Local multi-service setup
โ””โ”€โ”€ LICENSE                       # MIT License

๐Ÿ“œ License

This project is licensed under the MIT License. See the LICENSE file for details.


Gemini
NeuroGraph LIVE
Moving education from static chat to immersive spatial exploration.

Built with โค๏ธ using Gemini 2.5 & Google Cloud
for the Gemini Live Agent Challenge

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors