ApplianceIQ: AI-Powered Smart Manual Management

Interactive Manual Assistant leveraging Retrieval-Augmented Generation (RAG)

Overview

ApplianceIQ is a full-stack web application that transforms static appliance manuals into interactive, AI-powered knowledge bases. Instead of reading through lengthy PDF manuals to find a specific instruction, users can simply ask a question in plain English and receive a precise, context-aware answer sourced directly from the relevant manual.

The system is built on the concept of Retrieval-Augmented Generation (RAG). When a user asks a question, the platform first retrieves the most relevant sections from the uploaded manual using vector similarity search (powered by Pinecone), and then passes those sections to a Large Language Model (Groq/Gemini) to generate a clear, accurate, and conversational response. This two-step approach ensures that answers are always grounded in the actual manual content, minimizing hallucination and maximizing reliability.

ApplianceIQ supports multiple user roles, including administrators, business owners, and end users, each with their own tailored dashboard and permissions. Business owners can upload product manuals, track usage analytics, and generate QR codes that link directly to an interactive chat interface for their appliances.

Key Features

RAG-Powered Chat: Ask questions about any uploaded manual in natural language. The system retrieves the most relevant passages and generates accurate, context-aware answers using a Large Language Model.
Smart Document Processing: Automatic text extraction from PDFs and images (JPG, PNG) using OCR. Extracted content is chunked, embedded into vectors, and stored for fast retrieval.
Multi-Tenant Role-Based Dashboards:
- Admin Dashboard: Full system oversight including user management, manual management across all business owners, and platform-wide analytics.
- Business Owner Dashboard: Upload and manage manuals, view per-manual usage statistics, generate and download QR codes, and monitor customer interaction analytics.
QR Code Integration: Each uploaded manual automatically gets a unique QR code. Customers can scan the QR sticker on their physical appliance to instantly open the interactive chat assistant for that specific product, without needing to log in.
Conversation History: Chat sessions are persisted in the database. When a user returns, the system automatically injects previous conversation context for continuity.
Rate Limiting: Built-in rate limiter to prevent abuse of the chat and query endpoints.
Cloud-Native Storage: All manual files and QR code images are stored on Cloudinary, ensuring reliable access and delivery.

Technology Stack

Layer	Technology	Purpose
Frontend	React, Tailwind CSS	User interface and role-based dashboards
Backend API	FastAPI (Python)	REST API, authentication, orchestration
Chat Service	FastAPI (Python)	RAG engine, vector search, LLM integration
Ingestion Service	FastAPI (Python)	PDF/image processing, text extraction, embedding
Database	MongoDB (Motor async driver)	User profiles, manual metadata, chat history
Vector Store	Pinecone	Semantic search over document embeddings
File Storage	Cloudinary	Manual PDFs, images, and QR code hosting
LLM Provider	Groq (Llama-3) / Google Gemini	Natural language answer generation
Authentication	Session-based (bcrypt + secure tokens)	User login, signup, and role enforcement

Architecture

ApplianceIQ follows a microservices-inspired architecture with three independently deployable services:

                    +---------------------+
                    |     React Frontend  |
                    |  (Tailwind CSS UI)  |
                    +----------+----------+
                               |
                               | HTTP REST
                               v
                    +----------+----------+
                    |   Backend API       |
                    |   (FastAPI :8000)   |
                    |                     |
                    |  - Authentication   |
                    |  - Manual CRUD      |
                    |  - QR Code Gen      |
                    |  - Chat Proxy       |
                    |  - Rate Limiting    |
                    +----+----------+-----+
                         |          |
              +----------+          +----------+
              v                                v
   +----------+----------+         +-----------+-----------+
   |  Chat Service        |         |  Ingestion Service    |
   |  (FastAPI :8001)     |         |  (FastAPI :8002)      |
   |                      |         |                       |
   |  - RAG Engine        |         |  - PDF Text Extraction|
   |  - Vector Search     |         |  - OCR Processing     |
   |  - LLM Generation    |         |  - Text Chunking      |
   |  - Image Analysis    |         |  - Vector Embedding   |
   +----+----------+------+         +----+----------+-------+
        |          |                      |          |
        v          v                      v          v
   +----+---+  +---+------+         +----+---+  +---+------+
   |Pinecone|  |Groq/Gemini|        |Pinecone|  |Cloudinary|
   |        |  |(LLM API)  |        |        |  |          |
   +--------+  +-----------+        +--------+  +----------+
                    
                    +--------+
                    |MongoDB |
                    |(Atlas) |
                    +--------+

Application Workflow

The following sections describe how the major use cases flow through the system end to end.

1. User Registration and Authentication

A new user navigates to the Signup page and submits their name, email, password, and role (Admin or Business Owner).
The frontend sends a POST request to /api/auth/signup.
The backend hashes the password using bcrypt, creates a user document in MongoDB, and generates a unique session token.
The session token is returned to the frontend, which stores it and uses it in the Authorization header for all subsequent requests.
On future visits, the frontend calls /api/auth/me with the stored token to verify the session and retrieve user details.
Role-based access control (RBAC) is enforced at the API layer. Admin-only and business-owner-only endpoints reject unauthorized users with a 403 error.

2. Manual Upload and Processing

A business owner navigates to the Manual Upload page and selects a PDF or image file, along with metadata (appliance model name, version, region).
The frontend sends a multipart POST request to /api/manuals/upload.
The backend reads the file content and uploads it to Cloudinary, obtaining a public URL.
A manual record is created in MongoDB with status set to "processing".
The backend forwards the file URL and metadata to the Ingestion Service (/ingest endpoint).
The Ingestion Service downloads the file from Cloudinary, extracts text (using PDF parsing or OCR for images), splits the text into chunks, generates vector embeddings for each chunk, and upserts them into the Pinecone vector index.
Once processing completes, the manual status is updated to "ready".
A QR code is automatically generated for the manual, linking to the chat interface with the manual ID embedded. The QR code image is uploaded to Cloudinary and the record is stored in MongoDB.

3. Customer Query (RAG Chat)

A customer accesses the chat interface, either by logging in and selecting a manual, or by scanning a QR code on their physical appliance.
The customer types a question (e.g., "How do I clean the filter?").
The frontend sends a POST request to /api/chat with the manual ID and question.
The backend retrieves the manual record from MongoDB to get the model name, then fetches conversation history from the database (last 3 exchanges) for context continuity.
The backend forwards the query, history, and manual metadata to the Chat Service (/query endpoint).
The Chat Service generates a vector embedding of the question, searches Pinecone for the top-k most similar document chunks from that specific manual, and constructs a prompt combining the retrieved context with the user's question.
The prompt is sent to the LLM provider (Groq Llama-3 or Google Gemini), which generates a natural language answer.
The answer, along with source references and a confidence score, is returned to the backend.
The backend logs the query and response in MongoDB for analytics and future context, then streams the response back to the frontend.
The frontend displays the answer with source citations and confidence indicators.

4. QR Code Scanning (Guest Access)

A customer scans the QR code sticker on their physical appliance using their phone camera.
The QR code URL directs them to the ApplianceIQ chat interface with the manual ID pre-loaded.
No login is required. The backend allows unauthenticated access to the chat endpoint when a valid manual ID is provided.
The customer can immediately start asking questions about their appliance.

5. Admin and Analytics Workflow

An admin logs in and is directed to the Admin Dashboard.
The dashboard displays platform-wide statistics: total users, total manuals, total queries, and active sessions.
The admin can view and manage all users, assign QR codes to specific manuals, and delete manuals or users.
Business owners see their own dashboard with per-manual analytics: number of queries, average confidence scores, and upload history.
The Analytics page provides visual charts and breakdowns of usage patterns over time.

Project Structure

applianceiq/
|-- backend/                   # Backend API Gateway (FastAPI)
|   |-- server.py              # Main API entry point with all route definitions
|   |-- auth.py                # Authentication logic (signup, login, session management)
|   |-- models.py              # Pydantic models and database schemas
|   |-- ml_client.py           # HTTP client for communicating with ML services
|   |-- qr_handler.py          # QR code generation and management
|   |-- .env                   # Environment variables (MongoDB, Cloudinary, API keys)
|   +-- requirements.txt       # Python dependencies for the backend
|
|-- chat_service/              # Chat and RAG Service (FastAPI)
|   |-- server.py              # Chat service API endpoints
|   |-- rag_engine.py          # Core RAG logic (retrieval, prompt construction, LLM calls)
|   |-- config.py              # Service configuration and environment loading
|   |-- model_manager.py       # LLM model initialization and management
|   |-- errors.py              # Custom error types and error handling
|   |-- logger_config.py       # Structured logging configuration
|   +-- requirements.txt       # Python dependencies for the chat service
|
|-- ingestion_service/         # Document Ingestion Service (FastAPI)
|   |-- server.py              # Ingestion service API endpoints
|   |-- processor.py           # PDF/image text extraction, chunking, and embedding
|   |-- config.py              # Service configuration
|   |-- model_manager.py       # Embedding model management
|   |-- errors.py              # Custom error types
|   |-- logger_config.py       # Logging configuration
|   +-- requirements.txt       # Python dependencies for the ingestion service
|
|-- frontend/                  # React Frontend Application
|   |-- public/                # Static assets and index.html
|   +-- src/
|       |-- pages/
|       |   |-- Landing.js             # Public landing page
|       |   |-- Login.js               # Login page
|       |   |-- Signup.js              # Registration page
|       |   |-- AdminDashboard.js      # Admin management dashboard
|       |   |-- BusinessOwnerDashboard.js  # Business owner dashboard
|       |   |-- ManualUpload.js        # Manual upload interface
|       |   |-- ChatBot.js            # Interactive chat interface
|       |   |-- Analytics.js           # Usage analytics and charts
|       |   +-- Dashboard.js           # General dashboard view
|       |-- components/
|       |   |-- ui/
|       |   |   |-- Navbar.jsx         # Navigation bar
|       |   |   +-- stat-card.jsx      # Statistics display card
|       |   +-- LiveCameraOverlay.js   # Live camera analysis overlay
|       |-- App.css                    # Application-level styles
|       |-- index.css                  # Global styles and design tokens
|       +-- tailwind.config.js         # Tailwind CSS configuration
|
|-- assets/                    # Logo and branding assets
|-- docs/                      # Additional documentation
|-- tests/                     # Test suite
|-- ARCHITECTURE.md            # High-level architecture documentation
|-- API_REFERENCE.md           # API endpoint reference
|-- QUICKSTART.md              # Quick start guide
|-- CONTRIBUTING.md            # Contribution guidelines
|-- SECURITY.md                # Security policy
|-- ROADMAP.md                 # Project roadmap
|-- requirements.txt           # Root-level Python dependencies
+-- run_all.bat                # Script to start all services

Getting Started

Prerequisites

Python 3.9 or higher
Node.js 16 or higher
MongoDB Atlas account (or local MongoDB instance)
Cloudinary account (for file and QR code storage)
Pinecone account (for vector storage)
Groq API key (for Llama-3 inference) or Google Gemini API key

Installation

Clone the repository:

git clone https://github.com/tiirth22/Appliance_IQ.git
cd Appliance_IQ

Set up the Backend:

cd backend
pip install -r requirements.txt

Create a .env file in the backend/ directory:

MONGO_URL=your_mongodb_atlas_connection_string
DB_NAME=applianceiq_db
PINECONE_API_KEY=your_pinecone_api_key
GROQ_API_KEY=your_groq_api_key
CLOUDINARY_CLOUD_NAME=your_cloud_name
CLOUDINARY_API_KEY=your_cloudinary_api_key
CLOUDINARY_API_SECRET=your_cloudinary_api_secret
ML_SERVICE_URL=http://localhost:8001
INGESTION_SERVICE_URL=http://localhost:8002

Set up the Chat Service:

cd chat_service
pip install -r requirements.txt

Set up the Ingestion Service:

cd ingestion_service
pip install -r requirements.txt

Set up the Frontend:
```
cd frontend
npm install
```

Running the Application

Start each service in a separate terminal:

Backend API (Port 8000):
```
cd backend
python server.py
```
Chat Service (Port 8001):
```
cd chat_service
python server.py
```
Ingestion Service (Port 8002):
```
cd ingestion_service
python server.py
```
Frontend (Port 3000):
```
cd frontend
npm start
```

Alternatively, use the provided batch script to start all services at once:

run_all.bat

Contributing

Contributions are welcome. Please fork the repository and open a Pull Request for any enhancements or bug fixes. See CONTRIBUTING.md for detailed guidelines.

License

Distributed under the MIT License. See LICENSE for more information.

Made by Tirth J Dalal

Contact: tirth2216@gmail.com | +91 8733038793

Name		Name	Last commit message	Last commit date
Latest commit History 265 Commits
.agent/workflows		.agent/workflows
.emergent		.emergent
.github		.github
assets		assets
backend		backend
chat_service		chat_service
docs		docs
frontend		frontend
ingestion_service		ingestion_service
tests		tests
.dockerignore		.dockerignore
.gitignore		.gitignore
API_REFERENCE.md		API_REFERENCE.md
ARCHITECTURE.md		ARCHITECTURE.md
CONTRIBUTING.md		CONTRIBUTING.md
QUICKSTART.md		QUICKSTART.md
README.md		README.md
ROADMAP.md		ROADMAP.md
SECURITY.md		SECURITY.md
postman_collection.json		postman_collection.json
pytest.ini		pytest.ini
requirements.txt		requirements.txt
run_all.bat		run_all.bat
run_local.bat		run_local.bat
run_tests.ps1		run_tests.ps1
run_tests.sh		run_tests.sh
test_gemini.py		test_gemini.py
test_login_endpoint.py		test_login_endpoint.py
test_query.py		test_query.py
test_signup_api.py		test_signup_api.py
yarn.lock		yarn.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ApplianceIQ: AI-Powered Smart Manual Management

Interactive Manual Assistant leveraging Retrieval-Augmented Generation (RAG)

Overview

Key Features

Technology Stack

Architecture

Application Workflow

1. User Registration and Authentication

2. Manual Upload and Processing

3. Customer Query (RAG Chat)

4. QR Code Scanning (Guest Access)

5. Admin and Analytics Workflow

Project Structure

Getting Started

Prerequisites

Installation

Running the Application

Contributing

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

ApplianceIQ: AI-Powered Smart Manual Management

Interactive Manual Assistant leveraging Retrieval-Augmented Generation (RAG)

Overview

Key Features

Technology Stack

Architecture

Application Workflow

1. User Registration and Authentication

2. Manual Upload and Processing

3. Customer Query (RAG Chat)

4. QR Code Scanning (Guest Access)

5. Admin and Analytics Workflow

Project Structure

Getting Started

Prerequisites

Installation

Running the Application

Contributing

License

About

Resources

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages