Train Models Contrastively in Pytorch
Updated Mar 26, 2025 - Python
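Contrastive training of the kind the repository above targets is often implemented as a symmetric InfoNCE loss over a batch of paired embeddings. The sketch below is a minimal, framework-idiomatic illustration; `info_nce_loss`, the batch size, and the embedding dimension are assumptions for the example, not the repository's actual API:

```python
# Minimal sketch of a symmetric InfoNCE contrastive loss in PyTorch.
import torch
import torch.nn.functional as F

def info_nce_loss(anchors, positives, temperature=0.07):
    """Each anchor's positive is the same-index row; all other rows
    in the batch serve as in-batch negatives."""
    a = F.normalize(anchors, dim=-1)
    p = F.normalize(positives, dim=-1)
    logits = a @ p.T / temperature        # (B, B) cosine-similarity matrix
    targets = torch.arange(a.size(0))     # diagonal entries are the matches
    # Symmetrize: anchor->positive and positive->anchor directions.
    return (F.cross_entropy(logits, targets)
            + F.cross_entropy(logits.T, targets)) / 2

# Toy usage with random embeddings and slightly perturbed positives.
anchors = torch.randn(8, 32)
positives = anchors + 0.01 * torch.randn(8, 32)
loss = info_nce_loss(anchors, positives)
```

Minimizing this loss pulls matching pairs together and pushes non-matching batch rows apart, which is the basic mechanism behind most embedding models used elsewhere in this list.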
Radient turns many data types (not just text) into vectors for similarity search, RAG, regression analysis, and more.
Think-on-Graph 3.0: Efficient and Adaptive LLM Reasoning on Heterogeneous Graphs via Multi-Agent Dual-Evolving Context Retrieval
A sample app for the Multimodal Retrieval-Augmented Generation pattern running in Azure, using Azure AI Search for retrieval and Azure OpenAI large language models to power Q&A experiences.
Production inference for encoder models - ColBERT, GLiNER, ColPali, embeddings etc. - as vLLM plugins for online and in-process deployment
[NAACL 2024] Official Implementation of paper "Self-Adaptive Sampling for Efficient Video Question Answering on Image-Text Models"
High-performance late-interaction retrieval engine for on-prem AI. ColBERT/ColPali multi-vector search with Rust fused MaxSim, Triton GPU kernels, ROQ quantization, LEMUR routing, WAL-backed CRUD, and a FastAPI server — single machine, CPU or GPU.
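The MaxSim operation at the heart of late-interaction retrievers like ColBERT and ColPali can be sketched in a few lines: each query token takes its maximum similarity against all document tokens, and the per-token maxima are summed. The helper below is illustrative (plain Python on pre-normalized per-token embeddings), not the engine's fused Rust/Triton implementation:

```python
# Minimal sketch of ColBERT-style late-interaction (MaxSim) scoring,
# assuming unit-normalized per-token embeddings as lists of floats.
def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def maxsim_score(query_tokens, doc_tokens):
    """Sum over query tokens of the max similarity to any doc token."""
    return sum(max(dot(q, d) for d in doc_tokens) for q in query_tokens)

# Toy example: a document covering both query tokens outscores one
# pointing the opposite way in embedding space.
query = [[1.0, 0.0], [0.0, 1.0]]
doc_good = [[1.0, 0.0], [0.0, 1.0], [0.7, 0.7]]
doc_bad = [[-1.0, 0.0], [0.0, -1.0]]
good = maxsim_score(query, doc_good)
bad = maxsim_score(query, doc_bad)
```

Because each query token matches independently, MaxSim preserves fine-grained token-level evidence that a single pooled vector would average away, at the cost of storing one vector per token.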
🧠 Multimodal Retrieval-Augmented Generation that "weaves" together text and images seamlessly. 🪡
🚀 HAG: Next-Gen AI | Neo4j + Weaviate Fusion | Dual-Similarity Retrieval | 100% Local & Private | Graph Intelligence Meets Vector Search
🔰 A Comprehensive RAG repository covering basic vanilla RAG techniques, advanced retrieval methods, hybrid search fusion approaches, hands-on reranking techniques with code + explanation 📚✨
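One widely used hybrid-search fusion technique of the kind the repository above covers is reciprocal rank fusion (RRF), which merges ranked lists from different retrievers (e.g. BM25 and dense vectors) using only ranks. The sketch below is illustrative; `rrf_fuse` and the document ids are made up for the example, and `k=60` is the smoothing constant commonly used in practice:

```python
# Minimal sketch of reciprocal rank fusion (RRF) for hybrid retrieval.
def rrf_fuse(rankings, k=60):
    """Merge several ranked lists of doc ids; each appearance at rank r
    contributes 1 / (k + r) to that document's fused score."""
    scores = {}
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] = scores.get(doc_id, 0.0) + 1.0 / (k + rank)
    return sorted(scores, key=scores.get, reverse=True)

# Toy example: fuse a lexical (BM25) ranking with a dense-vector ranking.
bm25 = ["d1", "d2", "d3"]
dense = ["d3", "d1", "d4"]
fused = rrf_fuse([bm25, dense])
```

RRF needs no score calibration between retrievers, which is why it is a common first choice before moving to learned rerankers.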
OpenAI-compatible multimodal embedding server for Qwen3-VL-Embedding-2B — embed text, images, or both via a simple REST API.
Self-adaptive Planning Agent: multimodal retrieval-augmented generation built on a self-adaptive planning agent.
Anaya is a Content Engine that specializes in analyzing and comparing multiple PDF documents. It uses Retrieval Augmented Generation (RAG) techniques to effectively retrieve, assess, and generate insights from the documents.
📄 Multimodal RAG pipeline combining ColPALI visual retrieval, YOLO-DocLayNet layout detection, sentence embedding-based text retrieval, and LLaMA-4 completion for document question answering.
Repository for team Devs
LumiCite is a multimodal RAG system for academic papers, designed for multimodal evidence retrieval and citation-aware question answering.
Multimodal RAG and comparisons between language models. (Project for Deep Learning Module at the FHSWF)
Local-first multimodal Graph RAG with Qwen3-VL embeddings, Neo4j vector search, and a lightweight visual retrieval console.
A comprehensive Multimodal Retrieval-Augmented Generation (RAG) application that combines FastAPI backend with Streamlit frontend, supporting multiple AI models, advanced OCR capabilities, and intelligent document processing.
A doctor-assistive AI system that interprets medical knowledge and patient images simultaneously. It utilizes a Dual-Encoder architecture to cross-reference textbook theory with visual pathology, generating clinically grounded diagnoses.