#

arabic-ocr

Here are 24 public repositories matching this topic...

craneset / ocr-data

In this repository, OCR-related datasets are available.

ocr datasets optical-character-recognition arabic ocr-arabic ocr-dataset arabic-ocr arabic-ocr-dataset

Updated Jun 17, 2026

MohammedNasserAhmed / arabic-pdf-chat

Arabic Chat with PDF is a user-friendly application that lets you interact with Arabic PDF documents. Powered by advanced language models, OCR, and vector search, it allows you to upload PDFs, ask questions, and receive accurate Arabic responses 🚀

ocr chat-application ocr-text-reader ocr-python rag rag-chatbot arabic-ocr

Updated Nov 20, 2024
Python

OmarSamirz / Fine-Tuning-an-Arabic-OCR-Model-using-Tesseract-5.0

This research aims to fine-tune an Arabic OCR model using Tesseract 5.0, enhancing text recognition accuracy through extensive data collection, preprocessing, and image generation. By leveraging advanced training techniques and data augmentation, we achieve significant improvements in word error rates (WER).

ocr tesseract tesseract-ocr ocr-model arabic-ocr arabic-ocr-model arabic-tesseract-ocr fine-tune-arabic-model fine-tune-arabic-tesseract-ocr-model fine-tune-arabic-ocr-model fine-tune-ocr

Updated Apr 4, 2025
Jupyter Notebook

OussamaBenSlama / Alef-OCR-Image2Html

Alef-OCR-Image2Html, an OCR model designed to transform Arabic documents including historical texts, scanned pages, and handwritten materials into structured and semantic HTML.

ocr ocr-recognition arabic-ocr arabic-ocr-model

Updated Nov 4, 2025
Jupyter Notebook

HasanBGit / Ketaba-OCR-LoRA

Official code for "Ketaba-OCR at AR-MS NakbaNLP 2026" — QLoRA fine-tuning of a specialized HTR model with Linear+Boost ensemble for Arabic manuscript recognition. 1st place per-line (CER 0.082) and 3rd place official leaderboard at NakbaNLP 2026 (LREC 2026).

ensemble dora handwritten-text-recognition peft shared-task vision-language-model qlora arabic-ocr qwen2-vl lrec2026 arabic-manuscripts nakbanlp

Updated Mar 5, 2026
Python

amolood / sudan-alpr-ai

Automatic license-plate recognition for Sudanese plates — YOLO detector + fine-tuned OCR, with a reproducible benchmark.

ocr deep-learning yolo alpr sudan onnx license-plate-recognition arabic-ocr fast-alpr

Updated Jun 19, 2026
Python

PRADUMAN-KR / OCR_model-HugginFace

Optical Character Recognition, OCR pipeline, Arabic OCR, Deep Learning OCR, Computer Vision text extraction, Text recognition system, AI document processing, Multilingual OCR, Transformer OCR, OCR benchmarking, Bounding box detection, Ground truth evaluation.

opencv paddlepaddle paddleocr hugginface arabic-ocr ai-document-processing ocr-pipeline deep-learning-ocr computer-vision-text-extraction paddleocr-v5

Updated May 20, 2026
Python

HasanBGit / QARI-OCR-LoRA

Additional experimental model for NakbaNLP 2026 Shared Task (AR-MS) — LoRA/DoRA fine-tuning of Qari-OCR (Qwen2-VL-2B) for Arabic handwritten manuscript recognition on the Omar Al-Saleh Memoir Collection (1951-1965).

lora dora handwritten-text-recognition peft shared-task vision-language-model arabic-ocr qwen2-vl lrec2026 arabic-manuscripts nakbanlp

Updated Mar 5, 2026
Python

logiccrafterdz / nassij

Nassij V3: High-accuracy Arabic PDF-to-DOCX converter with direct digital extraction (NassijScanner) and cryptographic linguistic integrity verification (Merkle proofs).

python ocr document-conversion offline-first arabic-language pdf-processing word-document privacy-focused paddleocr tashkeel rtl-support pdf-to-docx arabic-ocr ligature-handling data-digitization

Updated May 17, 2026
Python

Abd-alrhman1 / multilingual-ocr-toolkit

Multilingual OCR with per-region script routing for Arabic + Latin. Built for MENA documents.

multilingual ocr computer-vision tesseract text-detection arabic-nlp mena streamlit easyocr script-detection arabic-ocr

Updated May 7, 2026
Python

lAvArt / arabic-book-corpus-platform

OCR-first Arabic book corpus platform with citation-grade APIs

ocr nextjs postgresql minio full-text-search computational-linguistics digital-humanities arabic text-corpus fastify arabic-language lexicography corpus-search bullmq document-ai arabic-ocr citation-search

Updated Feb 21, 2026
TypeScript

mohamedkhamis / AQMAR

Local Python pipeline + bilingual SPA archiving the @AqmarTofan Telegram channel — Telethon, ffmpeg, EasyOCR (Arabic+English), openpyxl, Alpine.js.

python github-pages spa ocr telegram ffmpeg arabic openpyxl telethon tailwindcss telegram-scraper alpine-js easyocr arabic-ocr aqmar-tofan

Updated Jun 22, 2026
Python

DrAbdulmalek / medical-handwriting-ocr

Flagship Demo — Medical Handwriting OCR powered by Omni Medical Suite

demo ocr deployment medical gradio handwriting huggingface arabic-ocr

Updated Jun 29, 2026
Python

zenmakhlouf / arabic-bill-field-extractor

Local Arabic OCR field extraction for utility bills with PaddleOCR, FastAPI, CLI, and validation.

ocr computer-vision fastapi paddleocr document-ai utility-bills arabic-ocr

Updated Apr 25, 2026
Python

wiameadnane / arabic-handwriting-ocr

A deep learning-based handwritten Arabic OCR system using ResNet50 + BiLSTM + Attention with CTC decoding. Achieves 96.3% character accuracy and 80% word accuracy on the IFN/ENIT dataset, featuring a PyQt6 desktop GUI for real-time inference. Supports both greedy and beam search decoding.

computer-vision deep-learning lstm attention-mechanism resnet-50 ctc-loss arabic-ocr

Updated Apr 25, 2026
Jupyter Notebook

FixFips / ArabicOCR_KHATT

Arabic handwritten text recognition using a CRNN (CNN + BiLSTM) with CTC loss, trained on the KHATT dataset — includes a Gradio web demo for OCR on your own images.

ocr pytorch gradio handwritten-text-recognition ctc-loss crnn khatt arabic-ocr

Updated Apr 24, 2026
Python

MedoHamdani / ArabicOCR

In this repo, we will list down all tools that we know about Arabic OCR text extraction

ocr arabic arabic-ocr

Updated Jun 27, 2026

AlQari-ai / alqari

Arabic-first Document Intelligence API platform for OCR, Arabic handwriting recognition, document extraction, validation, search, chat with documents, and structured JSON output.

handwriting-recognition document-extraction document-processing-pipeline arabic-ocr arabic-ocr-model document-intelligence-ai-platform

Updated Jun 12, 2026
Python

DrAbdulmalek / omni-medical-suite

Unified Medical OCR Platform — Next.js + FastAPI + Gradio + Qdrant + Redis

ocr nextjs gradio fastapi qdrant arabic-ocr medical-ocr medical-handwriting

Updated Jul 2, 2026
Python

youssefelzedy / GateGuard-AI

Arabic Plate Recognition System

python ai yolo plate-recognition arabic-ocr arabic-ocr-model

Updated Jun 26, 2025
Python

Improve this page

Add a description, image, and links to the arabic-ocr topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the arabic-ocr topic, visit your repo's landing page and select "manage topics."