Benchmarking NER on Naamapadam across 11 Indic languages. EDA + model training using mBERT, XLM-R, T5, FlanT5, mT5 + LLM fine-tuning (TinyLlama, Llama-3.2, Gemma, Qwen, Mistral) + 0–5 shot inference on 9 generative models.
pytorch named-entity-recognition lora bert indic-languages peft few-shot-learning xlm-roberta huggingface-transformers multilingual-nlp llm ai4bharat hindi-nlp naamapadam 4bit-quantization 8bit-quantization
-
Updated
Jun 23, 2026 - Jupyter Notebook