A practical guide for local LLM deployment with 4-bit quantization. / 4-bit量化本地大模型部署实战指南
-
Updated
May 6, 2026 - Python
A practical guide for local LLM deployment with 4-bit quantization. / 4-bit量化本地大模型部署实战指南
Benchmarking NER on Naamapadam across 11 Indic languages. EDA + model training using mBERT, XLM-R, T5, FlanT5, mT5 + LLM fine-tuning (TinyLlama, Llama-3.2, Gemma, Qwen, Mistral) + 0–5 shot inference on 9 generative models.
Add a description, image, and links to the 4bit-quantization topic page so that developers can more easily learn about it.
To associate your repository with the 4bit-quantization topic, visit your repo's landing page and select "manage topics."