4bit-quantization

Star

Here are 2 public repositories matching this topic...

Haven16262 / LLM-Local-Deployment-Guide

Star

A practical guide for local LLM deployment with 4-bit quantization. / 4-bit量化本地大模型部署实战指南

python cuda quantization gradio huggingface llm local-llm bitsandbytes qwen deployment-guide 4bit-quantization

Updated May 6, 2026
Python

MaheshJakkala / naamapadam-multilingual-ner

Star

Benchmarking NER on Naamapadam across 11 Indic languages. EDA + model training using mBERT, XLM-R, T5, FlanT5, mT5 + LLM fine-tuning (TinyLlama, Llama-3.2, Gemma, Qwen, Mistral) + 0–5 shot inference on 9 generative models.

pytorch named-entity-recognition lora bert indic-languages peft few-shot-learning xlm-roberta huggingface-transformers multilingual-nlp llm ai4bharat hindi-nlp naamapadam 4bit-quantization 8bit-quantization

Updated Jun 23, 2026
Jupyter Notebook

Improve this page

Add a description, image, and links to the 4bit-quantization topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the 4bit-quantization topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

4bit-quantization

Here are 2 public repositories matching this topic...

Haven16262 / LLM-Local-Deployment-Guide

MaheshJakkala / naamapadam-multilingual-ner

Improve this page

Add this topic to your repo