🔧 Commit Message LoRA

Fine-tuning Qwen2.5-1.5B-Instruct with LoRA to rewrite sloppy git commit messages into clean, professional ones.

Input : fix login bug
Output: Fix authentication failure caused by invalid session token handling

What This Project Does

Developers often write vague commit messages under pressure. This project fine-tunes a 1.5B LLM using Parameter-Efficient Fine-Tuning (LoRA) on 5,000 real GitHub commit messages, teaching the model to rewrite informal inputs into descriptive, professional commits — without fine-tuning all model weights.

Tech Stack

Component	Tool
Base Model	`Qwen/Qwen2.5-1.5B-Instruct`
Fine-Tuning	LoRA via 🤗 PEFT
Trainer	TRL `SFTTrainer`
Quantization	4-bit (bitsandbytes)
Dataset	GitHub Commit Messages — Kaggle
Framework	PyTorch + HuggingFace Transformers

Project Structure

commit-message-lora/
├── data/
│   └── prepare_dataset.py   # Download, clean & sample from Kaggle
├── src/
│   ├── config.py            # All hyperparameters in one place
│   ├── dataset.py           # HuggingFace Dataset + tokenization
│   ├── train.py             # End-to-end training pipeline
│   ├── inference.py         # Load adapter & generate messages
│   └── utils.py             # Prompt templates & text helpers
├── commit_lora/             # Saved LoRA adapter weights (post-training)
├── requirements.txt

Quickstart

1. Install dependencies

pip install -r requirements.txt

Also requires a Kaggle API key at ~/.kaggle/kaggle.json

2. Prepare the dataset

python data/prepare_dataset.py

Downloads the dataset, filters by message length, and samples 5,000 rows to data/prepared.csv.

3. Train

python -m src.train

Trains for 2 epochs with 4-bit quantization + LoRA. The adapter is saved to ./commit_lora.

4. Run inference

python -m src.inference

Or use it directly in code:

from src.inference import load_model_for_inference, generate_commit_message

model, tokenizer = load_model_for_inference()
print(generate_commit_message("add dark mode", model, tokenizer))
# → "Add dark mode support with system preference detection"

Training Details

Hyperparameter	Value
LoRA rank (`r`)	16
LoRA alpha	32
Dropout	0.05
Learning rate	2e-4
Batch size	4
Gradient accumulation	4 steps
Epochs	2
Max sequence length	256 tokens
Quantization	4-bit NF4

Why LoRA?

Fine-tuning a full 1.5B model requires significant GPU memory and time. LoRA inserts small trainable rank-decomposition matrices into the attention layers, cutting trainable parameters by ~99% while achieving comparable task performance. This makes the project reproducible on a single consumer GPU (e.g. T4 on Colab).

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
commit_lora		commit_lora
data		data
src		src
.gitignore		.gitignore
Makefile		Makefile
README.md		README.md
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🔧 Commit Message LoRA

What This Project Does

Tech Stack

Project Structure

Quickstart

1. Install dependencies

2. Prepare the dataset

3. Train

4. Run inference

Training Details

Why LoRA?

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

🔧 Commit Message LoRA

What This Project Does

Tech Stack

Project Structure

Quickstart

1. Install dependencies

2. Prepare the dataset

3. Train

4. Run inference

Training Details

Why LoRA?

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages