# rouge-score

Here are 10 public repositories matching this topic...

End-to-end MLOps pipeline that catches LLM quality regressions before production. Every PR is scored against a versioned golden dataset using BERTScore + ROUGE-L + an LLM-as-Judge rubric, compared to the MLflow production baseline, and shadowed against 5% of live traffic. FastAPI + Celery + TimescaleDB + Streamlit + DVC + GitHub Actions.

  • Updated Jun 15, 2026
  • Python
