Skip to content
#

expected-calibration-error

Here are 6 public repositories matching this topic...

Language: All
Filter by language

Decision-safe evaluation + Streamlit dashboard for AI vs Human vs Post-Edited AI text detection. Generates a reliability report card (Accuracy, Macro F1, ECE, Brier), calibration plots, confidence histograms, and a coverage-vs-performance abstention curve. Recommends an operating threshold for human-review routing.

  • Updated Jun 30, 2026
  • Python

How Chain-of-Thought Budgets Induce Overconfidence in LLMs. Investigating Calibration Drift Under Reasoning (CDUR), hypothesis lock-in mechanisms, and dynamic token budgeting via the CABStop optimal stopping algorithm.

  • Updated Jun 16, 2026
  • Python

MSc thesis: evaluating & improving uncertainty calibration in retrieval-augmented LLMs. Compares 5 embedding models (BGE beats OpenAI) and cuts ECE ~32% via isotonic regression on real financial filings.

  • Updated Jun 24, 2026
  • Jupyter Notebook

Research framework for analyzing calibration collapse under class imbalance in tabular machine learning, with adaptive calibration methods, stress-test benchmarks, and minority reliability evaluation.

  • Updated May 8, 2026
  • Python

Improve this page

Add a description, image, and links to the expected-calibration-error topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the expected-calibration-error topic, visit your repo's landing page and select "manage topics."

Learn more