Skip to content
#

model-auditing

Here are 13 public repositories matching this topic...

A production-grade LLM Evaluation & Benchmarking Framework for systematic model auditing. Features parallel benchmarking, fairness/bias detection, MMLU integration, and a real-time analytics dashboard powered by React and FastAPI.

  • Updated Apr 14, 2026
  • Python

Subgroup-stratified, calibration-aware fairness auditing for ML models: DeLong AUC confidence intervals, per-subgroup calibration error, multiple-comparison-corrected significance, and a novel five-axis cross-platform protocol (CPFE). Grounded in peer-reviewed methods.

  • Updated Jun 28, 2026
  • Python

AI Evaluator Pro 🛡️ is an AI security auditing tool that checks Hugging Face models for supply chain risks, unsafe formats, and author trust using OSINT + LLMs. It supports direct or discovery-based audits to detect security and integrity issues before deployment.

  • Updated May 13, 2026
  • Python

Second Look — a clinically-grounded triage safety-net & audit system (Triagegeist hackathon): calibrated ESI + vitals-independent red-flag NLP + honest informative-missingness & fairness audit + live Gradio demo.

  • Updated Jun 13, 2026
  • Python

Improve this page

Add a description, image, and links to the model-auditing topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the model-auditing topic, visit your repo's landing page and select "manage topics."

Learn more