# self-evaluation

Here are 15 public repositories matching this topic...

ProductionOS v1.0 — Claude Code plugin with 76 agents, 39 commands, and 12 hooks. Deploys specialized agents that review, score, and improve your entire codebase. Smart routing, recursive convergence, self-evaluation.

  • Updated Apr 16, 2026
  • TypeScript

Local-first, offline CLI that uses no LLM and scores how well your confidence matches reality. Log a falsifiable prediction before you act; when it resolves, it is scored for Brier score and calibration. Built first for coding agents: your standing over- or under-confidence is injected into every session. For both humans and agents.

  • Updated Jun 9, 2026
  • Rust
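
As a rough illustration of the Brier scoring mentioned in the entry above, here is a minimal Python sketch. It is not taken from that repository: the function names, the `(probability, outcome)` record shape, and the sample data are all assumptions made for this example.

```python
from typing import List, Tuple

# Hypothetical record shape: (stated probability that the prediction
# comes true, whether it actually came true).
Record = Tuple[float, bool]


def brier_score(records: List[Record]) -> float:
    """Mean squared gap between stated probability and the 0/1 outcome.

    0.0 is perfect; always answering 0.5 scores 0.25.
    """
    if not records:
        raise ValueError("need at least one resolved prediction")
    return sum((p - float(hit)) ** 2 for p, hit in records) / len(records)


def mean_confidence_gap(records: List[Record]) -> float:
    """Average stated probability minus observed hit rate.

    Positive values suggest overconfidence, negative values underconfidence.
    """
    if not records:
        raise ValueError("need at least one resolved prediction")
    mean_p = sum(p for p, _ in records) / len(records)
    hit_rate = sum(1 for _, hit in records if hit) / len(records)
    return mean_p - hit_rate


# Illustrative data only: four resolved predictions.
resolved = [(0.9, True), (0.8, False), (0.6, True), (0.7, False)]

if __name__ == "__main__":
    print(f"Brier score: {brier_score(resolved):.3f}")
    print(f"Confidence gap: {mean_confidence_gap(resolved):+.3f}")
```

A lower Brier score means the stated probabilities tracked outcomes more closely. The confidence gap is a much cruder summary than a full calibration curve and is included only to show the direction of the bias.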

A cognitive agent architecture for adaptive travel planning, built with LangGraph and custom Python orchestration. It uses a non-destructive state machine with dynamic self-evaluation, conditional re-search loops that fill data gaps, Streamlit UI persistence guards, and token-optimized data serialization.

  • Updated Jun 6, 2026
  • Python

This engine models adaptive reasoning by integrating metacognitive feedback, enabling systems to refine their decision-making through self-evaluation and dynamic restructuring.

  • Updated Jun 24, 2025
