agent-quality

Star

Here are 8 public repositories matching this topic...

addyosmani / agent-house

Star

Lighthouse for agents - score an agent run, then tell it how to get faster and cheaper.

agents agent-skills agent-quality

Updated Jun 20, 2026
TypeScript

vivekkrishna / semantic-conflicts-benchmark

Star

Benchmarking the ability of large language models to detect semantic conflicts across domains, documents, and evolving knowledge bases.

law science semantic benchmark philosophy artificial-intelligence teams software knowledge-base knowledge-management reasoning conflicts-detection model-quality llm agent-quality

Updated Apr 16, 2026
Python

Oxygen56 / deep-trace

Star

AI Agent Root Cause Analysis Protocol — A systematic five-question methodology adapted from Toyota's 5 Whys for diagnosing failures in AI agent systems. Battle-tested over 60+ production incidents.

root-cause-analysis ai-agent llm-agent five-whys agent-quality deep-trace

Updated Jun 27, 2026

Nick-is-building / ast-guard

Star

Pre-Execution Gate for AI Code. A deterministic, gradient-immune structural guard against reward hacking and hardcoding in RL training loops.

python static-analysis ast code-quality ai-safety ai-governance reward-hacking agent-quality

Updated Jun 29, 2026
Python

wreggyy / answerforge-os-skill

Star

Professional cross-agent answer quality gate for improving AI responses: intent match, evidence, assumptions, verification, brevity, and usefulness.

gemini codex claude prompt-engineering hallucination-reduction ai-skill agent-quality answer-quality

Updated May 14, 2026
PowerShell

eris-ths / llm-agent-quality

Star

LLM Agent quality metrics — structured recording and quality threshold testing for Function Calling agents

python testing metrics llm function-calling agent-quality

Updated Feb 18, 2026
Python

zurbrick / agent-qa-gates

Star

Field-tested QA validation gates for AI agent systems. Tiered gates, protocol gates, severity classification, and automated checks. Born from production failures.

qa validation ai-agents llm-ops openclaw agent-quality

Updated Mar 28, 2026
Shell

agent-quality-controls / specular

Star

Deterministic spec-driven development CLI

rust cli json static-analysis verification spec-driven-development agent-quality

Updated Jun 25, 2026
Rust

Improve this page

Add a description, image, and links to the agent-quality topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the agent-quality topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

agent-quality

Here are 8 public repositories matching this topic...

addyosmani / agent-house

vivekkrishna / semantic-conflicts-benchmark

Oxygen56 / deep-trace

Nick-is-building / ast-guard

wreggyy / answerforge-os-skill

eris-ths / llm-agent-quality

zurbrick / agent-qa-gates

agent-quality-controls / specular

Improve this page

Add this topic to your repo