AgentOps Accelerator

Evaluate. Ship. Observe. Own.
Continuous evaluation, safety testing, observability, and release readiness for Microsoft Foundry agents.

Documentation | PyPI | VS Code Extension | Latest release

AgentOps Accelerator helps Microsoft Foundry agent teams evaluate quality, prepare releases, monitor behavior, and stay accountable after launch. It gives you a practical starting point for agent operations, with Foundry integration as the default path and deeper setup guidance in the full docs.

Get started

python -m pip install agentops-accelerator
agentops init

agentops init starts a guided setup that creates your agentops.yaml and .agentops/ workspace.

Next, follow the tutorial that matches your agent type:

What it helps you do

Use AgentOps Accelerator when you need to:

Evaluate an agent before release
Compare changes across versions
Capture release evidence
Monitor agent quality and regressions
Give teams a repeatable way to own agent behavior in production

The accelerator keeps the local workflow simple, then points you to the full docs when you are ready to configure pipelines, dashboards, and release practices.

Learn more

For setup guides, tutorials, architecture, CI/CD guidance, Doctor checks, and evaluator reference, start with the documentation site:

https://aka.ms/agentops-accelerator

Run a first evaluation

az login
$env:AZURE_AI_FOUNDRY_PROJECT_ENDPOINT = "https://<resource>.services.ai.azure.com/api/projects/<project>"
$env:AZURE_OPENAI_ENDPOINT = "https://<openai-resource>.openai.azure.com"
$env:AZURE_OPENAI_DEPLOYMENT = "gpt-4o-mini"
agentops eval analyze
agentops eval run
agentops doctor --evidence-pack

For Foundry targets, use either project_endpoint: in agentops.yaml or AZURE_AI_FOUNDRY_PROJECT_ENDPOINT. Config wins when both are set.

Outputs land in .agentops/results/latest/:

results.json - machine-readable (versioned, stable schema)
report.md - human-readable, PR-friendly

Release evidence lands in .agentops/release/latest/:

evidence.json - machine-readable production-readiness projection
evidence.md - PR/release summary

Capture the first successful run as a baseline:

New-Item -ItemType Directory -Force .agentops\baseline | Out-Null
Copy-Item .agentops\results\latest\results.json .agentops\baseline\results.json

To see a visible comparison, publish a new agent version with a prompt that paraphrases instead of copying exact-answer requests, update agentops.yaml to that new name:version, and compare against the baseline:

agentops eval run --baseline .agentops/baseline/results.json

The report grows a Comparison vs Baseline section with per-metric deltas.

Commands

Install optional extras as needed: [agent] for Doctor/Cockpit and [mcp] for MCP.

agentops --version - show installed version.
agentops init - bootstrap config and seed data.
agentops eval analyze - check eval readiness.
agentops eval init - bootstrap an azd eval.yaml recipe and wire execution: azd.
agentops eval run [--baseline PATH] - run an evaluation.
agentops eval promote-traces --source FILE [--apply] - promote local trace export files.
agentops telemetry validate NAME - validate an Azure Monitor or Application Insights import.
agentops telemetry preview NAME --rows N - preview telemetry import rows.
agentops telemetry import NAME --apply - write the imported telemetry dataset.
agentops report generate - regenerate report.md.
agentops workflow analyze - recommend CI/CD shape.
agentops workflow generate - generate CI/CD workflows.
agentops skills install - install Copilot or Claude skills.
agentops mcp serve - start the MCP server.
agentops doctor [--evidence-pack] - run readiness checks.
agentops cockpit - open the local Cockpit.
agentops agent serve - serve Doctor as a Copilot Extension.

AgentOps Cockpit

agentops cockpit opens a localhost command center for the current workspace. It combines eval history, Doctor findings, workflow status, and links to the matching Foundry and Azure Monitor views.

Cockpit sections, in display order:

Foundry connection - project, tenant, agent, App Insights.
Foundry launchpad - links for the agent, project, and telemetry.
Observability readiness - tracing, evals, red team, alerts.
AgentOps Doctor - latest Doctor findings.
Eval gate summary - local and CI gate history.
Quality gate summary - score trends and regressions.
Production signal - App Insights health snapshot.
CI/CD Pipelines - GitHub Actions status.
Next actions - contextual recommendations.

Documentation

Foundry Prompt Agent tutorial - use this when the Foundry target is agent: name:version. Walks the sandbox to dev journey with a PR gate.
Hosted or HTTP Agent tutorial - use this when the target is a Foundry hosted or HTTP endpoint URL. Same sandbox to dev journey for endpoint-based agents.
End-to-end tutorial - extends either of the above with the full sandbox to dev to qa to prod promotion, Foundry red-team scans, and trace-to-regression promotion.
Evaluation paths - choose static dataset, grey-box HTTP, or telemetry/trace import.
Core concepts
How it works
Doctor explained
CI/CD with GitHub Actions
Built-in evaluator reference
Release process

Contributing

See CONTRIBUTING.md for development, testing, and contribution guidance.

Name		Name	Last commit message	Last commit date
Latest commit History 704 Commits
.claude-plugin		.claude-plugin
.github		.github
.vscode		.vscode
docs		docs
examples/flat-quickstart		examples/flat-quickstart
infra/e2e		infra/e2e
media		media
plugins/agentops		plugins/agentops
scripts		scripts
src/agentops		src/agentops
tests		tests
tombstones/vscode		tombstones/vscode
.gitattributes		.gitattributes
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
AGENTS.md		AGENTS.md
CHANGELOG.md		CHANGELOG.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md
SECURITY.md		SECURITY.md
icon.png		icon.png
launch.json		launch.json
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

AgentOps Accelerator

Get started

What it helps you do

Learn more

Run a first evaluation

Commands

AgentOps Cockpit

Documentation

Contributing

About

Uh oh!

Releases 39

Uh oh!

Contributors

Uh oh!

Languages

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

AgentOps Accelerator

Get started

What it helps you do

Learn more

Run a first evaluation

Commands

AgentOps Cockpit

Documentation

Contributing

About

Resources

License

Code of conduct

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases 39

Uh oh!

Contributors

Uh oh!

Languages