docs: audit agent instructions and skills for accuracy and brevity by webern · Pull Request #292 · webern/mx

webern · 2026-07-03T06:48:34Z

Summary

To get the most out of limited Fable preview, I asked it to take a pass over the repository looking to improve its legibility to coding agents on a few dimensions:

skill tightness and organization
instructions
documenting weird code including TODOs where it looks like a bug
creating copilot instruction mds for pull requests

AI Summary

What the audit found and fixed:

Wrong instructions: gen/AGENTS.md pointed at a nonexistent make target (test-cpp), AGENTS.md claimed make test runs everything and banned anonymous namespaces "anywhere" while 45 files contain them, the README hello-world example did not compile (Result<int> needs .value()), and several skills described a repo state that is months stale (baseline "pins one file" vs ~140, "segno is never written", a worked bug example that is already fixed).
Bloat: skills trimmed by ~140 net lines (arch lost its microservices boilerplate, duplicated test-run and PR-template text now has one owner), and the third retelling of the audit-tool story now points at audit/README.md instead.
New guidance where agents actually look:
- AGENTS.md "mx::api conventions": new absent-able fields use std::optional; absent-able enums keep an unspecified first enumerator (including the ternary Bool); no new -1/is...Specified patterns and no mass-migration. Codifies the convention from the feat(api): support lv (let-ring) ties #286 review discussion.
- docs/ai/design/api-design-principles.md: seven principles for how the api re-shapes MusicXML (stateless ticks, containment over label fields, one-fact-one-field, no neighbor-dependent meaning, denormalized state, quarantined edge cases, ignorable fidelity knobs), with a digest inlined in AGENTS.md.
- Six path-scoped Copilot review instruction files under .github/instructions/.
Code comments at verified traps (tie/tied dual emit, 1/1 time-modification sentinel, undertie bytes, id seed) and TODOs marking six suspected bugs found during the audit: arranger/publisher overwrite lyricist, a && that drops explicit normal-type, an off-by-one in test filename stripping, silent normal-type loss for tuplets without tuplet notations, ambiguous __ path flattening in the classifier, and the dead DurationData::isTied field. Each TODO is an issue candidate; happy to file them.

Testing

clang-format dry-run clean on all touched C++ files (comment-only changes; no behavior)
python3 -m py_compile on touched Python
Every factual claim in the edited docs was verified against the Makefile, CI workflow, and source before editing; stale-skill claims were re-checked in code (e.g. emitSegno exists, baseline length, batch_plan output)

References

Related: api: replace value + isSpecified bool pairs with an explicit optional value type #249 (isSpecified -> optional direction now documented), Sentinel values should have constexpr names. #282 (sentinel naming), feat(api): support lv (let-ring) ties #286 (review discussion that set the optional/unspecified convention)

Generated by Claude Code

- AGENTS.md: quickstart, mx::api presence/absence conventions (std::optional for new fields, unspecified enumerators for enums incl. Bool), accurate 'make test' scope, soften anonymous-namespace rule to match reality, condense Docker/audit/corert sections, fix typos - gen/AGENTS.md: fix nonexistent 'make test-cpp' target, unify commit policy - gen/README.md: point at gen/AGENTS.md instead of duplicating commands - README.md: fix non-compiling createFromScore example (Result<int>.value()), ARGS filter example, 'make dev' description - audit/data READMEs, Makefile, CI: fix skill-name references, add classify.py to layout table, fix garbled sentences, drop stale 829 pin count comment - code: add TODOs for 5 suspected bugs (arranger/publisher clobber lyricist, time-mod normal-type && condition, filename off-by-one, silent normal-type drop for tuplets without <tuplet> marks, ambiguous __ path flattening) and 9 clarifying comments at verified confusing spots

Corrects stale claims and trims redundancy across .claude/skills per the instructions audit; a follow-up commit completes the remaining skill fixes.

- skills: complete audit fixes (arch trimmed of irrelevant generic sections, stale examples replaced, optionality guidance in add-feature step-2 now matches the std::optional / unspecified convention) - docs/ai/design/api-design-principles.md: seven principles describing how mx::api re-shapes MusicXML (stateless time, containment over labels, one-fact-one-field, no neighbor-dependent meaning, denormalized state, quarantined edge cases, ignorable fidelity knobs), referenced from AGENTS.md and the mx-api-add-feature skill - NoteData.h: correct stale tie comment (writer emits both tie and tied) - DurationData.h: TODO on dead isTied field

Six .github/instructions/*.instructions.md files gated by applyTo globs: public api headers (presence/absence conventions, equality macros), impl (read/write symmetry, silent-drop and sentinel contracts), generated core (never review or hand-edit generated code), gen (cardinal rules), repo-wide C++ (anonymous namespaces, formatter-owned layout), and corpus/tests (pinned counts, markers, baseline discipline).

Agents reliably read AGENTS.md (SessionStart hook) but often skip linked docs; carry the seven rules inline, keep the full doc for examples and per-rule tests.

github-actions · 2026-07-03T07:03:38Z

Coverage report

Core-dev coverage `src/private/mx/core/`

Metric	Coverage	Covered / Total
Lines	77.8%	28487 / 36624
Functions	74.3%	6349 / 8550
Branches	50.6%	22632 / 44725

API coverage `src/private/mx/{api,impl,utility}/`

Metric	Coverage	Covered / Total
Lines	78.7%	6101 / 7754
Functions	64.3%	2091 / 3250
Branches	47.8%	5182 / 10834

Core HTML report | API HTML report

Commit 124e3d2314d151486e2b03cb7a42b8b582481255.

github-actions · 2026-07-03T07:03:39Z

gen-quality `gen/`

gen-quality: 84.5 / 100   (floor 84.5, +0.0)

  structure     86.5  x0.50   [fn 90.5 / file 82.6]
  cyclomatic    88.4  x0.25
  cognitive     76.6  x0.25

  409 functions across 31 files, 7702 lines (largest file 1044)
  max cc 56  max cognitive 44  max fn loc 152

Worst offenders (top 5 per axis; full lists in score.json):
  cyclomatic gen/xsd/analyze.py:311     report                             56
  cyclomatic gen/plates/build.py:956    _validate_config_against_ir        35
  cyclomatic gen/press/context.py:145   plate_context                      34
  cyclomatic gen/__main__.py:46         _ir                                23
  cyclomatic gen/tests/test_ir.py:102   _check_references                  20
  cognitive  gen/xsd/analyze.py:311     report                             44
  cognitive  gen/ir/resolve.py:119      flat_elements                      40
  cognitive  gen/tests/test_ir.py:102   _check_references                  38
  cognitive  gen/press/context.py:145   plate_context                      37
  cognitive  gen/xsd/analyze.py:207     _sccs                              37
  size       gen/xsd/analyze.py:311     report                             152
  size       gen/press/context.py:145   plate_context                      96
  size       gen/plates/build.py:533    _value_plate                       89
  size       gen/plates/build.py:956    _validate_config_against_ir        89
  size       gen/ir/resolve.py:119      flat_elements                      78

Commit 124e3d2314d151486e2b03cb7a42b8b582481255.

arch -> mx-architecture, grill-me -> mx-grill-me, open-mx-issue -> mx-open-issue, open-mx-pr -> mx-open-pr, project -> mx-project, questions -> mx-questions. Cross-references updated; open-issue/open-pr descriptions gain trigger phrases (report a bug, submit as PR). AGENTS.md layout notes the convention.

github-actions · 2026-07-03T08:55:28Z

Coverage report

Core-dev coverage `src/private/mx/core/`

Metric	Coverage	Covered / Total
Lines	77.8%	28487 / 36624
Functions	74.3%	6349 / 8550
Branches	50.6%	22632 / 44725

API coverage `src/private/mx/{api,impl,utility}/`

Metric	Coverage	Covered / Total
Lines	78.7%	6101 / 7754
Functions	64.3%	2091 / 3250
Branches	47.8%	5182 / 10834

Core HTML report | API HTML report

Commit 4cda961ba1756f5905628b9ee04c21a71296ed00.

github-actions · 2026-07-03T08:55:28Z

gen-quality `gen/`

gen-quality: 84.5 / 100   (floor 84.5, +0.0)

  structure     86.5  x0.50   [fn 90.5 / file 82.6]
  cyclomatic    88.4  x0.25
  cognitive     76.6  x0.25

  409 functions across 31 files, 7702 lines (largest file 1044)
  max cc 56  max cognitive 44  max fn loc 152

Worst offenders (top 5 per axis; full lists in score.json):
  cyclomatic gen/xsd/analyze.py:311     report                             56
  cyclomatic gen/plates/build.py:956    _validate_config_against_ir        35
  cyclomatic gen/press/context.py:145   plate_context                      34
  cyclomatic gen/__main__.py:46         _ir                                23
  cyclomatic gen/tests/test_ir.py:102   _check_references                  20
  cognitive  gen/xsd/analyze.py:311     report                             44
  cognitive  gen/ir/resolve.py:119      flat_elements                      40
  cognitive  gen/tests/test_ir.py:102   _check_references                  38
  cognitive  gen/press/context.py:145   plate_context                      37
  cognitive  gen/xsd/analyze.py:207     _sccs                              37
  size       gen/xsd/analyze.py:311     report                             152
  size       gen/press/context.py:145   plate_context                      96
  size       gen/plates/build.py:533    _value_plate                       89
  size       gen/plates/build.py:956    _validate_config_against_ir        89
  size       gen/ir/resolve.py:119      flat_elements                      78

Commit 4cda961ba1756f5905628b9ee04c21a71296ed00.

Job display names now state the suites executed (gen gates, api mxtest + round-trip, core corert/unit/xmllint/probes, Go+C corert targets, coverage, macOS core+api, swift build-only); the coverage and macos names previously claimed core-only while also running api suites. Job keys are unchanged. Comments note what each job deliberately skips.

github-actions · 2026-07-03T12:46:58Z

Coverage report

Core-dev coverage `src/private/mx/core/`

Metric	Coverage	Covered / Total
Lines	77.8%	28487 / 36624
Functions	74.3%	6349 / 8550
Branches	50.6%	22632 / 44725

API coverage `src/private/mx/{api,impl,utility}/`

Metric	Coverage	Covered / Total
Lines	78.7%	6101 / 7754
Functions	64.3%	2091 / 3250
Branches	47.8%	5182 / 10834

Core HTML report | API HTML report

Commit 6485b6e81b62e528ea776ee82d0e12e665bca53a.

github-actions · 2026-07-03T12:46:59Z

gen-quality `gen/`

gen-quality: 84.5 / 100   (floor 84.5, +0.0)

  structure     86.5  x0.50   [fn 90.5 / file 82.6]
  cyclomatic    88.4  x0.25
  cognitive     76.6  x0.25

  409 functions across 31 files, 7702 lines (largest file 1044)
  max cc 56  max cognitive 44  max fn loc 152

Worst offenders (top 5 per axis; full lists in score.json):
  cyclomatic gen/xsd/analyze.py:311     report                             56
  cyclomatic gen/plates/build.py:956    _validate_config_against_ir        35
  cyclomatic gen/press/context.py:145   plate_context                      34
  cyclomatic gen/__main__.py:46         _ir                                23
  cyclomatic gen/tests/test_ir.py:102   _check_references                  20
  cognitive  gen/xsd/analyze.py:311     report                             44
  cognitive  gen/ir/resolve.py:119      flat_elements                      40
  cognitive  gen/tests/test_ir.py:102   _check_references                  38
  cognitive  gen/press/context.py:145   plate_context                      37
  cognitive  gen/xsd/analyze.py:207     _sccs                              37
  size       gen/xsd/analyze.py:311     report                             152
  size       gen/press/context.py:145   plate_context                      96
  size       gen/plates/build.py:533    _value_plate                       89
  size       gen/plates/build.py:956    _validate_config_against_ir        89
  size       gen/ir/resolve.py:119      flat_elements                      78

Commit 6485b6e81b62e528ea776ee82d0e12e665bca53a.

- mx-open-pr template gains a Human Summary section for the maintainer to fill in - api-headers review instructions: comments should serve humans and coding agents, crisp and focused on the surprising - cpp review instructions: comments state constraints and effects hard to ascertain from the code

github-actions · 2026-07-04T06:44:58Z

Coverage report

Core-dev coverage `src/private/mx/core/`

Metric	Coverage	Covered / Total
Lines	77.8%	28487 / 36624
Functions	74.3%	6349 / 8550
Branches	50.6%	22632 / 44725

API coverage `src/private/mx/{api,impl,utility}/`

Metric	Coverage	Covered / Total
Lines	78.7%	6117 / 7776
Functions	64.3%	2091 / 3252
Branches	47.9%	5206 / 10866

Core HTML report | API HTML report

Commit 41f7c9f3b46c8d45c1a96c51094b8a1edc92553f.

github-actions · 2026-07-04T06:44:59Z

gen-quality `gen/`

gen-quality: 84.5 / 100   (floor 84.5, +0.0)

  structure     86.5  x0.50   [fn 90.5 / file 82.6]
  cyclomatic    88.4  x0.25
  cognitive     76.6  x0.25

  409 functions across 31 files, 7702 lines (largest file 1044)
  max cc 56  max cognitive 44  max fn loc 152

Worst offenders (top 5 per axis; full lists in score.json):
  cyclomatic gen/xsd/analyze.py:311     report                             56
  cyclomatic gen/plates/build.py:956    _validate_config_against_ir        35
  cyclomatic gen/press/context.py:145   plate_context                      34
  cyclomatic gen/__main__.py:46         _ir                                23
  cyclomatic gen/tests/test_ir.py:102   _check_references                  20
  cognitive  gen/xsd/analyze.py:311     report                             44
  cognitive  gen/ir/resolve.py:119      flat_elements                      40
  cognitive  gen/tests/test_ir.py:102   _check_references                  38
  cognitive  gen/press/context.py:145   plate_context                      37
  cognitive  gen/xsd/analyze.py:207     _sccs                              37
  size       gen/xsd/analyze.py:311     report                             152
  size       gen/press/context.py:145   plate_context                      96
  size       gen/plates/build.py:533    _value_plate                       89
  size       gen/plates/build.py:956    _validate_config_against_ir        89
  size       gen/ir/resolve.py:119      flat_elements                      78

Commit 41f7c9f3b46c8d45c1a96c51094b8a1edc92553f.

webern added 5 commits July 3, 2026 06:34

skills: apply audit fixes to agent skills (in progress)

eb63cbc

Corrects stale claims and trims redundancy across .claude/skills per the instructions audit; a follow-up commit completes the remaining skill fixes.

docs: inline api design-principles digest into AGENTS.md

48c2d25

Agents reliably read AGENTS.md (SessionStart hook) but often skip linked docs; carry the seven rules inline, keep the full doc for examples and per-rule tests.

webern added non-breaking fixes or implementation that do not require breaking changes ai Issues opened by, or through, a coding agent. labels Jul 3, 2026 — with Claude

webern commented Jul 4, 2026

View reviewed changes

Comment thread .claude/skills/mx-open-pr/SKILL.md

Comment thread .github/instructions/api-headers.instructions.md

Comment thread .github/instructions/cpp.instructions.md Outdated

webern merged commit e50e33a into main Jul 4, 2026
7 checks passed

webern deleted the claude/audit-agent-instructions-skills-o4s0sl branch July 4, 2026 06:31

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

docs: audit agent instructions and skills for accuracy and brevity#292

docs: audit agent instructions and skills for accuracy and brevity#292
webern merged 8 commits into
mainfrom
claude/audit-agent-instructions-skills-o4s0sl

webern commented Jul 3, 2026 •

edited

Loading

Uh oh!

github-actions Bot commented Jul 3, 2026

Uh oh!

github-actions Bot commented Jul 3, 2026

Uh oh!

github-actions Bot commented Jul 3, 2026

Uh oh!

github-actions Bot commented Jul 3, 2026

Uh oh!

github-actions Bot commented Jul 3, 2026

Uh oh!

github-actions Bot commented Jul 3, 2026

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

github-actions Bot commented Jul 4, 2026

Uh oh!

github-actions Bot commented Jul 4, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

webern commented Jul 3, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

AI Summary

Testing

References

Uh oh!

github-actions Bot commented Jul 3, 2026

Coverage report

Core-dev coverage src/private/mx/core/

API coverage src/private/mx/{api,impl,utility}/

Uh oh!

github-actions Bot commented Jul 3, 2026

gen-quality gen/

Uh oh!

github-actions Bot commented Jul 3, 2026

Coverage report

Core-dev coverage src/private/mx/core/

API coverage src/private/mx/{api,impl,utility}/

Uh oh!

github-actions Bot commented Jul 3, 2026

gen-quality gen/

Uh oh!

github-actions Bot commented Jul 3, 2026

Coverage report

Core-dev coverage src/private/mx/core/

API coverage src/private/mx/{api,impl,utility}/

Uh oh!

github-actions Bot commented Jul 3, 2026

gen-quality gen/

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

github-actions Bot commented Jul 4, 2026

Coverage report

Core-dev coverage src/private/mx/core/

API coverage src/private/mx/{api,impl,utility}/

Uh oh!

github-actions Bot commented Jul 4, 2026

gen-quality gen/

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

webern commented Jul 3, 2026 •

edited

Loading

Core-dev coverage `src/private/mx/core/`

API coverage `src/private/mx/{api,impl,utility}/`

gen-quality `gen/`

Core-dev coverage `src/private/mx/core/`

API coverage `src/private/mx/{api,impl,utility}/`

gen-quality `gen/`

Core-dev coverage `src/private/mx/core/`

API coverage `src/private/mx/{api,impl,utility}/`

gen-quality `gen/`

Core-dev coverage `src/private/mx/core/`

API coverage `src/private/mx/{api,impl,utility}/`

gen-quality `gen/`