Skip to content

docs: add real-world bash/ops-tooling repo example to Performance#747

Open
saddestmartian wants to merge 1 commit into
DeusData:mainfrom
saddestmartian:docs/bash-ops-tooling-example
Open

docs: add real-world bash/ops-tooling repo example to Performance#747
saddestmartian wants to merge 1 commit into
DeusData:mainfrom
saddestmartian:docs/bash-ops-tooling-example

Conversation

@saddestmartian

Copy link
Copy Markdown

What does this PR do?

Adds a small "Real-World Example" subsection to the Performance section of the README.

The existing Performance benchmarks (Linux kernel, Django) are measured on large polyglot application codebases. Bash already scores in the "Excellent" parsing tier under Language Support, but there's no worked example of what that translates to for a different repo shape: infra/ops-tooling repos that are mostly shell scripts, YAML/JSON config, and Markdown docs rather than a typical multi-language application.

This adds one real session's measurement on an internal dev-tooling repo (bash + JSON + Markdown, ~14.9k indexed nodes / ~20.5k edges): a single search_code call reproducing a config-file consumer inventory in ~375ms vs. a grep-based sub-agent fan-out taking ~131s across 12 tool calls for the same result. It's explicitly labeled as one real-world data point, not a controlled multi-trial benchmark, to keep it honest alongside the hardware-benchmarked numbers above it.

Docs-only change — no source touched.

Checklist

  • Every commit is signed off (git commit -s) — required, CI rejects
    unsigned commits (DCO, see CONTRIBUTING.md)
  • Tests pass locally (make -f Makefile.cbm test) — N/A, docs-only change, no C source touched
  • Lint passes (make -f Makefile.cbm lint-ci) — N/A, docs-only change, no C source touched
  • New behavior is covered by a test (reproduce-first for bug fixes) — N/A, not new behavior, a README addition

The existing Performance section benchmarks large polyglot application
codebases (Linux kernel, Django). Bash already scores in the Excellent
parsing tier under Language Support, but there's no worked example of
the agentic token-efficiency payoff on a repo shape that's mostly shell
scripts, YAML/JSON config, and Markdown docs rather than a typical
multi-language application.

Adds one real session's measurement: a single search_code call vs a
grep-based sub-agent fan-out on an internal dev-tooling repo, clearly
labeled as one real-world data point rather than a controlled benchmark.

Signed-off-by: saddestmartian
Signed-off-by: saddestmartian <saddestmartian@gmail.com>
@saddestmartian saddestmartian requested a review from DeusData as a code owner July 1, 2026 21:32
@DeusData

DeusData commented Jul 1, 2026

Copy link
Copy Markdown
Owner

Hey, looks good. Can you remove the last line? Basically embedd the insights naturally :)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants