Skip to content

Pull requests: OpenHands/benchmarks

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

DRAFT: swtbench: strip non-test files from model_patch in post-processing (option 2 of #708) build-swt-bench Build SWT-Bench images based on SDK version on this PR. investigation
#711 opened May 13, 2026 by juanmichelini Collaborator Draft
DRAFT: swtbench: tighten default prompt to discourage non-test edits (option 1 of #708) build-swt-bench Build SWT-Bench images based on SDK version on this PR. investigation
#710 opened May 13, 2026 by juanmichelini Collaborator Draft
Upgrade LiteLLM to 1.84.0rc1
#709 opened May 12, 2026 by neubig Contributor Draft
[codex] Add EvoClaw benchmark inference
#705 opened May 7, 2026 by xingyaoww Contributor Draft
Filter SWE-Bench Multimodal image builds to curated subset
#644 opened Apr 6, 2026 by juanmichelini Collaborator Loading…
fix: reset BuildKit cache between retries for base/assembly builds
#631 opened Apr 4, 2026 by simonrosenberg Collaborator Loading…
3 tasks
Update Claude ACP package references
#629 opened Apr 3, 2026 by simonrosenberg Collaborator Loading…
build(deps): bump the version-all group across 1 directory with 21 updates dependencies Pull requests that update a dependency file python:uv Pull requests that update python:uv code
#596 opened Mar 31, 2026 by dependabot Bot Loading…
build(deps): bump the version-all group across 1 directory with 5 updates dependencies Pull requests that update a dependency file github_actions Pull requests that update GitHub Actions code
#492 opened Mar 9, 2026 by dependabot Bot Loading…
NeMo Evaluator Integration
#455 opened Feb 26, 2026 by simonrosenberg Collaborator Loading…
Add security benchmark with ASTRA
#361 opened Jan 26, 2026 by XZ-X Loading…
Agentic code search
#141 opened Dec 8, 2025 by adityasoni9998 Contributor Loading…
ProTip! Type g p on any issue or pull request to go back to the pull request listing page.