Add benchmark comparison workflow with PR comments
- benchmark.yaml: on PRs, runs pytest-benchmark on both main and PR,
compares results via tests/benchmark/compare.py. On tags, saves
benchmark data permanently as artifacts (retention-days: 0).
- benchmark_comment.yaml: posts the comparison as a PR comment
(hides old comments, same pattern as primer_comment.yaml)
- tests/benchmark/compare.py: reads two pytest-benchmark JSON files,
computes median deltas, outputs markdown with ±5% threshold flags
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
3 files changed