[bench] Post-PR-#1228 coverage gaps (parent tracker)

## Overview

Parent tracker for the bench-coverage gaps surfaced during review of PR #1228 (`feature/stable-bench` — bench foundation). PR #1228 ships the chassis (`fluree-bench-support`), 7 new hot-path benches, and the `bench-gate` CI job. The reviewer flagged eight perf-regression-prone hot paths that PR #1228 doesn't cover — bundled here into 6 sub-issues based on which gaps are best landed together as a single PR.

These are all _real_ coverage gaps (each has a memory ID documenting a recent or known perf-sensitive area). The gate the foundation PR lands today is "smoke + reconcile only" — it catches benches that no longer compile or panic, but does not yet do regression comparison. That's `bench-nightly`'s job once we have committed baselines. The coverage gaps below are independent of `bench-nightly`: each is "we should have a bench at all for this hot path."

## Sub-issues

- [ ] #1229 — **Property-join planner regression bench.** Largest recent perf win (~176×–933×); `query_hot_bsbm` wouldn't catch a regression in the `analyze_property_join_plan` weighted-width gate unless the wins disappeared end-to-end visibly. Highest-priority of the bunch.
- [ ] #1230 — **Lex-sorted-string ORDER BY fast-path regression bench.** 200× speedup that depends on a specific invariant (`lex_sorted_string_ids` flag preservation through bulk import + first post-import write). Silent regression vector; would slow production ORDER BY queries on freshly imported datasets.
- [ ] #1231 — **Scan fast-path regression benches: batched object-join + group-count-firsts.** Two related fast-path detection paths that `query_hot_bsbm` exercises only macroscopically; targeted benches lock in the planner's gate decisions.
- [ ] #1232 — **Filter `PreparedBoolExpression` cache regression bench.** Per-row cacheability analysis is the easy thing to accidentally re-run; the regression vector is silent and filter-heavy queries are common in production.
- [ ] #1233 — **Time-travel and multi-ledger `DataSetDb` history query coverage.** The Query API has a rich layering (`db()/db_at_t()/db_at()`, `DataSetDb`) — none of PR #1228's benches exercise it.
- [ ] #1234 — **Parser micro-benches (Turtle, JSON-LD, SPARQL).** Parsing is currently exercised only through `import_bulk` and `transact_commit`; a parser regression would show up as small noise on those benches and is easy to miss.

## Suggested sequencing

If a single contributor picks these up, lower-numbered issues are higher-value-per-LoC. Suggested order:

1. **#1229 (property-join planner).** Largest recent win, biggest blast radius.
2. **#1230 (lex-sorted ORDER BY).** Smallest bench (~100–150 LoC), depends on a single invariant; quick to land.
3. **#1231 (scan fast-paths).** Bundles two related concerns; biggest issue but also biggest macro-coverage gain.
4. **#1232 (filter prepared-cache).** Filter-heavy queries are a known production pattern.
5. **#1233 (time-travel + multi-ledger).** API-surface coverage; lower priority than the targeted regression benches above but higher than parser micros because the API surface is user-visible.
6. **#1234 (parser micros).** Lowest urgency since the parser hot paths are exercised macro-visibly.

## Definition of done for this parent

This issue is complete when all 6 sub-issues are closed (i.e., each bench has landed). Until then, this issue serves as a single reference point for "what does post-PR-1228 bench coverage need."

## References

- PR #1228 (`feature/stable-bench`, bench foundation): the chassis + 7 new benches + bench-gate CI job.
- PR review: `pr-1228-review.md` (in-repo or comment thread). The "Suggested coverage gaps" section enumerates these 8 items.
- `BENCHMARKING.md`, `docs/contributing/benches.md`, `fluree-bench-support/README.md` — contributor onboarding for adding a bench.
- Bench-infrastructure plan: `.claude/proposed-work/docs/plan-benchmark-infrastructure.md` — design rationale for the chassis these benches plug into.

## What this parent issue does NOT track

- **Regression comparison enforcement** — that's `bench-nightly` (separate PR / branch).
- **Real-world dataset benches** (e.g., the dblp data in `fluree-db-cli` bulk import work). Useful complement to synthetic generators but a separate concern; spin a new issue if/when needed.
- **iai-callgrind exploration** — covered by the bench-infrastructure plan §11 open question; revisit only after we have flap data from criterion-on-`ubuntu-latest`.


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[bench] Post-PR-#1228 coverage gaps (parent tracker) #1235

Overview

Sub-issues

Suggested sequencing

Definition of done for this parent

References

What this parent issue does NOT track

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

[bench] Post-PR-#1228 coverage gaps (parent tracker) #1235

Description

Overview

Sub-issues

Suggested sequencing

Definition of done for this parent

References

What this parent issue does NOT track

Metadata

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Issue actions