feat(berlinmod): streaming-matrix throughput benchmark + parity on real BerlinMOD data (stacks on #20)#21
Closed
estebanzimanyi wants to merge 1 commit into
Conversation
…nd verify parity BerlinMODCorpus loads the real BerlinMOD instants corpus from berlinmod_instants.csv, reprojecting EPSG:3857 to EPSG:4326 through MEOS geo_transform, and derives the per-query parameters and window/tick granularity from the corpus. BerlinMODBenchmark gains a --csv option and reports the 27-cell matrix over the 216075-event corpus. BerlinMODParity verifies the streaming contract on the continuous form: it emits predicate(event) per event and matches a batch pass over the same corpus through the same MEOSBridge call exactly — Q3 and Q8 agree event-for-event with zero mismatches. docs/benchmark-results.md records the real-data throughput and the parity result.
025eaeb to
0d29a39
Compare
Member
Author
|
Superseded by the Path-B consolidation: the former 18-deep stack is collapsed into two reviewable topical PRs on top of the merged scaffold — MEOS integration #30 → benchmark #31 — each one clean squashed commit with the generated-facade bulk, dead family-flag profiles, committed target/ artifacts, and invented synthetic corpus removed. Closing as folded into #30/#31. |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
BerlinMODBenchmark runs all 27 BerlinMOD-9 × 3-form cells as Flink local mini-cluster jobs and reports per-cell throughput and output rows. BerlinMODCorpus loads the real BerlinMOD instants corpus (reprojected EPSG:3857→4326 through MEOS) and derives the per-query parameters; BerlinMODParity verifies the continuous form matches a batch pass over the same MEOS predicate exactly. docs/benchmark-results.md records the real-data figures and the parity result.