canonical public root
A flagship landing surface for repo-native benchmark storytelling: visible proof links, explicit methodology, and sibling comparison context without pretending the Copilot and Cursor repos share one implementation contract.
This isolated presentation boundary keeps Copilot and Cursor in their own repo-native score models while making the shared view explicitly reporting-comparable.
The compare view keeps Copilot and Cursor in separate repo-native score models. Current shared rows are reporting-comparable observations, not mechanism-equivalent totals.
Current pairing
full/enhanced vs backbone/enhanced
Latest harvested pair stays reporting-comparable.
Capture skew
1.68 minutes
101 seconds between the latest Copilot and Cursor records.
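The two skew figures above are the same measurement in different units (101 s ÷ 60 ≈ 1.68 min). A minimal sketch of that arithmetic follows; the two timestamps and both function names are hypothetical, chosen only so that the gap equals the 101 seconds reported on the page.

```typescript
// Hypothetical capture timestamps; only the 101-second gap comes from the page.
const copilotCapturedAt = new Date("2026-04-21T15:38:00Z");
const cursorCapturedAt = new Date("2026-04-21T15:39:41Z");

// Absolute gap between two capture times, in whole seconds.
function captureSkewSeconds(a: Date, b: Date): number {
  return Math.round(Math.abs(a.getTime() - b.getTime()) / 1000);
}

// Minutes rounded to two decimals for display.
function skewMinutesLabel(seconds: number): string {
  return `${(seconds / 60).toFixed(2)} minutes`;
}

const skewSeconds = captureSkewSeconds(copilotCapturedAt, cursorCapturedAt);
console.log(`${skewSeconds} seconds`);       // 101 seconds
console.log(skewMinutesLabel(skewSeconds));  // 1.68 minutes
```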
Export shell
Next 16 static export
The proven Pages workflow and out/ build contract stay intact.
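For readers unfamiliar with the out/ build contract, a static export in Next.js is typically enabled with the `output: "export"` option, which makes `next build` write the site to an out/ directory. The fragment below is a sketch of such a config under that assumption; the repo's actual next.config file is not shown on this page and may carry additional settings.

```typescript
// next.config.ts (illustrative sketch, not the repo's real file).
// `output: "export"` is the Next.js option for static HTML export;
// with it, `next build` emits the site into the out/ directory.
const nextConfig = {
  output: "export" as const,
};

export default nextConfig;
```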
landing evidence rails
Methodology and History remain first-class routes, while GitHub-hosted docs and benchmark notes stay one click away from the landing surface.
Route
Inspect allowed comparability classes, repo-level provenance, and why reporting-comparable rows do not claim mechanism equivalence.
Open methodology →
Route
Review benchmark runs, validation artifacts, and the generated manifest event in a single chronology.
Open history →
Proof surface
Public repo overview, Copilot CLI-first scope, and current evidence framing.
Boundary proof for repo-owned surfaces, install state, and validation expectations.
GitHub-source-backed citations for Copilot host-product and comparison-scoped wording.
Repo-native benchmark contract, harness notes, and release-gate context.
repo-native comparative readout
Each card explains advantage inside its own harness first, then labels cross-host context as reporting-comparable rather than mechanism-equivalent parity.
oh-my-copilot
Full enhanced records 100/100 in the repo-native scoring model.
reporting-comparable
9/9 required checks are passing with named proof links and timestamps. oh-my-copilot full/enhanced stays an observed repo-native benchmark row. It is safe for cross-host reporting, but it is not a mechanism-equivalent harness match.
oh-my-cursor
Backbone baseline records 100/120 in the repo-native scoring model.
reporting-comparable
6/6 required checks are passing with named proof links and timestamps. oh-my-cursor backbone/baseline stays an observed repo-native benchmark row. It is safe for cross-host reporting, but it is not a mechanism-equivalent harness match.
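The distinction between reporting-comparable and mechanism-equivalent rows can be sketched as a small data model. Everything below is an assumption made for illustration: the `BenchmarkRow` shape, the field names, and the `scoreDelta` guard are hypothetical and are not taken from either repo. Only the scores, check counts, and labels come from the two cards above.

```typescript
type Comparability = "reporting-comparable" | "mechanism-equivalent";

// Hypothetical row shape; field names are illustrative only.
interface BenchmarkRow {
  repo: string;
  variant: string;
  score: number;
  maxScore: number;
  checksPassing: number;
  checksRequired: number;
  comparability: Comparability;
}

const copilotRow: BenchmarkRow = {
  repo: "oh-my-copilot", variant: "full/enhanced",
  score: 100, maxScore: 100, checksPassing: 9, checksRequired: 9,
  comparability: "reporting-comparable",
};

const cursorRow: BenchmarkRow = {
  repo: "oh-my-cursor", variant: "backbone/baseline",
  score: 100, maxScore: 120, checksPassing: 6, checksRequired: 6,
  comparability: "reporting-comparable",
};

// Each row is described inside its own scoring model first.
function describeRow(row: BenchmarkRow): string {
  return `${row.repo} ${row.variant}: ${row.score}/${row.maxScore}, ` +
    `${row.checksPassing}/${row.checksRequired} required checks passing ` +
    `(${row.comparability})`;
}

// A numeric delta is only produced when both rows are mechanism-equivalent;
// reporting-comparable rows may be shown side by side but never subtracted.
function scoreDelta(a: BenchmarkRow, b: BenchmarkRow): number | null {
  const bothEquivalent =
    a.comparability === "mechanism-equivalent" &&
    b.comparability === "mechanism-equivalent";
  return bothEquivalent ? a.score / a.maxScore - b.score / b.maxScore : null;
}

console.log(describeRow(copilotRow));
console.log(describeRow(cursorRow));
console.log(scoreDelta(copilotRow, cursorRow)); // null
```

Returning `null` instead of a number is one way to keep a shared view from accidentally presenting 100/100 and 100/120 as totals on a common scale.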
state confidence
The flagship surface highlights required checks, timestamps, and proof links instead of turning benchmark output into marketing-only copy.
oh-my-copilot
oh-my-copilot starts by establishing a repo-native baseline before making any comparison claim. Cross-host reporting remains reporting-comparable and does not claim mechanism parity. 9/9 required checks are passing with named proof links and timestamps.
9/9 required checks passing
oh-my-cursor
oh-my-cursor starts by establishing a repo-native baseline before making any comparison claim. Cross-host reporting remains reporting-comparable and does not claim mechanism parity. 6/6 required checks are passing with named proof links and timestamps.
6/6 required checks passing
recent evidence
These recent entries keep benchmark runs and validation artifacts visible from the homepage before a reader ever leaves the landing surface.
Apr 21, 2026, 3:38 PM · history
3 observed rows were harvested with reporting-comparable semantics.
Proof: apps/cross-host-benchmark-site/generated/copilot-snapshots.json
Apr 21, 2026, 3:38 PM · history
2 observed rows were harvested with reporting-comparable semantics.
Proof: apps/cross-host-benchmark-site/generated/cursor-snapshots.json
Apr 21, 2026, 4:53 AM · benchmark
100/100 with 100/100 release gate
Proof: benchmark/results/current-full-enhanced
Apr 21, 2026, 4:53 AM · validation
100/100 with reporting-comparable comparability metadata.
Proof: benchmark/results/current-full-enhanced
Apr 21, 2026, 4:52 AM · benchmark
100/100 with 100/100 release gate
Proof: benchmark/results/current-quick-enhanced
Apr 21, 2026, 4:52 AM · benchmark
120/120 with 120/120 release gate
Proof: benchmark/results/current-enhanced
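The feed above mixes history, benchmark, and validation entries in one newest-first chronology. A minimal sketch of that ordering follows, assuming a hypothetical `EvidenceEvent` shape and zone-less timestamps (the page does not state a time zone); the summaries and proof paths are copied from three of the entries listed above.

```typescript
type EvidenceKind = "history" | "benchmark" | "validation";

// Hypothetical event shape; `at` is a zone-less ISO-style timestamp.
interface EvidenceEvent {
  at: string;
  kind: EvidenceKind;
  summary: string;
  proof: string;
}

const events: EvidenceEvent[] = [
  {
    at: "2026-04-21T04:52", kind: "benchmark",
    summary: "120/120 with 120/120 release gate",
    proof: "benchmark/results/current-enhanced",
  },
  {
    at: "2026-04-21T15:38", kind: "history",
    summary: "3 observed rows were harvested with reporting-comparable semantics.",
    proof: "apps/cross-host-benchmark-site/generated/copilot-snapshots.json",
  },
  {
    at: "2026-04-21T04:53", kind: "validation",
    summary: "100/100 with reporting-comparable comparability metadata.",
    proof: "benchmark/results/current-full-enhanced",
  },
];

// Same-format ISO strings sort correctly as plain strings.
// The input array is copied, so the caller's order is left untouched.
function newestFirst(list: EvidenceEvent[]): EvidenceEvent[] {
  return [...list].sort((a, b) => (a.at < b.at ? 1 : a.at > b.at ? -1 : 0));
}

const chronology = newestFirst(events);
for (const e of chronology) {
  console.log(`${e.at} · ${e.kind} · ${e.summary} (Proof: ${e.proof})`);
}
```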