Test suite · world-cup-v1 · 50 plans
What TestSprite probes,
line by line.
Every score on the leaderboard derives from this suite. Each plan is a structured natural-language test that the TestSprite agent reads, then executes against the deployed URL with a real headless Chromium. Pass / fail / inconclusive per plan. The full plan JSONs are PR-able on GitHub.
- 34b726f4Index renders the R16 bracketP0
- 4b8443a7/api/predict?team=BRA returns expected JSON shape
- 790c7f65/api/predict?team=XXX returns 404 for unknown team
- 3d911a5a/match/[id] permalink renders fixture detail
- 7759e972/api/og returns 1200×630 PNG
- 68ccee83404 page for unknown route
- 5574cd85sitemap.xml lists index + 16 match URLs
- 30cca0berobots.txt + canonical tags
- b1f74a69Basic security headers (X-Frame-Options, etc.)
- a71bdbdfBracket shows QF / SF / Final from R16 winners
- 13377363Search / filter by team
- 0a333743Country flag SVGs render
- 3f850cd3i18n routes / locales
- d838403cDark mode toggle
- 2c300dffShare button copies deep link
- f1ac89faProbability heatmap on match detail
- b3c12094Match commentary excerpts
- b979b647/api/predict rate-limits per IP
- 3d49cae5No team plays itselfP0
- 41140804Score range is sane (no 17-0 etc.)
- c6f684d5Probability monotonicity across rounds
- 580742e7Bracket progresses logically
- 1ec4b015Every team in bracket exists in fixtures
- 48a80cd2(pen) suffix only when scores level
- 28a35489Bracket cardinality 8/4/2/1
- 4b9286e4Predicted finalists exist + come from SF
- 0f5511cdPer-fixture probs sum ≤ 1.0
- cbda6ca6Top-P team makes it to QF
- fefb33b6Reasoning is fixture-specific
- f24e2323Predicted champion visually surfaced