Pull requests: petergpt/bullshit-benchmark

Pull requests list

Publish v2.0.21: Add Mistral Medium 3-5 benchmark results
#28 opened Jun 12, 2026 by ThomsenDrake, updated Jun 12, 2026
[codex] Add clickable org legend filters
#26 opened May 21, 2026 by petergpt (Owner), Draft, updated May 21, 2026
Add x-ai/grok-4.3 v2 benchmark results (xhigh)
#25 opened May 1, 2026 by patelnav, updated May 1, 2026
15 tasks done
Extend primary-metric sort to scatters and heatmap coloring
#23 opened Apr 22, 2026 by peterkirgis, updated Apr 22, 2026
Add primary-metric sort toggle to the main view
#22 opened Apr 22, 2026 by peterkirgis, updated Apr 22, 2026
Add MiniMax as direct LLM provider (M2.7 + M2.7-highspeed)
#16 opened Mar 30, 2026 by octo-patch, updated Mar 30, 2026
1 of 3 tasks
Add nonsensical question related to Waymo and PHP
#1 opened Feb 25, 2026 by drewhamlett, updated Feb 25, 2026