Run History

A complete archive of every weekly leaderboard run. Each row shows when the battery ran, how many responses and mentions were collected, and its completion status. Click any run to view the full leaderboard standings for that week and inspect the raw model answers behind the scores.
