Shows MMLU, BigCodeBench, and ARC MC scores pulled from model-index metadata or their pull requests for trending text-generation models.
Loading leaderboard...
Links: