Sitemap
Main Pages
Home - Generative AI Explorer
AI Benchmarks
AI Model Comparer
AI Companies
OpenAI
Anthropic
Google DeepMind
Amazon
xAI
Meta
Mistral AI
Deepseek
Alibaba
Microsoft
Cohere
Adobe
Midjourney
Stability.ai
ByteDance
Gen 4
Pika
KlingAI
Suno
Udio
Perplexity
Character.AI
Descript
HeyGen
GitHub
AI Benchmarks
BrowseComp
COLLIE
ComplexFuncBench
IFEval
Multi-IF
MultiChallenge
Tau-bench Airline
Tau-bench Retail
Aider Polyglot
Bird-SQL
Codeforces
HumanEval
LiveCodeBench
SWE-Bench Verified
SWE-Lancer
SWE-Lancer: IC SWE Diamond
BIG-Bench-Hard
Chatbot Arena
MMLU
MMLU-pro
Multilingual MMLU
FACTS Grounding
LOFT (128k)
MRCR (1M)
SimpleQA
AIME 2024
AIME 2025
GSM8K
HiddenMath
MATH
Math 500
MathVista
Mathematical Grade School Math
CoVoST2
DocVQA
EgoSchema
MMMU
Video-MME
ARC
ARC v2
CharXiv-Reasoning
DROP
Graphwalks BFS <128k accuracy
Hellaswag
Humanity's Last Exam
SimpleBench
AI2D
GPQA Diamond