Hacker News
LiveBench: A Challenging, Contamination-Free LLM Benchmark (livebench.ai)
3 points by foolswisdom 21 days ago
Gemini 2.5 Pro tops LiveBench, +6 pts overall over Claude 3.7 Sonnet Thinking (livebench.ai)
1 point by ankeshanand 34 days ago
Google's latest Gemini-exp-1206 seems to be great, near the top of livebench (livebench.ai)
4 points by KaoruAoiShiho 4 months ago
LiveBench: A Challenging, Contamination-Free LLM Benchmark (livebench.ai)
1 point by belter 10 months ago
LiveBench: A Challenging, Contamination-Free LLM Benchmark (livebench.ai)
6 points by georgehill 10 months ago
