Mikhail Doroshenko

Archives
Log in
Subscribe
June 1, 2026

AI Benchmark Digest — 2026-06-01

AI Benchmark Digest — 2026-06-01

=== DAILY === NEW #1 LEADERS (3) - EQ-Bench Creative Writing v3 (Elo): Claude Opus 4.7 (2050.8) beat GPT-5.4 (1906.0) by 144.8 - Design Arena (Data Viz) (Elo): GLM-5.1 (1367.0) beat Claude Opus 4.7 (Thinking) (1344.0) by 23.0 - Chatbot Arena (Image-to-Video) (Elo): Grok 1.5 (1473.0) beat dreamina-seedance-2.0-720p (1462.0) by 11.0


View on AI Benchmark Hub

Don't miss what's next. Subscribe to Mikhail Doroshenko:
Powered by Buttondown, the easiest way to start and grow your newsletter.