Mikhail Doroshenko

May 16, 2026

AI Benchmark Digest — 2026-05-16

=== DAILY ===

NEW SCORES FROM TOP-10 MODELS (1)
- GPT-5.5 (xHigh) on Chatbot Arena (Code): 1501.0 Elo (#9/79)

NEW #1 LEADERS (2)
- MathArena - ARXIVLEAN March (Accuracy (%)): AlephProver (34.15) beat Aristotle (17.07) by 17.08
- GAIA (Accuracy (%)): Co-Sight Pro v1.0.1 (93.02) beat OPS-Agentic-Search (92.36) by 0.66


