Silv's AI weekly: GPT-5.5 Instant, Claude agent dreaming

        May 11, 2026

Silv's AI weekly: GPT-5.5 Instant, Claude agent dreaming

        Hey — here's this week's AI roundup from silv.blog.
Curated from 148 tweets liked by @mattsilv, May 3 - May 11, 2026:
AI for Everyone
GPT-5.5 Instant rolls out as the default for every ChatGPT user. Available in the API as gpt-5.5-chat-latest. Factuality gains in medicine, law, and finance, plus a new "memory sources" panel that shows what context ChatGPT pulled from your chats, files, and Gmail. Full-duplex voice mode coming. Read more →
Coinbase cuts 14% and says the quiet part out loud. Brian Armstrong's email blames it directly on AI productivity, not market conditions. Coinbase is moving to AI-native pods, one-person teams, 5 org layers max, and every leader as an individual contributor. Read more →
Google launches an AI Health Coach, a $99 screenless Fitbit, and a 14,000-patient study. Fitbit app rebranding as Google Health, Gemini coach connects to U.S. medical records. The study found structured AI symptom interviews beat passive symptom entry by 27% in diagnostic accuracy. Read more →
Kevin Rose quietly relaunches Digg as an AI news aggregator. Live alpha at di.gg with 9M graph connections, 15+ AI judges, real-time X ingestion, influence-flow tracking. Matt Van Horn already shipped a Digg CLI with Claude Code + Hermes skills. Read more →
Apple's camera AirPods hit late-stage testing. Gurman confirmed both earbuds carry cameras capturing low-resolution visual data for Siri. AirPods Pro 3-style design, September target depends on the Gemini-powered Siri rebuild hitting Apple's quality bar. Read more →
Anthropic and Wall Street form a $1.5B joint venture. Blackstone, Goldman, Hellman & Friedman are partners; targets private-equity-owned portfolio companies. The JV is supposed to actually rebuild workflows, not resell API access. Read more →
Anthropic co-founder Jack Clark puts 60% odds on recursive self-improvement by end of 2028. Based on reading hundreds of public AI development sources. One tweet, but an unusual source: he's on the inside and not a hype account. Treat as a planning input. Read more →
AI for Developers
Claude Managed Agents get Dreaming, Outcomes, and multi-agent orchestration. Dreaming reviews past agent sessions to curate memories — Harvey reported ~6x completion improvement (research preview, waitlisted). Outcomes runs a rubric-driven self-grading loop until it clears your bar (public beta, +10pp task success, Wisedocs reports 50% faster reviews). Multi-agent orchestration runs specialists in parallel on shared filesystem. Read more →
Claude Code rate limits doubled, SpaceX compute deal announced. Pro/Max/Team plans got 2x the 5-hour limit, peak-hours throttle removed, Opus API limits raised. Claude Code is up 15x since January 1. Read more →
OpenRouter ships free response caching with one header. Add X-OpenRouter-Cache: true and identical calls return in 80-300ms at zero token cost. Best for agent retries and test suites. Works alongside prompt caching. Read more →
Voice AI convergence day (May 7): OpenRouter Audio APIs, OpenAI GPT-Realtime-2, ElevenLabs cuts prices 55%. OpenRouter now routes TTS/STT across OpenAI, Google, Mistral, Whisper, Chirp 3, and Groq. GPT-Realtime-2 brings GPT-5-class reasoning to voice (instruction retention 36.7% → 70.8% APR). Read more →
Firefox fixed 271 security bugs in April using Claude Mythos. 423 total bugs shipped (vs. the normal 20-30/month baseline) including a 20-year-old XSLT bug. Mozilla built an agentic harness on existing fuzzing infrastructure; Claude created reproducible test cases to validate hypothesized vulnerabilities. Read more →
Codex ships a Chrome extension for background tab work. Per-site permissions, cross-tab debugging, parallel DevTools, no browser takeover. The right answer for browser agent work. Read more →
Pareto Code routes your coding calls to the cheapest capable model. OpenRouter's free experimental router: set min_coding_score, get the cheapest model that clears your bar. DeepSeek V4 Pro, GPT-5.4 Mini, and Gemini 3.1 Pro are at the top right now. Read more →
Hermes Agent v0.13 ships with Autobrowse that self-optimizes 80% cheaper. Demo: browser automation task dropped from 102s to 35s, $1.46 to $0.28 in two iterations, by switching from step-by-step clicking to direct JS eval. Still very low-level; Claude Cowork is the friendlier starting point. Read more →
Honorable Mentions
For everyone:

Claude for Excel, Word, and PowerPoint hit general availability, with Claude for Outlook in public beta. Context carries across Microsoft apps. (source)
NotebookLM Mind Maps got prompt-driven steering, renaming, and sharing. Scope a map to a specific topic instead of letting it auto-generate. (source)
iOS 27 will let you pick Claude or Gemini instead of ChatGPT for Apple Intelligence. Opens the current OpenAI-only escalation path. (source)
Google Finance Beta added AI-powered key moments that explain stock price swings and jump you to the relevant part of the earnings call. (source)
Gemini agent for macOS in development per 9to5Google. Uses Screen Access to organize files, convert to Sheets, batch-rename, draft email summaries from meeting transcripts. (source)
A Gemini "Omni" video model leak surfaced today. First output claimed by @chetaslua; @kimmonismus speculates it ships at Google I/O on May 19, possibly as Veo 3.1's successor. (source)

For developers:

Gemini 3.1 Flash Lite went GA on May 7 at $0.25/M input, $1.50/M output. Gemini 3.2 Flash also briefly visible before Google I/O on May 19. (source)
Grok 4.3 launched on the xAI API at $1.25/M input, $2.50/M output, 1M context. Tops Artificial Analysis on instruction following and Vals AI on case law + corporate finance. (source)
Anthropic's leaked "Orbit" feature for Claude Cowork would connect Gmail/Slack/GitHub/Calendar/Drive/Figma and generate proactive briefings. Dev gate "tibro enabled" (orbit backwards). (source)
OpenRouter's GPT-5.5 cost analysis: short-prompt workloads cost ~92% more on GPT-5.5 vs GPT-5.4; 49-69% more for medium/long. (source)
Claude Platform now supports keyless auth via AWS, GCP, or Azure cloud identity. Anthropic's most-requested security feature. (source)
Codex "Ultrafast mode" briefly appeared in the GitHub repo before being deleted: "The fastest available responses for latency-sensitive work." Unintended push. (source)
OpenAI's October 2025 employee secondary stock sale totaled $6.6B, per WSJ. 600+ current and former employees, $11M average per person. (source)

Try This Weekend
For everyone: Open ChatGPT and look at memory sources the next time you ask something personal, try Google Finance Beta on a stock you actually follow to see the AI key moments annotations, steer a NotebookLM Mind Map with a prompt like "focus on the risks", or bookmark Digg's AI alpha and see if it beats your current habit of scrolling X.
For developers: Add X-OpenRouter-Cache: true to your existing OpenRouter test suite, drop Pareto Code into one call by setting min_coding_score, install the Codex Chrome extension for background-tab work, or set up one cron job that runs Claude Code on a recurring chore (Boris Cherny's PR babysitter pattern).

Read the full post with all sources and links
Know someone who'd find this useful? Forward this email or have them subscribe:

You're receiving this email because you subscribed to the silv.blog weekly AI digest. Unsubscribe anytime.

                                Don't miss what's next. Subscribe to Matt Silverman:

            Email address (required)