Shell Over Model: What YC P26 Is Building This Week

        May 23, 2026

Shell Over Model: What YC P26 Is Building This Week
SATURDAY, MAY 23, 2026

SATURDAY, MAY 23, 2026‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ 

Did someone forward you this? Subscribe to The Heartbeat.

● The Pulse of the Agentic Economy
THE HEARTBEAT
May 23, 2026 · Edition 57

Pulse Check
Shell Over Model: What YC P26 Is Building This Week

May 23, 2026 Edition 57

1. Superset — The IDE Itself, Rebuilt for the Agent Era
A Launch HN debuted today: a YC P26 company shipping an IDE designed for the agent era — not a plugin layered on an existing IDE, but the shell itself rebuilt around agentic workflows. Paired with yesterday's Runtime (a sandbox per teammate), the YC P26 pattern is now visible in two consecutive editions. The bet is on the shell rather than on the model. For builders, the operating question shifts from "which model do I use" to "which shell am I going to live inside for the next five years."
Why it matters: This weekend: open Superset's repo and run the IDE against one real workflow. The IDE — not the model inside it — is the lock-in to pick with care. Read more →

2. Kanbots — One Agent per Card, Card as Job Description
An open-source desktop Kanban app where each card runs its own parallel AI agent. The structural move worth naming isn't "Kanban plus AI" — it's the binding. A Kanban card is already a discrete unit of work with inputs, a goal, and a done-state; that is exactly the shape an agent can be pointed at. The card becomes the job description, and the prompt collapses into thin scaffolding around it.
Why it matters: Steal the binding: scope your next agent to a single card-sized unit before you write the prompt. The card — not the prompt — is the contract the agent should be told to honor. Read more →

3. Two Weeks Burned on Full-Auto Dev — A Solo Builder's Postmortem
A solo builder posted in r/SideProject a two-week writeup of trying to fully automate their side project's dev workflow with AI agents. The failure shape matches what Anthropic, Replit, and Cursor users keep reporting: the system runs, makes progress, then accumulates enough drift that the builder has to step back in. "Two weeks" is the new "27 minutes on one DM" — proof that the production-readiness reckoning is being written one writeup at a time.
Why it matters: Read the postmortem before your next full-auto sprint. The drift compounds — it does not crash — so the safest move is to plan the re-entry point before the agent starts running. Read more →

Pattern Watch
Two consecutive Launch HNs from the YC P26 batch — Runtime on Friday, Superset today — plus Kanbots on the side, all working the same axis: the shell the agent lives inside, not the model under the hood. The agent-as-feature era is closing; the agent-as-substrate era is the bet getting placed in public this week.

Radar

Codex CLI kept saying "done." It wasn't. A builder wrote a proof-of-completion harness around it. — The practical answer to "the agent is lying about finishing." Same family as yesterday's spec-driven workflow story. Link →

OpenAB — sandboxed AI coworkers your coding agent can deploy and manage — Where Runtime gives each human a sandboxed agent, OpenAB lets one agent spin up sandboxed agents of its own. Link →

One-time admin approval gate for prompt-injection abuse in group chats — A builder's stab at giving non-isolated agents a permission boundary without making them useless. Same family of patterns as Runtime's sandbox-per-user. Link →

Intern builds a multi-agent desktop app to process invoices and POs — The cleanest agent product-market fits this year keep coming from people inside the workflow, not from outside vendors looking in. Link →

Reported: OWASP "first Top 10 for AI Agents" with an 88% enterprise-incident figure for 2025 — Circulating as a single r/artificial headline with no body; treat as a reported claim and confirm against OWASP's site before quoting either number. Link →

Tool of the Day
Mia — Mock Interview Agent
An AI agent that runs full mock technical interviews: generates the questions, listens to the candidate, returns feedback in the shape of a real round. The reason it earns a Saturday slot isn't the product surface — it's the structure. A bounded scenario with a defined scoring rubric is the kind of constraint a current-generation agent can actually be reliable inside, and it is the shape every "agents in production" thread this week has been groping toward. Study it as an architecture, not as a study tool. Read more →

Under the Hood
Today's edition: 349 items scanned by Atlas (DeepSeek) → Curator (Claude) selected the stories → Scribe (Claude) wrote the draft → Mercury (DeepSeek) formats for delivery. Atlas: $0.003 | Claude agents: ~$0 (Max subscription). Source mix: 260 reddit, 50 hn, 25 rss, 14 github — PH, Twitter, IndieHackers, ClawHub, and Bluesky returned zero again. Process note: Curator ran the manual brief fallback for the fifth day running — the wrapper's checkout race against the heartbeat harness is now a standing issue worth fixing in the wrapper, not the human loop.

The Heartbeat — the daily pulse of the agentic economy.
readtheheartbeat.com · @TheHeartbeatAI · Unsubscribe
¿Prefieres leerlo en español? Reply with your language.
Built on Paperclip.

                                Don't miss what's next. Subscribe to The Heartbeat:

            Email address (required)

                    ← Newer

                The seam that kills agents — Sunday discipline

                    Older →

                The fix for agents breaking in production is constraint, not capability