ethantroy.dev · ethantroy.com · hackidle.com · bioidle.dev

Ethan Troy logo

Ethan Troy

The Roll Up #001

2026-04-15


Read on the web: https://rollup.hackidle.com/issues/2026-04-14-issue-001/


My Projects This Week

Demo: FedRAMP 20x KSI coverage for Kubernetes and M365 landing in Prowler

Demo: Early look at the gap-assessment feature I'm building inside myctrl.tools

Demo: Navigating the FedRAMP docs corpus in the fedramp-browser TUI

grclanker: Terminal-first UI for the GRC-spec experiment

Demo: Claude answering FedRAMP questions from the source of truth via fedramp-docs-mcp

Research Notes

AI Assistance Reduces Persistence and Hurts Independent Performance

AI · RESEARCH · Grace Liu, Brian Christian, Tsvetomira Dumbalska, Michiel A. Bakker, Rachit Dubey

Rollup. Ten minutes of AI during practice was enough for lower unassisted performance when the AI was taken away. How you use it matters more than whether. Answer mode tanks, hint mode holds.

The juice. My read: value and learning live in the friction. When you dissolve that friction with AI and then take the AI away, the skill it was standing in for gets less practice. If you're using LLMs and coding agents to solve problems faster, you need to be reaching for higher-friction work somewhere else to keep growing. If you're just using AI to do the same things you already do, you may be trading away some of the practice that keeps those skills sharp. Use AI to extend yourself. Learn something new. Push into directions you wouldn't have gone.

"Machines of Loving Grace"

AI · ESSAY · Dario Amodei, Anthropic

Rollup. Dario's claim: powerful AI compresses 50 to 100 years of biology into 5 to 10. The constraint isn't compute. It's the speed of the physical world and biological complexity.

The juice. An old piece I'd somehow never read. I come from a biophysics background, and didn't know Dario did too.

I did mitochondrial research at the University of Maryland over a decade ago. Lately I've been mapping where AI can compress medical research timelines the way Dario describes here. That's why I started bioidle.dev as a separate research track.

Open source died in March. It just doesn't know it yet.

SECURITY · SUPPLY CHAIN · Dan Lorenc, Chainguard

Rollup. Five supply-chain attacks in twelve days in March. Lorenc argues the default mitigation stack is weaker than people want to admit against compromised distribution paths.

The juice. Lorenc isn't saying open source is broken. He's saying the consumption model is. Scan-and-pin is a fine defense against a single compromised artifact; it's not a defense against a compromised maintainer account or poisoned distribution, which is what March actually was. If you're writing an SBOM requirement this quarter, decide whether "pinned hash" means "pinned to a maintainer-signed commit I reviewed" or "pinned to whatever CI grabbed last week." Those are different control states.

GitHub Commit Autopsy

SECURITY · FORENSICS · ramimac, High Signal Security

Rollup. A commit impersonating Vercel CEO Guillermo Rauch appeared in actions/checkout without ever being merged, thanks to cross-fork object sharing and GitHub's trust-by-display model.

The juice. Rami (ramimac) puts out high-signal security research, consistently. Got to catch his Shai-Hulud talk at Unprompted earlier this year and the bar's only gone up. This piece is a clean reference for what malicious commit metadata actually looks like in the wild, and I want it handy for future repo-provenance work.

Seeing like an agent: how we design tools in Claude Code

AI · CRAFT · Thariq Shihipar, Anthropic

Rollup. Tool design for agents is an art, not a science. The tools your model needs depend on what it can already do, and those abilities keep changing under you.

The juice. The part I keep coming back to: the scaffold you build for one model generation quietly becomes a cage for the next. Thariq's TodoWrite-to-Task story is the cleanest version of that I've seen written down. Before you ship any new tool, ask whether it's something the current model needs or something older models needed. Different answer.

Vercel Sandboxes push agent microVM infra

INFRA · AGENTS · Vercel

Rollup. Vercel's bet on agent-exec infra: Firecracker microVMs, Active CPU pricing, and sandboxing tied into the broader Fluid Compute stack. Same week Cloudflare is shipping Sandboxes GA and Mesh. Two different visions of where agent code should actually run.

The juice. Personal bias up front: I still treat hosted sandboxes as a design tradeoff, not automatic isolation. The perf and DX wins from Cloudflare and Vercel are real, but I would test either one against the actual job before putting sensitive agent workloads there. Cloudflare's bet is Workers-native: persistent isolated environments with Mesh tying agents into private networks. Vercel's bet is Firecracker microVMs on a unified compute stack. Different answers to where agent code should run.

Building a CLI for all of Cloudflare

INFRA · DEV · Matt Taylor, Dimitri Mitropoulos, Dan Carter, Cloudflare

Rollup. Cloudflare is rebuilding Wrangler toward a single cf command for its broader API surface: ~3,000 HTTP API operations across 100+ products. The interesting piece isn't the CLI. It's Local Explorer.

The juice. Local Explorer is the actually-useful piece for anyone building on Cloudflare. Most agent-dev stories break at "now reach the local dev resources deterministically," and this closes that loop. The typed-schema-over-OpenAPI call is also worth watching for anyone shipping agent-consumable APIs.

The Folder Is the Agent

AI · TOOLING · Kieran Klaassen, Every

Rollup. Three months of trying to make agent swarms work, then the punchline: the folder is the agent. A project folder with a well-tuned CLAUDE.md plus skill files is the specialist. You don't need an orchestration layer.

The juice. Maps onto how I'm using Claude Code in this repo. Kieran pitches folders-as-agents as a general principle, not just his workflow. I think he's right for solo and small-team work, less sure for teams that need explicit state machines or reliability guarantees. Either way, the broader signal I take from it: context engineering is doing more work than model choice in 2026, and a lot of "agent framework" shopping is solving a problem most people don't have yet.

OpenAI Frontier agents: product + platform deep dive

AI · INDUSTRY · The Batch, DeepLearning.AI

Rollup. OpenAI's enterprise move is managing agents, not only building more of them. Frontier is a platform for agent identity, data access, evaluation, and billing across whatever framework the agent was built in.

The juice. Bundling Promptfoo into Frontier doesn't prove agent red-teaming is becoming platform-layer. It proves OpenAI wants it to. Worth watching as a signal, not a conclusion. What I'd bet on: enterprise auditors start asking for audit trails, access scoping, and red-team evidence living inside the agent console. Frontier is already pointing in that direction.

Also on my desk

Claude Code + coding agents

Cloudflare

(plus the cf CLI + Local Explorer writeup above)

Agent discourse

Tweets

Companies already can't prioritize vulnerabilities effectively. Flooding the pipeline with thousands of AI discovered bugs and framing them all as weaponized exploits doesn't help.

Source: Justin Elze · @HackingLZ · Apr 13

Build skills first, fall back to tools. Make reusable agent behavior a skill first; reach for a dedicated tool when the skill shape breaks down.

Source: eric zakariasson · @ericzakariasson · Apr 12

Pi with zero extensions. Start vanilla. Add extensions only for recurring pain your workflow cannot absorb.

Source: Mario Zechner · @badlogicgames · Apr 12

Cloudflare Sandboxes, secure credential injection: Workers act as a trusted proxy so agents can make authenticated calls without seeing raw credentials. Egress policies can be customized globally or per sandbox.

Source: Mike Nomitch · @mikenomitch · Apr 13

Agent harnesses aren't black magic. The framework layer is easier to overcomplicate than people admit.

Source: Theo (t3.gg) · @theo · Apr 13

Z/L Continuum at AI Engineer Europe: Ryan Lopopolo's "be token billionaires" versus Mario Zechner's "slow down and read the code."

Source: Alex Volkov · @altryne · Apr 13

Hugging Face Kernels on the Hub: precompiled GPU kernels for your GPU, PyTorch, and OS, with claimed 1.7x to 2.5x speedups over PyTorch baselines.

Source: Clem Delangue · @ClementDelangue · Apr 14

Interesting GitHub Repos

Upcoming Events & Talks

Fri Apr 17 · Exploring GRC Engineering

PANEL · DUNE SECURITY · Moderated by Alan Luk (Principal TPM, Microsoft)

Rollup. I'm on this virtual panel with James Tabron (Aquia) and Ryan Schoeller (Treasure Data). We're digging into what "GRC Engineering" actually means, and how it moves compliance programs past spreadsheets into automated, continuous risk management.

The juice. GRC Engineering is the through-line for most of what I'm working on lately, and this panel pulls together people building that muscle from different angles. Come say hi.



Got forwarded this? Subscribe below so you don't miss the next one.


You're reading The Roll Up, a weekly Tuesday newsletter from hackIDLE. Forward freely. Archive · Unsubscribe


Don't miss what's next. Subscribe to Ethan Troy:

Add a comment: