Anthropic Trades Its Safety Pledge for More Power

        February 26, 2026

Anthropic Trades Its Safety Pledge for More Power

        1. Anthropic Killed Its Promise to Pause Unsafe AI. The Same Week, It Bought a Company to Make Claude More Powerful
In 2023, Anthropic made a pledge that became central to its identity: the company would never train an AI system unless it could guarantee in advance that its safety measures were adequate.
2. AI Coding Tools Are Becoming APIs, Not Chat Partners
Three data points landed in the same week. Anthropic shipped Remote Control for Claude Code, letting users connect to a local coding session from a phone or browser.
3. Researchers Expose Government Surveillance Pipeline Behind OpenAI's Identity Screening — While Its Own Threat Report Targets the Same Tactics
On February 25, OpenAI released its latest threat intelligence report, detailing how it disrupted malicious actors who used ChatGPT for romance scams, Chinese law enforcement influence operations,

In Brief

Salesforce Posts Strong Year-End Earnings, Benioff Dismisses AI Threat to SaaS Salesforce beat expectations on its year-end earnings and used the call to push back against recurring predictions that AI will gut traditional SaaS businesses. CEO Marc Benioff framed the threat as familiar, comparing it to previous cycles the company survived. TechCrunch

Benedict Evans: Most AI Users Can't Think of What to Do With It on an Average Day Benedict Evans pointed out that if people use AI only a couple of times a week and struggle to find daily use cases, the technology hasn't changed their lives. He noted OpenAI's own admission of a "capability gap" between what models can do and what people actually do with them — reframing it as a product-market fit problem. Simon Willison

Simon Willison Publishes Guide on Red/Green TDD for Coding Agents Willison released a pattern guide arguing that test-driven development — write failing tests first, then let the agent iterate until they pass — produces significantly better results from coding agents. The approach gives agents a concrete, verifiable goal instead of open-ended instructions. Simon Willison

Gushwork Raises $9M Seed for AI-Powered Customer Lead Search Gushwork closed a $9 million seed round led by SIG and Lightspeed. The startup builds AI search tools for generating customer leads and reports early traction from users discovering businesses through ChatGPT and similar interfaces. TechCrunch

Researchers Release Systematic Study of Training Data for Terminal-Based AI Agents A new paper presents Terminal-Task-Gen, a synthetic task generation pipeline for training LLM-based terminal agents, alongside an analysis of data and training strategies. The work addresses a gap: most top-performing terminal agents keep their training data recipes undisclosed. Hugging Face Papers

VLANeXt Standardizes Training Recipes for Vision-Language-Action Robot Models Hugging Face-surfaced research audits the fragmented VLA model space, where inconsistent training and evaluation protocols make it hard to compare design choices. VLANeXt provides controlled recipes and benchmarks to identify which architectural decisions actually improve robot policy learning. Hugging Face Papers

Paper Shows Test-Time Training with KV Binding Is Equivalent to Linear Attention Researchers proved that a broad class of test-time training architectures — widely interpreted as online meta-learning — can be rewritten as learned linear attention operators. The finding explains several previously puzzling behaviors and opens new directions for efficient sequence modeling. Hugging Face Papers

PyVision-RL Solves Interaction Collapse in Multimodal Agent Training A new RL framework for open-weight multimodal models prevents "interaction collapse," where agents learn to skip tool use and multi-turn reasoning during training. PyVision-RL combines oversampling-filtering-ranking rollouts with cumulative tool rewards to keep agents engaged across turns. Hugging Face Papers

SimToolReal Achieves Zero-Shot Sim-to-Real Dexterous Tool Manipulation A new object-centric policy trained entirely in simulation transfers to real-world dexterous tool manipulation without fine-tuning. The method eliminates the per-object engineering overhead that has limited prior sim-to-real RL approaches for thin-object grasping and forceful tool interactions. Hugging Face Papers

Read the full edition →

                            Don't miss what's next. Subscribe to AI News Digest:

            Email address (required)