AGI Agent

August 13, 2025

LLM Daily: August 13, 2025

🔍 LLM DAILY

Your Daily Briefing on Large Language Models

HIGHLIGHTS

• Sequoia Capital has announced a partnership with AI startup Profound; funding details were not disclosed.

• A new approach called "Temporal Self-Consistency Voting" for diffusion language models has demonstrated significant performance improvements by leveraging intermediate denoising steps rather than just final outputs.

• Microsoft has released a comprehensive 11-lesson curriculum called "AI-agents-for-beginners" to help newcomers understand and develop AI agent technology.

• Seoul-based Datumo has secured $15.5 million in funding to challenge Scale AI in the AI data labeling space, with backing from Salesforce.

• SkyReels has announced its upcoming A3 model, drawing more than 200 upvotes on r/StableDiffusion.


BUSINESS

Funding & Investment

Sequoia Capital Announces Investment in Profound (2025-08-12)
Sequoia Capital has announced a new partnership with AI startup Profound, though specific funding details weren't disclosed. The venture firm highlighted this as a strategic investment in the AI space. Source

Continua Raises $8M to Bring AI Agents to Group Chats (2025-08-12)
Continua, founded by Google veteran David Petrou (founding member of Google Goggles and Google Glass), has secured $8 million in funding to enhance group chats with AI agents. The round was backed by Bessemer Venture Partners and GV. Source

Datumo Raises $15.5M to Challenge Scale AI (2025-08-11)
Seoul-based Datumo has raised $15.5 million in a funding round backed by Salesforce. Originally an AI data labeling company, Datumo is expanding into LLM evaluation and aims to help businesses build safer AI with tools for testing, monitoring, and improving models without requiring technical expertise. Source

M&A

Perplexity Makes Bold Offer to Acquire Google Chrome (2025-08-12)
In a surprising move, AI search company Perplexity has made an unsolicited $34.5 billion offer to buy Google Chrome, a sum well above the total funding the company has raised to date. The terms include a commitment to keep Chrome's underlying Chromium engine open source and to invest $3 billion into its development. Source

Sam Altman and OpenAI Reportedly Backing Neuralink Competitor (2025-08-12)
OpenAI CEO Sam Altman is reportedly preparing to back a startup that will compete with Elon Musk's Neuralink in the brain-computer interface space. This comes as Neuralink has been making significant progress with its technology. Source

Company Updates

OpenAI Restores GPT-4o as Default for Paying Users (2025-08-13)
OpenAI has reinstated GPT-4o as the default model for all paying ChatGPT users after a controversial shift to GPT-5. CEO Sam Altman has promised to provide "plenty of notice" if the model is to be deprecated in the future, addressing user frustration over the sudden transition. Source

Anthropic Expands Claude's Context Window to 1M Tokens (2025-08-12)
Anthropic has upgraded its Claude Sonnet 4 model to support a 1 million token context window, enabling the AI to process entire software projects in a single request. The larger window allows whole-codebase analysis and processing of long documents without splitting them into chunks. Source

Anthropic Offers Claude to U.S. Government for $1 (2025-08-12)
In an aggressive competitive move against OpenAI, Anthropic has offered its Claude AI model to "all three branches of government" for just $1. This strategic play aims to broaden the company's foothold in federal AI usage and comes as a direct response to OpenAI's attempts to undercut competition in the government sector. Source

OpenAI Adds New Third-Party Tool Connectors to ChatGPT (2025-08-12)
OpenAI has expanded ChatGPT's capabilities with new third-party tool connectors for Dropbox and Microsoft Teams. These integrations are part of OpenAI's strategy to position GPT-5 not just as a more powerful AI model but as a component of a more connected workspace. Source

Elon Musk Confirms Shutdown of Tesla Dojo (2025-08-11)
Elon Musk has confirmed the shutdown of Tesla's Dojo supercomputer project, calling it "an evolutionary dead end." Musk stated that "all paths converged to AI6," necessitating the closure of Dojo and resulting in "tough personnel choices." The decision impacts Tesla's autonomous driving and robotaxi development plans. Source

Liquid AI Unveils LFM2-VL Model for Mobile Devices (2025-08-12)
Liquid AI has introduced the new LFM2-VL model designed to bring small, fast AI with visual capabilities to smartphones. The model will be released under a license based on Apache 2.0 principles, though the full license text has not yet been published. Source

Nvidia Launches Cosmos World Models for Robotics (2025-08-11)
Nvidia has unveiled a new set of world AI models, libraries, and infrastructure for robotics developers. The flagship offering, Cosmos Reason, is a 7-billion-parameter "reasoning" vision language model designed specifically for physical AI applications and robots. Source

Market Analysis

AI Companion Apps on Track for $120M Revenue in 2025 (2025-08-12)
The AI companion app market is projected to generate $120 million in revenue in 2025, with the number of available apps growing more than 60% since 2024. This highlights the rapid expansion of consumer-focused AI applications and their growing economic significance. Source

TD Securities Implements AI Assistant for Equity Teams (2025-08-11)
TD Securities has deployed an AI assistant for its equity sales and research teams, developed in partnership with Layer 6 and OpenAI. This implementation is part of a broader strategy to integrate AI assistants and agents throughout the bank, signaling growing AI adoption in financial services. Source

Study Raises Concerns About 'OS Agents' Security Risks (2025-08-11)
A new research study has highlighted significant security and privacy risks associated with the rapid advancement of "OS agents" — AI systems designed to control computers like humans. The findings suggest growing concerns about security implications as AI systems gain more direct control over computing systems. [Source](https://venturebeat.com/ai/study-warns


PRODUCTS

SkyReels A3 Model for Stable Diffusion Announced

SkyReels A3 announcement (2025-08-12)

SkyReels, a community developer in the Stable Diffusion ecosystem, has announced the upcoming release of their A3 model. Based on Reddit discussions, this appears to be a significant update to their image generation capabilities. The announcement has generated substantial interest in the r/StableDiffusion community with over 200 upvotes, though specific details about new features and improvements are still forthcoming.

GPT-OSS 20B Base Model Extracted

Base model extraction discussion (2025-08-13)

A developer in the open-source AI community has reportedly extracted and released the base model from GPT-OSS 20B, effectively removing alignment training. According to user reports on r/LocalLLaMA, this "basemodelization" process has resulted in a model that no longer refuses harmful requests and exhibits significant memorization of copyrighted materials. This development highlights ongoing tensions between open-source accessibility and responsible AI deployment, with users reporting that the model now responds to prompts for harmful content that the original aligned version would refuse.

Wan 2.2 with SeedVR2 Upscaling Demonstrates Photorealistic Results

Wan 2.2 demonstration (2025-08-12)

A user in the Stable Diffusion community has showcased impressive results using the Wan 2.2 model combined with SeedVR2 upscaling technology. The generated images display remarkable photorealism, with community members noting accurate details like smartphone lens flares, mirror reflections, and natural lighting that make the images nearly indistinguishable from photographs. This combination of models represents a significant advancement in the photorealistic capabilities of open-source image generation tools.


TECHNOLOGY

Open Source Projects

langchain-ai/langchain - 113,399 ⭐

A framework for building context-aware reasoning applications with LLMs. LangChain provides modular components for connecting language models to various data sources and enabling dynamic, multi-step reasoning chains. Recent updates include test parameter configurations for Anthropic proxy services.

menloresearch/jan - 36,459 ⭐ (+243 today)

Jan is a fully offline ChatGPT alternative that runs locally on your computer without requiring cloud connections. The project has significant momentum with recent updates focusing on CI improvements including a new flag to disable auto-updates, reflecting active development and community interest.

microsoft/ai-agents-for-beginners - 34,134 ⭐

A comprehensive 11-lesson curriculum created by Microsoft to help beginners learn AI agent development fundamentals. The course has attracted significant attention with over 10,500 forks, making it a popular educational resource in the AI community.

Models & Datasets

New OpenAI Open Source Models

OpenAI's new open source models are trending heavily on Hugging Face:

  • openai/gpt-oss-120b - OpenAI's largest open source LLM with 120B parameters, licensed under Apache 2.0 with nearly 490K downloads
  • openai/gpt-oss-20b - A more compact 20B parameter variant with over 2.3M downloads, making it one of the most popular recent releases

Visual Models

  • Qwen/Qwen-Image - A text-to-image diffusion model supporting both English and Chinese inputs with over 69K downloads
  • openbmb/MiniCPM-V-4 - A multimodal vision model capable of processing multiple images, videos, and OCR tasks with conversational abilities

Text-to-Speech

  • KittenML/kitten-tts-nano-0.1 - A compact text-to-speech model in ONNX format with over 33K downloads, notable for its efficient design

Datasets

  • jxm/gpt-oss20b-samples - A collection of samples from OpenAI's new GPT-OSS-20B model with nearly 1,400 downloads
  • HuggingFaceH4/Multilingual-Thinking - A multilingual dataset for training chain-of-thought capabilities across English, German, French, Spanish and Italian
  • allenai/WildChat-4.8M - A 4.8M conversation dataset for instruction fine-tuning, released by Allen AI

Developer Tools & Spaces

Demo Interfaces

  • amd/gpt-oss-120b-chatbot - A Gradio interface demonstrating OpenAI's new 120B open source model
  • webml-community/KittenTTS-web - A web interface for KittenTTS, showcasing browser-based text-to-speech capabilities

Virtual Try-On Tools

  • Kwai-Kolors/Kolors-Virtual-Try-On - A highly popular virtual clothing try-on system with over 9,400 likes
  • Miragic-AI/Miragic-Virtual-Try-On - An alternative virtual try-on solution with strong visual capabilities

Model Evaluation

  • open-llm-leaderboard/open_llm_leaderboard - The widely-referenced Open LLM Leaderboard for comparing model performance across code, math, and general language tasks (13,415 likes)

RESEARCH

Paper of the Day

Time Is a Feature: Exploiting Temporal Dynamics in Diffusion Language Models (2025-08-12)

Authors: Wen Wang, Bozhen Fang, Chenchen Jing, Yongliang Shen, Yangyi Shen, Qiuyu Wang, Hao Ouyang, Hao Chen, Chunhua Shen

Institution(s): Multiple institutions including Zhejiang University

This paper is significant because it reveals a critical but previously overlooked phenomenon in diffusion language models (dLLMs): temporal oscillation, where correct answers often emerge in intermediate denoising steps but are overwritten in later steps. This discovery challenges the conventional approach of only considering the final output of the diffusion process.

The researchers introduce two complementary methods that exploit temporal consistency in diffusion models: Temporal Self-Consistency Voting, a training-free decoding strategy that aggregates predictions across denoising steps and selects the most consistent answer, and Temporal Consistency Reinforcement, a post-training method that rewards stable generations. The authors report accuracy gains on math-reasoning benchmarks including GSM8K over an existing dLLM baseline, suggesting that intermediate denoising steps carry useful signal that can be exploited without adding parameters.
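
To make the voting idea concrete, here is a minimal sketch of how answers decoded at intermediate denoising steps could be aggregated instead of trusting only the last step. It is an illustration under stated assumptions, not the authors' implementation: the function name temporal_vote, the weight parameter, the tie-breaking rule, and the example trajectory are all hypothetical.

```python
from collections import defaultdict
from typing import Callable, Sequence


def temporal_vote(
    step_answers: Sequence[str],
    weight: Callable[[int, int], float] = lambda t, total: 1.0,
) -> str:
    """Return the answer with the largest weighted vote across denoising steps.

    step_answers[t] is the answer decoded at denoising step t (assumed input).
    weight(t, total) is a hypothetical weighting function; the default gives
    every step an equal vote.
    """
    if not step_answers:
        raise ValueError("step_answers must not be empty")
    total = len(step_answers)
    scores: dict[str, float] = defaultdict(float)
    for t, answer in enumerate(step_answers):
        scores[answer] += weight(t, total)
    # Assumption: ties are broken in favour of the answer seen at the latest step.
    last_seen = {answer: t for t, answer in enumerate(step_answers)}
    return max(scores, key=lambda a: (scores[a], last_seen[a]))


# Hypothetical trajectory: "42" appears mid-way but is overwritten at the end.
trajectory = ["17", "42", "42", "42", "40"]
final_only = trajectory[-1]            # what final-step decoding would return
voted = temporal_vote(trajectory)      # equal-weight vote over all steps
late_weighted = temporal_vote(
    trajectory, weight=lambda t, total: (t + 1) / total
)                                      # later steps count more
```

In this toy trajectory the final step alone yields a different answer from the one that dominates the intermediate steps, which is the oscillation the paper describes. Weighting later steps more heavily is one plausible design choice, since early steps are noisier.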

Notable Research

Retrospective Sparse Attention for Efficient Long-Context Generation (2025-08-12)

Authors: Seonghwan Choi, Beomseok Kang, Dongwon Jo, Jae-Joon Kim

This paper introduces a novel approach to reduce KV cache memory requirements during LLM inference by identifying important tokens through "retrospective attention", that is, by examining which past tokens were actually influential in generation. The authors report up to 10× speedup with only 0.1-1% performance degradation on long-context tasks.

Intrinsic Memory Agents: Heterogeneous Multi-Agent LLM Systems through Structured Contextual Memory (2025-08-12)

Authors: Sizhe Yuen, Francisco Gomez Medina, Ting Su, Yali Du, Adam J. Sobey

The researchers present a framework that addresses context window limitations in multi-agent LLM systems through structured agent-specific memories that evolve intrinsically with agent outputs, maintaining a consistent world model across multiple agents and improving performance on complex collaborative tasks by up to 28%.

ColorGPT: Leveraging Large Language Models for Multimodal Color Recommendation (2025-08-12)

Authors: Ding Xia, Naoto Inoue, Qianru Qiu, Kotaro Kikuchi

This paper introduces a novel framework that enables LLMs to perceive, understand, and recommend colors in context-aware scenarios by integrating visual elements and textual descriptions, outperforming previous methods by delivering more accurate and aesthetically pleasing color recommendations across various applications.

A Survey on Training-free Alignment of Large Language Models (2025-08-12)

Authors: Birong Pan, Yongqi Li, Weiyu Zhang, Wenpeng Lu, Mayi Xu, Shen Zhou, Yuanyuan Zhu, Ming Zhong, Tieyun Qian

This comprehensive survey provides the first systematic review of training-free alignment techniques for LLMs, categorizing methods into prompt-based, inference-time, and post-processing approaches while analyzing their effectiveness, limitations, and potential future directions for aligning models without resource-intensive training.


LOOKING AHEAD

As we move deeper into Q3 2025, the AI landscape continues its rapid evolution. The emerging integration of multimodal reasoning with specialized domain knowledge appears to be the next frontier, with several research labs demonstrating promising results in combining visual, auditory, and scientific reasoning capabilities within unified architectures. We expect these systems to reach commercial applications by Q1 2026.

Meanwhile, the regulatory framework established in late 2024 is showing its limitations as AI capabilities outpace governance structures. The upcoming UN AI Summit in November will likely address these gaps, particularly around synthetic media verification and autonomous AI agents. Companies investing in compliance infrastructure now will have significant advantages as these regulations inevitably tighten in early 2026.
