AGI Agent

Subscribe
Archives
June 16, 2025

LLM Daily: June 16, 2025

🔍 LLM DAILY

Your Daily Briefing on Large Language Models

June 16, 2025

HIGHLIGHTS

• Meta has made a strategic $14.3 billion investment in Scale AI for a 49% stake, valuing the data-labeling company at $29 billion and bringing co-founder Alexandr Wang onto Meta's team as part of their aggressive push to remain competitive in the AI race.

• A developer has created an open-source Swift application that wraps Apple's new on-device intelligence models in an OpenAI-compatible API, allowing users to access Apple's local language models through standard API calls without data leaving their Mac.

• Microsoft's "AI-Agents-for-Beginners" educational repository has gained significant traction with over 26,000 GitHub stars, offering 11 comprehensive lessons for building AI agents with recent updates focused on translations for global accessibility.

• Meta AI Research has released V-JEPA 2, a breakthrough self-supervised video model trained on over 1 million hours of internet-scale video data combined with limited robot interaction data, enabling AI systems that can understand, predict, and plan actions in physical environments.


BUSINESS

Funding & Investment

Scale AI Secures $14.3B Investment from Meta at a $29B Valuation

Meta has made a major strategic investment in Scale AI, acquiring a 49% stake in the data-labeling company for $14.3 billion. This partnership includes bringing Scale's co-founder Alexandr Wang onto Meta's team, signaling Meta's urgent push to remain competitive in the AI race. The deal values Scale AI at approximately $29 billion. (2025-06-13) TechCrunch

Clay Raises New Round at $3B Valuation

Sales automation startup Clay has secured a new funding round that doubles its valuation to $3 billion, just one month after conducting a tender offer at a $1.5 billion valuation. CapitalG is among the investors in this rapidly growing AI-powered sales platform. (2025-06-13) TechCrunch

Sequoia Capital Backs Nominal in Hardware Engineering Push

Sequoia Capital announced its investment in Nominal, a startup focused on advancing hardware engineering capabilities. The venture firm is positioning this partnership as supporting "the next era of hardware engineering," likely with AI applications at its core. (2025-06-12) Sequoia Capital

Company Updates

Google Reportedly Ending $200M Scale AI Partnership

Following Meta's massive investment in Scale AI, Google is reportedly planning to terminate its relationship with the data-labeling startup. According to Reuters, Google had planned to pay Scale AI $200 million this year but is now exploring partnerships with Scale's competitors instead. (2025-06-14) TechCrunch

TensorWave Deploys AMD's New MI355X GPUs in Cloud Platform

TensorWave has announced the deployment of AMD's latest Instinct MI355X GPUs in its high-performance cloud platform, expanding the availability of advanced AI compute options for developers and enterprises. (2025-06-12) VentureBeat

AMD Launches Instinct MI350 Series Accelerators

AMD has unveiled its new Instinct MI350 Series accelerators, claiming performance that's four times faster for AI compute and 35 times faster for inferencing compared to previous generations. This release positions AMD to compete more aggressively in the AI chip market currently dominated by NVIDIA. (2025-06-12) VentureBeat

Google Tests Audio Overviews for Search Queries

Google has begun testing Audio Overviews for certain search queries, expanding the AI-powered features available in its core search product. This feature likely leverages Google's recent advancements in text-to-speech technology. (2025-06-13) TechCrunch

Market Analysis

NVIDIA Excludes China from Revenue Forecasts Amid Export Restrictions

NVIDIA CEO Jensen Huang has announced that the company will no longer include China in its revenue and profit forecasts due to ongoing U.S. chip export restrictions. Huang expressed pessimism about any near-term changes to these restrictions, highlighting the continuing geopolitical tensions affecting the AI hardware market. (2025-06-13) TechCrunch

Taiwan Implements Export Controls on Huawei and SMIC

Taiwan has placed export controls on Chinese companies Huawei and SMIC, potentially limiting their access to resources needed for AI chip development. This move adds another layer to the ongoing semiconductor trade restrictions affecting the global AI hardware ecosystem. (2025-06-15) TechCrunch

DeepSeek Challenges High-Spend AI Development Paradigm

DeepSeek is disrupting the conventional high-spend, high-compute paradigm of AI development with a more efficient approach that has accelerated certain AI advancements by "a few years." Their strategy offers an alternative path for AI development that doesn't require the massive computational resources typical of industry leaders. (2025-06-14) VentureBeat


PRODUCTS

New Apple Intelligence API Wrapper for Local LLMs

Developer: Individual Developer (FixedPt) | Released: 2025-06-15 GitHub Repository

A developer has created an open-source Swift application that wraps Apple's new on-device intelligence models (from macOS 26) in an OpenAI-compatible API. This allows users to access Apple's local language models through standard OpenAI API calls, with everything running completely on-device without data leaving the Mac. The project enables compatibility with any OpenAI client by pointing it to a local endpoint (http://127.0.0.1:11535). The MIT-licensed tool gives developers and users a way to leverage Apple's AI capabilities through familiar API interfaces that many existing applications already support.

Vace FusionX Video Generation Tool

Developer: Unspecified (possibly startup) | Featured: 2025-06-15 Reddit Demonstration

A new AI video generation tool called Vace FusionX is gaining attention in the Stable Diffusion community. The system appears to combine reference images, background images, and controlnet guidance to create and extend AI-generated videos. A user demonstrated the tool's capabilities by generating a video in 4-second chunks, with each extension adding approximately 3 seconds of additional footage while maintaining relative consistency through frame overlapping. Community reception has been positive, with users noting the impressive quality despite some expected degradation after multiple extensions. The tool's ability to maintain visual coherence across 20 extensions represents progress in the challenging field of longer-form AI video generation.


TECHNOLOGY

Open Source Projects

Microsoft/AI-Agents-for-Beginners

A comprehensive educational course featuring 11 lessons to get started building AI agents. This project has gained significant traction with over 26,000 GitHub stars and 7,000+ forks, showing strong community interest. The repository is actively maintained with recent updates focused on translations, making it accessible to a global audience.

DataWhaleChina/Self-LLM

A detailed guide for fine-tuning and deploying open-source large language models on Linux environments, specifically designed for Chinese users. With 18,471 stars and nearly 2,000 forks, this project provides comprehensive instructions for working with models like LLaMA, ChatGLM, and InternLM. Recent commits show active maintenance with bug fixes and content updates.

Models & Datasets

MistralAI/Magistral-Small-2506

A multilingual conversational model from Mistral AI supporting 25+ languages including English, French, German, Spanish, Japanese, Korean, and many others. With 425 likes and 13,500+ downloads, this model builds upon the Mistral-Small architecture with enhanced multilingual capabilities as detailed in the arxiv:2506.10910 paper.

Nanonets/Nanonets-OCR-s

A specialized OCR model built on Qwen2.5-VL-3B-Instruct, optimized for converting images and PDFs to markdown text. With 270 likes and nearly 8,000 downloads, this model addresses document processing needs with a focus on high-quality text extraction from visual content.

Echo840/MonkeyOCR

A transformers-based OCR solution for image-to-text conversion with 252 likes. Based on research detailed in arxiv:2506.05218, this model offers endpoints compatibility for integration into production environments under an Apache 2.0 license.

OpenBMB/MiniCPM4-8B

An 8 billion parameter conversational model supporting both Chinese and English, with 243 likes and over 7,000 downloads. Built on the MiniCPM architecture (arxiv:2506.07900), this model is compatible with AutoTrain for easier fine-tuning and deployment.

Qwen/Qwen3-Embedding-0.6B-GGUF

A lightweight embedding model in GGUF format based on Qwen3-0.6B-Base, optimized for vector representations and semantic search. With 365 likes and 18,000+ downloads, this model offers efficient embedding generation with endpoints compatibility for production use.

Datasets

NVIDIA/Nemotron-Personas

A synthetic persona dataset from NVIDIA containing between 100K and 1M entries for text generation tasks. With 101 likes and 8,800+ downloads, this CC-BY-4.0 licensed dataset provides diverse persona information to improve conversational AI capabilities.

Open-Thoughts/OpenThoughts3-1.2M

A large-scale dataset containing 1.2M entries focused on reasoning, mathematics, code, and science text generation. With 100 likes and 15,500+ downloads, this Apache 2.0 licensed dataset (arxiv:2506.04178) provides rich content for training models that require advanced reasoning capabilities.

Miriad/Miriad-5.8M

A substantial text dataset containing 5.8M entries documented in arxiv:2506.06091. With 28 likes and nearly 1,800 downloads, this dataset is compatible with multiple libraries including datasets, dask, mlcroissant, and polars for efficient processing.

Developer Tools & Platforms

ResembleAI/Chatterbox

A Gradio-based application with over 1,000 likes, enabling conversational interfaces powered by ResembleAI technology. This space represents a user-friendly front-end for voice and chat interactions.

AiSheets/Sheets

A Docker-based application with 198 likes that likely provides spreadsheet-like functionality enhanced with AI capabilities, enabling data analysis and manipulation through AI assistance.

Kwai-Kolors/Kolors-Virtual-Try-On

An extremely popular Gradio application with over 9,000 likes that enables virtual clothing try-on using AI. This tool demonstrates practical application of computer vision and generative AI in the fashion and e-commerce domains.

WebML-Community/Conversational-WebGPU

A static web application with 184 likes that likely showcases WebGPU capabilities for running AI models directly in the browser, enabling conversational AI without server-side processing.


RESEARCH

Paper of the Day

V-JEPA 2: Self-Supervised Video Models Enable Understanding, Prediction and Planning (2025-06-11)

Authors: Mido Assran, Adrien Bardes, David Fan, Quentin Garrido, Russell Howes, Mojtaba Komeili, Matthew Muckley, Ammar Rizvi, Claire Roberts, Koustuv Sinha, Artem Zholus, Sergio Arnaud, Abha Gejji, Ada Martin, Francois Robert Hogan, Daniel Dugas, Piotr Bojanowski, Vasil Khalidov, Patrick Labatut, Francisco Massa, Marc Szafraniec, Kapil Krishnakumar, Yong Li, Xiaodong Ma, Sarath Chandar, Franziska Meier, Yann LeCun, Michael Rabbat, Nicolas Ballas

Institution(s): Meta AI Research

This paper represents a significant advancement in self-supervised AI that can understand, predict, and act in the physical world. V-JEPA 2 combines internet-scale video data (over 1 million hours) with limited robot interaction data to create models capable of understanding physical environments and planning actions, bridging the gap between passive video understanding and active embodied intelligence.

The researchers demonstrate that their joint-embedding-predictive architecture can perform complex planning tasks without explicit reinforcement learning, outperforming specialized action models on robotics benchmarks. The approach shows that self-supervised learning on large-scale video data can develop representations that transfer effectively to downstream planning tasks, suggesting a promising path toward more general AI systems that learn primarily through observation.

Notable Research

Long-Short Alignment for Effective Long-Context Modeling in LLMs (2025-06-13)

Authors: Tianqi Du, Haotian Huang, Yifei Wang, Yisen Wang

This paper introduces a fresh perspective on length generalization in LLMs, proposing "Long-Short Alignment" that leverages short-range patterns to improve models' ability to process sequences longer than those seen during training, significantly enhancing long-context performance without additional training data.

Revealing Political Bias in LLMs through Structured Multi-Agent Debate (2025-06-13)

Authors: Aishwarya Bandaru, Fabian Bindley, Trevor Bluth, Nandini Chavda, Baixu Chen, Ethan Law

This research investigates how LLM type and agent gender attributes influence political bias by creating a structured multi-agent debate framework with Neutral, Republican, and Democrat American LLM agents discussing politically sensitive topics, revealing important insights about bias manifestation in multi-agent LLM systems.

Configurable Preference Tuning with Rubric-Guided Synthetic Data (2025-06-13)

Authors: Víctor Gallego

This paper challenges the assumption of monolithic preferences in AI alignment by introducing Configurable Preference Tuning (CPT), a framework that enables language models to dynamically adjust their behavior based on explicit human-interpretable directives, using synthetically generated preference data guided by detailed rubrics.

SEC-bench: Automated Benchmarking of LLM Agents on Real-World Software Security Tasks (2025-06-13)

Authors: Hwiwon Lee, Ziqi Zhang, Hanxiao Lu, Lingming Zhang

The researchers introduce SEC-bench, a comprehensive benchmark for evaluating LLM agents on real-world software security tasks, providing automated evaluation metrics for vulnerability detection, exploit generation, and patch development across 100 diverse security challenges from recent CVEs and security competitions.


LOOKING AHEAD

As we move toward Q3 2025, we're seeing significant momentum in multimodal AI systems that can seamlessly process and generate across text, vision, audio, and physical interactions. The integration of specialized domain knowledge into general-purpose models is accelerating, with healthcare and scientific research deployments showing particular promise. Regulatory frameworks are finally catching up, with the EU's AI Act implementation providing a potential template for other regions.

By year-end, expect widespread adoption of localized, energy-efficient models that can run entirely on-device without cloud dependencies. The emerging "AI orchestration" field—coordinating multiple specialized models rather than relying on single monolithic systems—will likely dominate enterprise AI strategy discussions in early 2026 as organizations seek both performance and cost optimization.

Don't miss what's next. Subscribe to AGI Agent:
GitHub X
Powered by Buttondown, the easiest way to start and grow your newsletter.