AGI Agent

Subscribe
Archives
June 20, 2025

LLM Daily: June 20, 2025

πŸ” LLM DAILY

Your Daily Briefing on Large Language Models

June 20, 2025

HIGHLIGHTS

β€’ SportsVisio has secured $3.2 million to develop AI sports analytics technology, with Sony Innovation Fund's participation indicating growing interest in AI applications for athletic performance optimization.

β€’ The Chroma diffusion model is gaining significant traction in the Stable Diffusion community, with users reporting impressive image generation results and quality improvements over previous models.

β€’ Meta's Segment Anything Model (SAM) repository has expanded to include SAM 2, which extends the original image segmentation capabilities to now handle video segmentation as well.

β€’ Researchers from Singapore University of Technology and Design have developed GRPO (Group Reinforcement Learning from Preferences Optimization), a novel approach using verifiable rewards to enhance citation-based grounding in LLMs.

β€’ An educational repository "LLMs-from-scratch" has become extremely popular (51,000+ stars) for teaching developers how to build ChatGPT-like models in PyTorch, recently updating with a Qwen3 implementation.


BUSINESS

Funding & Investment

SportsVisio Raises $3.2M for AI Sports Analytics (2025-06-18)
SportsVisio has secured $3.2 million in funding to develop AI technology for athletes, coaches, and fans. The round included participation from the Sony Innovation Fund, signaling growing interest in AI applications for sports analytics and performance optimization. Source

Multiplier Secures $27.5M for AI-Powered Accounting (2025-06-18)
Multiplier, founded by an ex-Stripe executive, has raised $27.5 million in combined seed and Series A funding led by Lightspeed Venture Capital and Ribbit Capital. The company is developing AI-powered solutions for accounting roll-ups, demonstrating continued investor confidence in AI applications for financial services. Source

Sequoia Capital Announces Investment in Traversal (2025-06-18)
Sequoia Capital has announced a partnership with Traversal, an AI-powered troubleshooting platform for engineers. The investment highlights Sequoia's continued focus on enterprise AI tools that address specific pain points in the software development lifecycle. Source

Sequoia Capital Backs Crosby's AI-Powered Law Firm (2025-06-17)
Sequoia Capital has invested in Crosby, an AI-augmented law firm designed to operate "at the speed of AI." This partnership indicates growing investor interest in legal tech startups that leverage AI to transform traditional professional services. Source

M&A

Wix Acquires Base44 for $80M Cash (2025-06-18)
Website building platform Wix has acquired Base44, a six-month-old "vibe coding" startup, for $80 million in cash. The solo-owned company reportedly grew to 250,000 users and was generating nearly $200,000 in monthly profits before the acquisition. This deal highlights the accelerating pace at which AI startups can scale and achieve successful exits. Source

Company Updates

Google Launches Production-Ready Gemini 2.5 Models (2025-06-17)
Google has released its production-ready Gemini 2.5 Pro and Flash AI models, positioning them as enterprise solutions to challenge OpenAI's market dominance. The company also introduced a cost-efficient Flash-Lite model, targeting price-sensitive segments. The new models are available through Google's Vertex AI platform, marking a significant step in Google's AI enterprise strategy. Source

OpenAI Secures $200M Department of Defense Contract (2025-06-17)
OpenAI has secured a $200 million contract with the U.S. Department of Defense, potentially creating tension with Microsoft, one of its major investors. The contract could put OpenAI in direct competition with Microsoft's own efforts to sell AI services to the DoD, highlighting the complex relationship between the two companies. Source

OpenAI Releases Open-Source Customer Service Agent Framework (2025-06-18)
OpenAI has open-sourced a new Customer Service Agent framework, offering transparent tooling and implementation examples to accelerate the adoption of agentic systems. This release aligns with OpenAI's expanding enterprise strategy, moving AI agents from experimental to practical business applications. Source

Midjourney Launches First AI Video Generation Model (2025-06-18)
Midjourney has released V1, its first AI video generation model, expanding beyond its core image generation capabilities. This launch signals Midjourney's entry into the increasingly competitive AI video generation market, following similar moves by companies like Runway and OpenAI. Source

Market Analysis

Nvidia's AI Investment Portfolio Grows to 80+ Startups (2025-06-19)
Over the past two years, Nvidia has leveraged its surging profits to invest in more than 80 AI startups, establishing itself as a dominant force in AI venture capital. This investment strategy extends Nvidia's influence throughout the AI ecosystem while providing early access to promising technologies that could complement its hardware business. Source

Amazon Plans to Reduce Corporate Workforce Due to AI (2025-06-17)
Amazon has announced expectations to reduce its corporate workforce as AI technologies increase efficiency across the organization. This development signals a growing trend of major corporations restructuring their operations in response to AI adoption, with potential implications for the broader labor market. Source

GenLayer Launches AI-Blockchain Marketing Platform (2025-06-19)
GenLayer has introduced a new platform combining AI and blockchain technologies to incentivize brand marketing. The company's Rally application, currently in beta, represents an emerging category of intelligent blockchain infrastructure with potential applications in decentralized marketing campaigns. Source

24 US AI Startups Raise $100M+ in 2025 So Far (2025-06-18)
A new report highlights that 24 US-based AI startups have raised funding rounds of $100 million or more in 2025 to date. This metric provides insight into the continued strength of AI venture funding, following what was described as a "monumental" year for the AI industry in 2024. Source


PRODUCTS

Chroma Diffusion Model Gains Traction

Source: Reddit Discussion (2025-06-19)

The Chroma diffusion model is receiving enthusiastic community reception based on recent discussions. Users report impressive results when using negative prompts and highlight the model's ability to generate high-quality images. While the exact release date isn't specified, it appears to be a relatively new addition to the Stable Diffusion ecosystem. Users note it offers quality improvements over previous models, though with the characteristic occasional finger-counting issues common to many image generation models.

Self-Hosted Databricks Alternative

Source: Reddit Post (2025-06-19)

An ML Engineer has developed a self-hosted alternative to Databricks, aiming to reduce infrastructure overhead while maintaining end-to-end project capabilities. The tool appears to focus on streamlining data pipelines and basic model deployment (like XGBoost) for organizations that don't require enterprise-scale solutions. The developer cites frustration with the complexity and process overhead of larger platforms as motivation for creating this alternative solution.


TECHNOLOGY

Open Source Projects

rasbt/LLMs-from-scratch

A comprehensive educational repository for building ChatGPT-like LLMs in PyTorch from scratch. This project serves as the official code companion for the book "Build a Large Language Model (From Scratch)" and walks through the entire process of developing, pretraining, and fine-tuning GPT-like models. The repository has garnered significant attention with over 51,000 stars and was recently updated with a Qwen3 implementation.

facebookresearch/segment-anything

Meta's Segment Anything Model (SAM) repository provides code for running inference with their powerful image segmentation model. The repository recently added information about SAM 2, which extends the original capabilities to handle video segmentation as well. With over 50,500 stars, SAM continues to be a foundational tool for computer vision tasks, providing pre-trained model checkpoints and example notebooks.

Models & Datasets

Models

nanonets/Nanonets-OCR-s

A fine-tuned version of Qwen2.5-VL-3B-Instruct specialized for OCR (Optical Character Recognition) tasks. This model excels at converting PDFs to markdown and has accumulated 881 likes with over 28,000 downloads, making it particularly valuable for document processing applications.

MiniMaxAI/MiniMax-M1-80k

A large language model supporting an 80k context window, enabling processing of very long documents. With 415 likes and specialized for text generation and conversational tasks, this model offers capabilities for handling extensive context information in a single prompt.

Menlo/Jan-nano

A fine-tuned version of Qwen3-4B optimized for conversational AI applications. Despite its compact size, this model has gained popularity with 276 likes and nearly 5,000 downloads, making it suitable for deployment in resource-constrained environments.

vrgamedevgirl84/Wan14BT2VFusioniX

A merged model built on the Wan2.1-T2V-14B base, specialized for text-to-video generation. With 234 likes, this diffusion model represents the growing trend of accessible video generation capabilities in the open-source community.

Datasets

institutional/institutional-books-1.0

A large-scale text dataset containing between 100K and 1M samples, focused on book-length content. With 140 likes and over 9,300 downloads, this dataset provides valuable long-form text for training models that need to understand extended narratives and complex relationships.

EssentialAI/essential-web-v1.0

A massive web-scraped dataset containing between 10B and 100B entries. With 107 likes and over 8,500 downloads since its recent release on June 19th, this dataset provides extensive training material for large language models requiring diverse web content.

nvidia/Nemotron-Personas

NVIDIA's dataset of synthetic personas for text generation tasks. With 130 likes and over 14,300 downloads, this collection helps develop conversational AI systems with consistent personality traits and behaviors across interactions.

Developer Tools & Infrastructure

MiniMaxAI/MiniMax-M1

A Gradio-based interface for interacting with the MiniMax-M1 language model. This space provides developers and users with a simple way to test the model's capabilities without setting up their own infrastructure.

aisheets/sheets

A Docker-based application that brings AI capabilities to spreadsheet-like interfaces. With 261 likes, this tool represents the growing trend of integrating AI directly into familiar productivity tools that knowledge workers use daily.

ResembleAI/Chatterbox

A voice conversation platform built on Gradio that has gained significant traction with 1,128 likes. Chatterbox enables natural voice interactions with AI systems, making advanced audio capabilities more accessible to developers.

Kwai-Kolors/Kolors-Virtual-Try-On

An extremely popular application (9,087 likes) that allows users to virtually try on clothing. This space demonstrates the practical application of computer vision and generative AI for e-commerce and fashion retail, showing how AI can enhance shopping experiences.


RESEARCH

Paper of the Day

Lessons from Training Grounded LLMs with Verifiable Rewards (2025-06-18)

Authors: Shang Hong Sim, Tej Deep Pala, Vernon Toh, Hai Leong Chieu, Amir Zadeh, Chuan Li, Navonil Majumder, Soujanya Poria

Institution: Singapore University of Technology and Design

This paper stands out for tackling one of the most critical challenges in modern LLMs: ensuring responses are properly grounded and trustworthy. The researchers present a novel approach using reinforcement learning with verifiable rewards to enhance citation-based grounding in LLMs, addressing widespread issues where models fail at using evidence correctly even when it's available.

The authors introduce GRPO (Group Reinforcement Learning from Preferences Optimization) to enhance grounding in LLMs, showing that a specially designed reward function targeting three key aspects (hallucination reduction, citation accuracy, and helpfulness) leads to significant improvements. Their research demonstrates that explicit reasoning steps through a Chain-of-Thought approach is crucial for proper grounding, particularly for complex queries, providing important insights for developers working on factual reliability in AI systems.

Notable Research

PhantomHunter: Detecting Unseen Privately-Tuned LLM-Generated Text via Family-Aware Learning (2025-06-18)
Authors: Yuhui Shi, Yehan Yang, Qiang Sheng, et al.
This paper addresses the emerging challenge of detecting text generated by privately-tuned LLMs, introducing a "family-aware learning" approach that identifies subtle patterns preserved across model families even after customization, significantly outperforming existing detectors against these previously "phantom" private models.

RAS-Eval: A Comprehensive Benchmark for Security Evaluation of LLM Agents in Real-World Environments (2025-06-18)
Authors: Yuchuan Fu, Xiaohan Yuan, Dongxia Wang
The researchers introduce the first comprehensive security benchmark for LLM agents operating in dynamic environments, featuring 80 test cases and 3,802 attack tasks across 11 CWE categories, with extensive evaluation showing that even leading models like GPT-4 remain vulnerable to various attacks when using tools in real-world settings.

Creating User-steerable Projections with Interactive Semantic Mapping (2025-06-18)
Authors: Artur AndrΓ© Oliveira, Mateus Espadoto, Roberto Hirata Jr., et al.
This innovative paper leverages Multimodal LLMs for customizable data visualization, allowing users to define semantic dimensions through natural language for steering projections of high-dimensional data, demonstrating superior interpretability and usability compared to traditional dimensionality reduction techniques across diverse datasets.

Targeted Lexical Injection: Unlocking Latent Cross-Lingual Alignment in Lugha-Llama via Early-Layer LoRA Fine-Tuning (2025-06-18)
Authors: Stanley Ngugi
This research presents a novel fine-tuning approach called Targeted Lexical Injection that significantly improves cross-lingual alignment for low-resource languages like Swahili, using early-layer LoRA fine-tuning to efficiently enhance bilingual lexical retrieval with minimal computational resources and training data.


LOOKING AHEAD

As we move into the second half of 2025, we're witnessing the early stages of truly multimodal AI assistants that seamlessly integrate with physical environments through IoT networks. The convergence of large language models with sophisticated vision, audio, and haptic systems is enabling new applications in healthcare diagnostics and personalized education that were merely theoretical a year ago.

Looking toward Q4 2025 and beyond, we anticipate significant breakthroughs in computational efficiency as neuromorphic computing approaches commercial viability. The regulatory landscape will likely tighten further following the EU's comprehensive AI Act implementation, with the US expected to announce its federal framework by year's end. Companies that have invested in interpretable AI architectures will find themselves advantageously positioned as these regulations take effect.

Don't miss what's next. Subscribe to AGI Agent:
GitHub X
Powered by Buttondown, the easiest way to start and grow your newsletter.