LLM Daily: July 10, 2025

Neha Verma, Kenton Murray, Kevin Duh

                July 10, 2025

            LLM Daily: July 10, 2025

            🔍 LLM DAILY
Your Daily Briefing on Large Language Models
July 10, 2025
HIGHLIGHTS
• Microsoft has reported $500M in AI savings from call center operations over the past year, coinciding with 9,000 job cuts, highlighting the tangible financial impact of AI implementation on workforce dynamics.
• OpenAI is breaking from its closed-source tradition with plans to release an open source reasoning model on July 15th, potentially around 14B parameters in size, marking a significant shift in the company's approach.
• LangChain is poised to achieve unicorn status with a new funding round led by IVP that will value the AI infrastructure startup at approximately $1 billion, demonstrating continued strong investor interest in AI development tools.
• The DOTResize technique, developed by Johns Hopkins researchers, has achieved impressive model compression results by reducing Llama-2-7B's width by up to 40% while maintaining 95% of performance through discrete optimal transport-based neuron merging.
• Open source AI projects continue to show strong community momentum, with repositories like langchain-ai/langchain and Shubhamsaboo/awesome-llm-apps gaining significant traction (90 and 300+ daily stars respectively).

BUSINESS
Microsoft Reports $500M in AI Savings After Cutting 9,000 Jobs
Microsoft has disclosed internally that its AI implementations have generated over $500 million in savings in its call center operations alone over the past year. This announcement comes just days after the company cut 9,000 jobs, raising questions about the relationship between AI efficiency gains and workforce reductions.
TechCrunch (2025-07-09)
LangChain Set to Achieve Unicorn Status with New Funding Round
AI infrastructure startup LangChain is reportedly raising a new funding round led by IVP that will value the company at approximately $1 billion. This milestone highlights the continued investor interest in AI infrastructure and development tools.
TechCrunch (2025-07-08)
Mistral AI in Talks to Raise $1 Billion
French AI startup Mistral is reportedly in discussions to raise up to $1 billion in equity funding, with Abu Dhabi's MGX fund among the potential investors. This massive fundraising effort, if successful, would significantly strengthen Mistral's position in the competitive AI model development space.
TechCrunch (2025-07-08)
Hugging Face Launches $299 Robot to Democratize AI Development
Hugging Face has entered the hardware market with the release of Reachy Mini, an affordable $299 open-source desktop robot aimed at making AI robotics development accessible to millions of builders worldwide. This move could potentially disrupt the robotics industry by dramatically lowering the barrier to entry.
VentureBeat (2025-07-09)
Replit Partners with Microsoft in Strategic Cloud Deal
In a significant cloud partnership shift, coding platform Replit has announced a new deal with Microsoft Azure. This arrangement represents a loss for Google Cloud, Replit's previous partner, and strengthens Microsoft's position in the developer tools ecosystem.
TechCrunch (2025-07-08)
OpenAI Reportedly Launching AI Browser Soon
OpenAI is reportedly preparing to release an AI-powered browser in the coming weeks that will reimagine how users interact with the web. According to reports, the browser will keep some user interactions within ChatGPT rather than directing users to external websites, potentially disrupting traditional web navigation patterns.
TechCrunch (2025-07-09)

PRODUCTS
OpenAI to Release Open Source Reasoning Model
OpenAI is set to release their first open source large language model next Thursday (2025-07-15). According to discussions on Reddit, this model will focus specifically on reasoning capabilities. While details remain limited, there's speculation about the model size, with some suggesting it could be around 14B parameters. This release marks a significant shift in OpenAI's approach, which has historically kept its core models proprietary.
Source: Reddit discussion
Invoke 6.0 Released with Major Interface Overhaul
Invoke AI (established player) has launched version 6.0 of their image generation platform (2025-07-09), featuring a completely redesigned user interface and several significant new features. The update includes a reimagined AI canvas, integrated Flux Kontext Dev support, and layered PSD exports. The new interface is designed to be faster and more intuitive, bringing AI image generation tools closer to professional design software like Adobe Photoshop. Community reception has been overwhelmingly positive, with users praising the professional quality and artist-friendly approach.
Source: Reddit announcement

TECHNOLOGY
Open Source Projects
langchain-ai/langchain - 111,095 ⭐
LangChain is a framework for building context-aware reasoning applications that leverage LLMs. Recent updates include improvements to text splitters, fixing NaN handling in embedding vectors, and code quality enhancements. The framework continues to maintain strong momentum with nearly 90 new stars daily.
Shubhamsaboo/awesome-llm-apps - 49,319 ⭐
A comprehensive collection of LLM applications featuring AI agents and RAG implementations using various models from OpenAI, Anthropic, Gemini, and open-source alternatives. This repository has seen exceptional growth with over 300 stars added today, highlighting the community's interest in practical LLM implementations.
ultralytics/yolov5 - 54,535 ⭐
YOLOv5 provides high-performance object detection in PyTorch with export capabilities to ONNX, CoreML, and TFLite. Recent updates include fixing downstream impacts from torch.load replacements and security improvements in GitHub workflows, maintaining its position as a go-to computer vision solution.
Models & Datasets
New and Notable Models
THUDM/GLM-4.1V-9B-Thinking
A multimodal vision-language model that emphasizes reasoning capabilities. Built on GLM-4-9B, this model excels at handling complex visual reasoning tasks with its explicit thinking process, making it valuable for applications requiring visual understanding and logical inference.
black-forest-labs/FLUX.1-Kontext-dev
FLUX.1-Kontext is a powerful diffusion model for image generation with nearly 190K downloads. It's notable for its high-quality outputs and single-file diffusion architecture, making it particularly efficient for deployment in various image generation scenarios.
kyutai/tts-1.6b-en_fr
A bilingual text-to-speech model supporting both English and French, built on the Moshi architecture. With over 15K downloads, this model demonstrates the growing interest in multilingual speech synthesis capabilities.
HuggingFaceTB/SmolLM3-3B
SmolLM3 is a compact but capable 3B parameter language model supporting multiple languages including English, French, Spanish, Italian, Portuguese, Chinese, Arabic, and Russian. Despite its small size, it offers competitive performance for text generation and conversational tasks.
apple/DiffuCoder-7B-cpGRPO
A specialized code generation model from Apple based on a diffusion language model architecture. This model represents an interesting approach to code generation using diffusion techniques rather than traditional autoregressive methods.
Datasets
hackaprompt/Pliny_HackAPrompt_Dataset
A dataset focused on prompt engineering, red-teaming, and security evaluation of language models. With nearly 500 downloads, it provides valuable resources for testing model safety against prompt injections and jailbreaks.
HuggingFaceFW/fineweb-2
A massive multilingual web dataset for text generation with over 360K downloads. This dataset stands out for its comprehensive language coverage and scale, making it particularly valuable for training multilingual models.
Developer Tools & Spaces
FunAudioLLM/ThinkSound
An interactive demo for audio processing using large language models, allowing users to experiment with audio understanding and generation capabilities through a Gradio interface.
Kwai-Kolors/Kolors-Virtual-Try-On
A virtual try-on application with over 9,200 likes, demonstrating practical applications of computer vision for fashion e-commerce. This space allows users to visualize how clothing items would look on different models.
open-llm-leaderboard/open_llm_leaderboard
The definitive leaderboard for evaluating open large language models with over 13,000 likes. It provides standardized benchmarks across code, mathematics, and other domains to facilitate comparison between different models.
jbilcke-hf/ai-comic-factory
A popular tool with over 10,400 likes for generating AI comics. This space showcases how generative AI can be applied to creative storytelling and visual narrative construction.
kontext-community/FLUX.1-Kontext-portrait
A specialized implementation of the FLUX.1-Kontext model focused on portrait generation, demonstrating how general-purpose image generation models can be fine-tuned for specific use cases.

RESEARCH
Paper of the Day
DOTResize: Reducing LLM Width via Discrete Optimal Transport-based Neuron Merging (2025-07-06)
Neha Verma, Kenton Murray, Kevin Duh
Johns Hopkins University
This paper addresses a critical challenge in LLM deployment: reducing model size without sacrificing performance. DOTResize stands out for its innovative approach to model compression, framing neuron merging as a discrete optimal transport problem, which allows for more principled compression decisions compared to previous heuristic methods. The technique demonstrated impressive results, reducing the width of Llama-2-7B by up to 40% while maintaining 95% of its original performance, offering a practical path to making large models more accessible.
Notable Research
Squeeze the Soaked Sponge: Efficient Off-policy Reinforcement Finetuning for Large Language Models (2025-07-09)
Jing Liang, Hongyao Tang, Yi Ma, Jinyi Liu, Yan Zheng, Shuyue Hu, Lei Bai, Jianye Hao
This paper introduces an off-policy reinforcement learning approach that significantly improves the efficiency of LLM finetuning by reusing previously generated data, reducing training costs by 75% while achieving comparable or better performance than on-policy methods.
The Dark Side of LLMs: Agent-based Attacks for Complete Computer Takeover (2025-07-09)
Matteo Lupinacci, Francesco Aurelio Pironti, Francesco Blefari, Francesco Romeo, Luigi Arena, Angelo Furfaro
The researchers present the first comprehensive security evaluation demonstrating how LLM agents can be exploited as attack vectors for complete computer takeover, highlighting critical vulnerabilities that extend beyond traditional prompt injection attacks.
SkyVLN: Vision-and-Language Navigation and NMPC Control for UAVs in Urban Environments (2025-07-09)
Tianshun Li, Tianyi Huai, Zhen Li, Yichun Gao, Haoang Li, Xinhu Zheng
This novel framework integrates vision-and-language navigation with Nonlinear Model Predictive Control, enabling UAVs to understand natural language instructions and visual observations for autonomous navigation in complex urban environments.
Towards Multimodal Understanding via Stable Diffusion as a Task-Aware Feature Extractor (2025-07-09)
Vatsal Agarwal, Matthew Gwilliam, Gefen Kohavi, Eshan Verma, Daniel Ulbricht, Abhinav Shrivastava
The authors demonstrate that pre-trained text-to-image diffusion models can serve as instruction-aware visual encoders, capturing fine-grained details often missed by CLIP encoders and improving multimodal large language models' ability to answer detailed visual questions.

LOOKING AHEAD
As we move deeper into Q3 2025, the convergence of multimodal LLMs with specialized hardware is accelerating development cycles beyond previous projections. The recent breakthroughs in sparse attention mechanisms are enabling context windows approaching 10 million tokens, while maintaining inference costs at sustainable levels. Watch for the first wave of truly autonomous AI research agents to emerge by early Q4, capable of designing and conducting novel experiments with minimal human oversight.
Meanwhile, the regulatory landscape continues to evolve rapidly. With the EU's AI Act now fully implemented and similar frameworks advancing in the US and Asia, we anticipate new compliance-focused AI tools will become essential infrastructure for organizations by Q1 2026. These developments will likely catalyze the next evolution of human-AI collaboration systems designed specifically for regulated industries.

Don't miss what's next. Subscribe to AGI Agent: