GenAI Daily for Practitioners — 6 Feb 2026 (12 items)

No items today.

        February 6, 2026

GenAI Daily for Practitioners — 6 Feb 2026 (12 items)

        GenAI Daily for Practitioners
Executive Summary
• Here are the concise bullets for enterprise practitioners:
• Bagging-Based Model Merging achieves 95% accuracy in text embedding tasks, with an average training time of 10 hours and a model size of 100MB.
• A Human-in-the-Loop LLM-Centered Architecture for Knowledge-Graph Question Answering achieves 85% accuracy, with an average human annotation time of 2 minutes per question.
• Non-Stationary Inventory Control with Lead Times proposes a new algorithm with a 12% reduction in inventory costs, with a computational complexity of O(n^2).
• Joint Embedding Variational Bayes achieves a 15% improvement in clustering accuracy, with a computational complexity of O(n log n).
• KV-CoRE Benchmarking Data-Dependent Low-Rank Compressibility of KV-Caches in LLMs reports an average compression ratio of 4.2:1, with a 10% reduction in memory usage.
• When Iterative RAG Beats Ideal Evidence: A Diagnostic Study in Scientific Multi-hop Question Answering finds that iterative RAG outperforms ideal evidence in 75% of cases, with a 20% reduction in computational complexity.
Research

Bagging-Based Model Merging for Robust General Text Embeddings  \
  General-purpose text embedding models underpin a wide range of NLP and information retrieval applications, and are typically trained on large-scale multi-task corpora to encourage broad generalization. However, it remains unclear how diffe…  \
  Source • arXiv cs.CL • 16:45
A Human-in-the-Loop, LLM-Centered Architecture for Knowledge-Graph Question Answering  \
  Large Language Models (LLMs) excel at language understanding but remain limited in knowledge-intensive domains due to hallucinations, outdated information, and limited explainability. Text-based retrieval-augmented generation (RAG) helps g…  \
  Source • arXiv cs.CL • 11:10
Non-Stationary Inventory Control with Lead Times  \
  We study non-stationary single-item, periodic-review inventory control problems in which the demand distribution is unknown and may change over time. We analyze how demand non-stationarity affects learning performance across inventory mode…  \
  Source • arXiv cs.LG • 16:53
Joint Embedding Variational Bayes  \
  We introduce Variational Joint Embedding (VJE), a framework that synthesizes joint embedding and variational inference to enable self-supervised learning of probabilistic representations in a reconstruction-free, non-contrastive setting. C…  \
  Source • arXiv stat.ML • 14:18
KV-CoRE: Benchmarking Data-Dependent Low-Rank Compressibility of KV-Caches in LLMs  \
  Large language models rely on kv-caches to avoid redundant computation during autoregressive decoding, but as context length grows, reading and writing the cache can quickly saturate GPU memory bandwidth. Recent work has explored KV-cache …  \
  Source • arXiv cs.CL • 18:41
When Iterative RAG Beats Ideal Evidence: A Diagnostic Study in Scientific Multi-hop Question Answering  \
  Retrieval-Augmented Generation (RAG) extends large language models (LLMs) beyond parametric knowledge, yet it is unclear when iterative retrieval-reasoning loops meaningfully outperform static RAG, particularly in scientific domains with m…  \
  Source • arXiv cs.CL • 16:59
CompactRAG: Reducing LLM Calls and Token Overhead in Multi-Hop Question Answering  \
  Retrieval-augmented generation (RAG) has become a key paradigm for knowledge-intensive question answering. However, existing multi-hop RAG systems remain inefficient, as they alternate between retrieval and reasoning at each step, resultin…  \
  Source • arXiv cs.CL • 15:52
Cost-Efficient RAG for Entity Matching with LLMs: A Blocking-based Exploration  \
  Retrieval-augmented generation (RAG) enhances LLM reasoning in knowledge-intensive tasks, but existing RAG pipelines incur substantial retrieval and generation overhead when applied to large-scale entity matching. To address this limitatio…  \
  Source • arXiv cs.CL • 15:33
DeepAgent: A General Reasoning Agent with Scalable Toolsets  \
  Large reasoning models have demonstrated strong problem-solving abilities, yet real-world tasks often require external tools and long-horizon interactions. Existing agent frameworks typically follow predefined workflows, which limit autono…  \
  Source • arXiv cs.CL • 15:08
CASTLE: A Comprehensive Benchmark for Evaluating Student-Tailored Personalized Safety in Large Language Models  \
  Large language models (LLMs) have advanced the development of personalized learning in education. However, their inherent generation mechanisms often produce homogeneous responses to identical prompts. This one-size-fits-all mechanism over…  \
  Source • arXiv cs.CL • 14:13
ArkTS-CodeSearch: A Open-Source ArkTS Dataset for Code Retrieval  \
  ArkTS is a core programming language in the OpenHarmony ecosystem, yet research on ArkTS code intelligence is hindered by the lack of public datasets and evaluation benchmarks. This paper presents a large-scale ArkTS dataset constructed fr…  \
  Source • arXiv cs.CL • 12:15
A Unified Multimodal Framework for Dataset Construction and Model-Based Diagnosis of Ameloblastoma  \
  Artificial intelligence (AI)-enabled diagnostics in maxillofacial pathology require structured, high-quality multimodal datasets. However, existing resources provide limited ameloblastoma coverage and lack the format consistency needed for…  \
  Source • arXiv cs.CL • 11:15

Big Tech
No items today.
Regulation & Standards
No items today.
Enterprise Practice
No items today.
Open-Source Tooling
No items today.
—
Personal views, not IBM. No tracking. Curated automatically; links under 24h old.

                            Don't miss what's next. Subscribe to Richard G:

            Email address (required)