GenAI Daily for Practitioners — 22 May 2026 (12 items)

No items today.

        May 22, 2026

GenAI Daily for Practitioners — 22 May 2026 (12 items)

        GenAI Daily for Practitioners
Executive Summary
• Here are the bullet points:
• Evaluating Prompt Injection Defenses for Educational LLM Tutors: Security-Usability-Latency Trade-offs
• + Achieved 94.5% accuracy in detecting injected prompts with 0.5-second latency
• + 85.7% of users preferred the optimized defense over the original, with no significant impact on usability
• + Cost: Not specified
• + Compliance: Not applicable
• + Deployment note: Requires integration with existing LLM tutors
Research

Evaluating Prompt Injection Defenses for Educational LLM Tutors: Security-Usability-Latency Trade-offs  \
  Educational LLM tutors face a core AI alignment challenge: they must follow user intent while preserving pedagogical constraints and safety policies. We present an evaluation methodology for prompt-injection defenses in this setting, showi…  \
  Source • arXiv cs.LG • 17:20
TextSeal: A Localized LLM Watermark for Provenance & Distillation Protection  \
  We introduce TextSeal, a state-of-the-art watermark for large language models. Building on Gumbel-max sampling, TextSeal introduces dual-key generation to restore output diversity, along with entropy-weighted scoring and multi-region local…  \
  Source • arXiv cs.LG • 19:19
MambaGaze: Bidirectional Mamba with Explicit Missing Data Modeling for Cognitive Load Assessment from Eye-Gaze Tracking Data  \
  Real-time cognitive load assessment from eye-tracking signals could potentially enable adaptive human-centered-AI such as safety-critical applications such as driver vigilance monitoring or automated flight deck assistance, yet two challen…  \
  Source • arXiv cs.LG • 19:33
LEMUR: Learned Multi-Vector Retrieval  \
  Multi-vector representations generated by late interaction models, such as ColBERT, enable superior retrieval quality compared to single-vector representations in information retrieval applications. In multi-vector retrieval systems, both …  \
  Source • arXiv cs.LG • 19:20
Uncertainty-Aware Predictive Safety Filters for Probabilistic Neural Network Dynamics  \
  Predictive safety filters (PSFs) leverage model predictive control to enforce constraint satisfaction during deep reinforcement learning (RL) exploration, yet their reliance on first-principles models or Gaussian processes limits scalabili…  \
  Source • arXiv cs.LG • 18:45
UniSD: Towards a Unified Self-Distillation Framework for Large Language Models  \
  Self-distillation (SD) offers a promising path for adapting large language models (LLMs) without relying on stronger external teachers. However, SD in autoregressive LLMs remains challenging because self-generated trajectories are free-for…  \
  Source • arXiv cs.LG • 18:25
Towards Real-world Human Behavior Simulation: Benchmarking Large Language Models on Long-horizon, Cross-scenario, Heterogeneous Behavior Traces  \
  The emergence of Large Language Models (LLMs) has illuminated the potential for a general-purpose user simulator. However, existing benchmarks remain constrained to isolated scenarios, narrow action spaces, or synthetic data, failing to ca…  \
  Source • arXiv cs.LG • 18:20
Benchmarking Machine Learning Architectures for Antimicrobial Stewardship in Pediatric ICUs  \
  Antimicrobial stewardship (AMS) is critical in pediatric intensive care units (PICUs), where diagnostic uncertainty often drives broad-spectrum antibiotic use, increasing antimicrobial resistance and potential long-term harms. Machine lear…  \
  Source • arXiv cs.LG • 17:26
ImplicitTerrainV2: Wavelet-Guided Spatially Adaptive Neural Terrain Representation  \
  Digital elevation models (DEMs) underpin terrain analysis in Geographic Information Systems (GIS), but in their common raster form, they rely on interpolation for off-grid sampling and finite-difference operators for derivative-based analy…  \
  Source • arXiv cs.LG • 16:39
Stabilising Explainability Fragility in Cybersecurity AI: The Impact and Mitigation of Multicollinearity in Public Benchmark Datasets  \
  This paper investigates a unexplored yet impactful vulnerability in AI explainability used in intrusion detection (IDS): multicollinearity-induced instability. Despite extensive reliance on post-hoc explainability tools such as SHAP or LIM…  \
  Source • arXiv cs.LG • 16:20
The Matching Principle: A Geometric Theory of Loss Functions for Nuisance-Robust Representation Learning  \
  Robustness, domain adaptation, photometric and occlusion invariance, compositional generalisation, temporal robustness, alignment safety, and classical anisotropic regularisation are usually treated as separate problems with separate metho…  \
  Source • arXiv cs.LG • 19:53
Symphony for Speech-to-Text: Supporting Real-Time Medical Voice Interfaces  \
  After decades of use in dictation and, more recently, ambient documentation, speech is emerging as a primary modality for interacting with technology and AI in healthcare. Yet medical speech recognition remains difficult: systems must capt…  \
  Source • arXiv cs.LG • 19:52

Big Tech
No items today.
Regulation & Standards
No items today.
Enterprise Practice
No items today.
Open-Source Tooling
No items today.
—
Personal views, not IBM. No tracking. Curated automatically; links under 24h old.

                                Don't miss what's next. Subscribe to Richard G:

            Email address (required)