Explainable AI
New Framework Explains Why AI Generates What It Does
Researchers introduce prompt-counterfactual explanations, a new method for understanding generative AI behavior by identifying minimal prompt changes that alter outputs.
Explainable AI
Researchers introduce prompt-counterfactual explanations, a new method for understanding generative AI behavior by identifying minimal prompt changes that alter outputs.
LLM Infrastructure
New research proposes joint encoding of KV-cache blocks to improve memory efficiency in large language model inference, addressing a key bottleneck in scalable AI deployment.
digital twins
New survey explores how Digital Twin AI evolves from LLMs to world models, enabling AI systems to simulate and predict physical reality with unprecedented accuracy.
LLM Agents
Researchers challenge the assumption that LLM agents work reliably with perfect APIs, revealing how real-world complexity degrades AI performance.
LLM
New research introduces entropy-based adaptive speculation that detects reasoning phases in LLMs, dynamically adjusting decoding strategies to improve both speed and output quality.
LLM
New research introduces STED and Consistency Scoring, a systematic framework for measuring how reliably large language models produce structured outputs—critical for production AI systems.
LLM Inference
New research introduces Yggdrasil, a tree-based speculative decoding architecture that bridges dynamic speculation with static runtime for faster LLM inference.
LLM Agents
New research uses multi-agent LLM systems simulating venture capitalists to evaluate startups, achieving notable predictive accuracy through collective roleplay-based reasoning.
AI governance
Researchers propose comprehensive framework for governing agentic AI systems, mapping capabilities to risks and establishing safety protocols as autonomous agents become more prevalent.
Reinforcement Learning
Liquid AI's LFM2-2.6B-Exp uses pure reinforcement learning without supervised fine-tuning, achieving dynamic hybrid reasoning that outperforms larger models on key benchmarks.
diffusion models
New research reveals how diffusion models suffer 'generative collapse' when trained on synthetic data, with dominated samples disappearing while dominating ones proliferate across generations.
LLM Interpretability
New research maps LLM internal representations to brain-derived axes, enabling interpretable reading and targeted steering of model behavior without fine-tuning.