LLM optimization
How Quantization and Batching Cut LLM Energy Costs
New research explores how quantization, batching strategies, and serving optimizations dramatically reduce LLM energy consumption while maintaining performance.
LLM optimization
New research explores how quantization, batching strategies, and serving optimizations dramatically reduce LLM energy consumption while maintaining performance.
LLM Agents
New research reveals systematic failures in how large language models approach multi-step planning, with implications for AI agents in content generation and autonomous systems.
LLM Reliability
New research achieves enterprise-grade 99.99966% reliability in LLM systems through consensus-driven decomposed execution, bringing Six Sigma quality standards to AI agents.
Conformal Prediction
New research introduces adaptive cluster-based density estimation for conformal prediction in generative models, enabling statistical guarantees on AI-generated content quality and reliability.
LLM Research
New research introduces DAJ, a data-reweighting approach for LLM judges that improves test-time scaling in code generation by better identifying correct solutions.
LLM evaluation
New research reveals smaller language models can outperform large LLMs at evaluation tasks through semantic capacity asymmetry, challenging the dominant LLM-as-a-Judge paradigm.
LLM evaluation
Researchers challenge claims that LLMs are narcissistic evaluators, examining whether AI models truly favor their own outputs when judging text quality.
AI Agents
New research presents a framework for building capable small language model agents using synthetic tasks, simulated environments, and structured rubric-based rewards—democratizing agentic AI development.
LLM Security
Researchers discover that simulating intoxicated speech patterns can bypass AI safety guardrails. The 'In Vino Veritas' attack reveals fundamental weaknesses in how LLMs handle linguistic degradation.
AI Agents
Learn how to implement short-term, long-term, and episodic memory systems in AI agents, enabling persistent context and improved reasoning capabilities across sessions.
LoRA
Learn how Low-Rank Adaptation lets you customize billion-parameter AI models on standard laptops—the same technique powering custom deepfakes and AI video generation.
xAI
Indonesia reinstates access to Elon Musk's Grok AI after xAI implements new safeguards against synthetic image abuse, marking a key regulatory moment for AI image generation.