LLM research
New Research Teaches LLMs to Extract Context Automatically
Researchers propose a novel approach to train LLMs to automatically identify and extract relevant context, improving inference efficiency and accuracy in long-context scenarios.
LLM research
Researchers propose a novel approach to train LLMs to automatically identify and extract relevant context, improving inference efficiency and accuracy in long-context scenarios.
deep learning
New research demonstrates that deep neural networks exhibit phase transitions during training, revealing hierarchical feature organization that could reshape how we understand and design AI architectures.
Diffusion Models
New research introduces Generative Stochastic Optimal Transport (GenSOT), combining harmonic path-integral methods with optimal transport theory to improve guided diffusion model generation.
deepfake
A couple lost $45,000 to scammers using AI-generated deepfake videos of Elon Musk promoting fraudulent cryptocurrency investments, highlighting the growing sophistication of synthetic media fraud.
Google releases an updated version of Gemini Deep Research, its AI-powered research assistant that autonomously explores topics and synthesizes information across sources.
AI Infrastructure
AI infrastructure startup Runware secures $50M to build a universal API connecting developers to multiple generative AI models, streamlining access to image, video, and audio synthesis capabilities.
Nvidia
Nvidia purchases SchedMD, maker of Slurm open-source workload manager used by most AI supercomputers. The acquisition strengthens Nvidia's grip on AI training infrastructure.
AI safety
Researchers present a framework for making multi-turn LLM agents more trustworthy through behavioral guidance, addressing critical safety concerns as AI systems become more autonomous.
LLM
Researchers introduce Adaptive Soft Rolling KV Freeze with entropy-guided recovery, achieving sublinear memory scaling for long-context LLM inference without significant quality loss.
deepfake detection
New research reveals 27% of IT leaders lack confidence in their organization's ability to detect deepfake attacks, highlighting critical gaps in enterprise synthetic media defenses.
quantum computing
Quantum computing meets generative AI with QGANs and hybrid architectures promising exponential speedups for media synthesis, molecular modeling, and beyond.
AI Agents
Kaggle's intensive AI agent program reveals practical insights on building production-ready systems, covering orchestration patterns, tool integration, and deployment strategies for real-world applications.