LLM Security
Special Token Attacks: The 96% LLM Jailbreak Exploit
Security researchers uncover how special tokens in LLM architectures create hidden attack surfaces, enabling jailbreak success rates as high as 96% across major models.
LLM Security
Security researchers uncover how special tokens in LLM architectures create hidden attack surfaces, enabling jailbreak success rates as high as 96% across major models.
LLM Research
Researchers introduce S-RLS, a novel method for continuous LLM knowledge updates that avoids catastrophic forgetting through soft memory preservation instead of rigid constraints.
neuro-symbolic AI
New research proposes tensor network mathematics to unify neural networks with symbolic AI, potentially enabling more interpretable and reasoning-capable AI systems.
synthetic data
Researchers analyze why Empirical Risk Minimization fails when models train on synthetic data, revealing fundamental barriers that affect AI video generation and deepfake systems.
Generative AI
New research explores how generative models can iteratively improve their own training datasets, potentially enhancing quality across AI video, image synthesis, and synthetic media generation.
LLM Agents
Researchers introduce a determinism-faithfulness assurance harness for tool-using LLM agents, enabling reliable replay testing to catch unpredictable AI behavior in critical applications.
LLM Agents
New research introduces Aeon, a memory management system combining neural and symbolic approaches to help LLM agents maintain coherent reasoning across extended task sequences.
LLM alignment
Researchers introduce GRADE, a technique that replaces traditional policy gradient methods with direct backpropagation for aligning large language models, potentially offering more efficient training.
LLM Research
Researchers propose a physics-inspired framework treating LLM token embeddings as discrete semantic states governed by Hamiltonian dynamics, offering new insights into transformer interpretability.
LLM Agents
Researchers propose a constrained-topology planning approach for LLM agents that improves reliability in automated feature engineering, addressing key challenges in ML pipeline automation.
LLM Research
New research reveals how LLMs develop 'directional attractors' during reasoning tasks, showing that similarity-based retrieval mechanisms systematically steer iterative summarization toward predictable patterns.
AI research
New research formally disproves the assumed universal trade-off between certainty and scope in AI systems, with implications for how we understand LLM reliability and knowledge boundaries.