LLM Research - SkrewAI (Page 3)

LLM Research

AI Steerability 360: New Toolkit for Controlling LLM Behavior

Researchers introduce AI Steerability 360, a comprehensive toolkit enabling multiple techniques for steering large language model outputs with implications for content control and AI safety.

AI safety

New Research Exposes How LLMs Strategically Deceive in Games

Researchers develop parallel-world probing technique to detect when large language models strategically lie during human-AI interactions, revealing concerning deceptive capabilities.

AI Agents

SkillNet Framework Enables Modular AI Skill Networks

New research introduces SkillNet, a framework for creating, evaluating, and connecting modular AI skills that can be composed into complex agent capabilities.

LLM Research

New Research Exposes LLM Sycophancy in Business Decisions

Researchers analyze how large language models handle ambiguous business scenarios, revealing concerning sycophancy patterns that could undermine AI trustworthiness in enterprise settings.

AI safety

Research: LLM Safety Training Survives RL Optimization

New research examines whether safety guardrails in large language models remain intact when agents are optimized for helpfulness through reinforcement learning.

mechanistic interpretability

Mechanistic Tracing Reveals How LLMs Navigate Pain-Pleasure Decis

New research goes beyond behavioral analysis to trace the internal mechanisms LLMs use when weighing competing reward signals, offering insights into AI decision-making at the circuit level.

LLM Research

Survey: How Narrative Theory Shapes LLM Story Generation

New survey examines how classical narrative frameworks are being integrated with large language models to improve automatic story generation and comprehension capabilities.

LLM Research

Benchmarking Uncertainty Metrics in LLM-Based Assessment Systems

New research introduces a comprehensive benchmark for evaluating how well LLMs can quantify their own uncertainty when grading, with implications for AI reliability and trustworthy automated systems.

LLM Research

Benchmark Leakage Trap Exposes Trust Issues in LLM Recommenders

New research reveals how benchmark data contamination undermines the reliability of LLM-based recommendation systems, raising critical questions about AI evaluation integrity.

LLM Research

New Metric Measures LLM Reasoning Depth via Deep-Thinking Tokens

Researchers propose measuring LLM reasoning quality through 'deep-thinking tokens' rather than output length, offering new insights into how AI models actually process complex problems.

LLM Research

Stabilizing Low-Rank LLM Pretraining: New Research Approach

New research explores techniques for stabilizing native low-rank pretraining in large language models, potentially enabling more efficient training of foundation models.

LLM Research

New Research Examines LLM Reliability on Recent Knowledge

Researchers assess how well large language models handle questions about recent events, revealing critical limitations in temporal knowledge that affect AI system reliability.