AI Research - SkrewAI (Page 7)

AI Research

Multi-Agent AI Workflows Reshape Automated Scientific Discovery

New research proposes interactive multi-agent architectures for AI scientists, moving beyond single-model approaches to collaborative systems that could transform how AI tackles complex research problems.

LLM Alignment

GRADE: New Backpropagation Method Replaces Policy Gradients for L

Researchers introduce GRADE, a technique that replaces traditional policy gradient methods with direct backpropagation for aligning large language models, potentially offering more efficient training.

LLM Agents

New LLM Agent Framework Tackles ML Feature Engineering Reliabilit

Researchers propose a constrained-topology planning approach for LLM agents that improves reliability in automated feature engineering, addressing key challenges in ML pipeline automation.

AI Research

Research Challenges AI Certainty-Scope Trade-Off Assumptions

New research formally disproves the assumed universal trade-off between certainty and scope in AI systems, with implications for how we understand LLM reliability and knowledge boundaries.

Multimodal AI

Omni-R1: Unifying Multimodal AI Reasoning with New Framework

New research introduces Omni-R1, a unified generative paradigm combining vision-language models with reinforcement learning for enhanced multimodal reasoning capabilities.

LLM Agents

Task2Quiz: New Framework Tests How AI Agents Understand Environme

Researchers introduce Task2Quiz, a systematic paradigm for evaluating what LLM agents actually know about their operating environments, revealing critical gaps in agent world models.

AI Research

AI Memory Systems: How Close Are We to Human Hippocampus?

New research examines the gap between AI memory architectures and the human hippocampus, exploring how neuroscience insights could transform machine learning systems.

LLM safety

Q-Realign: New Method Restores LLM Safety During Quantization

Researchers introduce Q-realign, a technique that piggybacks safety realignment onto quantization, solving the problem of safety degradation in compressed LLMs for efficient deployment.

LLM Architecture

MoE-LoRA Framework Advances Multi-Task LLM Specialization

New research combines Mixture-of-Experts with Low-Rank Adaptation to create specialized AI models that maintain generalist capabilities while excelling at domain-specific tasks.

Deep Learning

NOVAK: A Unified Adaptive Optimizer for Deep Neural Networks

New research introduces NOVAK, a unified framework that bridges popular adaptive optimizers like Adam and AdaGrad, potentially improving training efficiency for deep learning models.

LLM safety

Global Subspace Projection: A New Approach to LLM Detoxification

Researchers propose a novel technique for removing toxic behaviors from large language models by projecting out malicious representations in the model's latent space.

LLM Alignment

ECLIPTICA: New Framework Enables Switchable LLM Alignment

Researchers introduce ECLIPTICA, a framework using Contrastive Instruction-Tuned Alignment (CITA) to enable dynamic switching between aligned and unaligned LLM behaviors for safety research.