LLM Safety
Global Subspace Projection: A New Approach to LLM Detoxification
Researchers propose a novel technique for removing toxic behaviors from large language models by projecting out malicious representations in the model's latent space.
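The general idea behind subspace projection (this is an illustrative sketch, not the paper's actual method or code): given directions in the latent space associated with toxic behavior, subtract their span from each hidden state.

```python
import numpy as np

def project_out(h, V):
    """Remove the subspace spanned by V's columns from vector h.

    h' = h - V (V^T V)^{-1} V^T h, computed via the pseudoinverse.
    """
    P = V @ np.linalg.pinv(V)  # projector onto span(V)
    return h - P @ h

# Hypothetical toy data: an 8-dim hidden state, a 2-dim "toxic" subspace.
rng = np.random.default_rng(0)
V = rng.normal(size=(8, 2))
h = rng.normal(size=8)
h_clean = project_out(h, V)
# h_clean has no remaining component in span(V):
print(np.allclose(V.T @ h_clean, 0))  # True
```

The cleaned state keeps everything orthogonal to the removed subspace, which is why the approach can suppress a targeted behavior while leaving other capabilities intact.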
LLM Alignment
Researchers introduce ECLIPTICA, a framework using Contrastive Instruction-Tuned Alignment (CITA) to enable dynamic switching between aligned and unaligned LLM behaviors for safety research.
Transformer Architecture
A deep dive into the transformer architecture that powers everything from ChatGPT to AI video generators. Understanding attention mechanisms and why this design revolutionized machine learning.
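The attention mechanism at the heart of the transformer can be sketched in a few lines (shapes and names here are illustrative, not from any particular model):

```python
import numpy as np

def attention(Q, K, V):
    """Scaled dot-product attention: softmax(QK^T / sqrt(d_k)) V."""
    d_k = Q.shape[-1]
    scores = Q @ K.T / np.sqrt(d_k)  # similarity of each query to each key
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)  # softmax over keys
    return weights @ V  # weighted sum of value vectors

# Hypothetical toy shapes: 4 token positions, 16-dim head.
rng = np.random.default_rng(0)
Q = rng.normal(size=(4, 16))
K = rng.normal(size=(4, 16))
V = rng.normal(size=(4, 16))
out = attention(Q, K, V)
print(out.shape)  # (4, 16)
```

Each output position is a data-dependent mixture of all value vectors, which is what lets the architecture model long-range dependencies in one step.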
AI Agents
AI agents often fail after several steps due to error compounding and context degradation. Deep Agents architecture introduces new mechanisms to maintain coherence across extended task execution.
deepfake detection
HONOR will showcase AI-powered deepfake detection technology at MWC 2025, marking a significant push to bring synthetic media authentication directly to consumer smartphones.
deepfake regulation
The UK government is preparing to enforce legislation targeting companies that provide tools for creating AI deepfakes, marking a significant regulatory shift in synthetic media governance.
deepfakes
The Deepfake Summit debuts as the inaugural Prism Project event, bringing together fraud prevention and identity verification leaders to address AI-driven synthetic media threats.
deepfake detection
A new World Economic Forum-backed report details how synthetic media threatens Know Your Customer (KYC) verification systems, highlighting the urgent need for enhanced deepfake detection in financial identity processes.
LLM Security
Researchers reveal how large language models can be manipulated with fabricated evidence, raising critical questions about AI reliability and the spread of misinformation through synthetic content.
LLM Quantization
New quantization method FLRQ achieves up to 2.5x faster compression of large language models while maintaining accuracy through flexible low-rank matrix approximation techniques.
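The underlying idea of low-rank compression (a generic sketch of the technique, not the FLRQ algorithm itself): approximate a weight matrix with two thin factors via truncated SVD, storing far fewer parameters.

```python
import numpy as np

def low_rank_approx(W, r):
    """Best rank-r approximation of W as A @ B, via truncated SVD."""
    U, S, Vt = np.linalg.svd(W, full_matrices=False)
    A = U[:, :r] * S[:r]  # (m, r): left factor absorbs singular values
    B = Vt[:r, :]         # (r, n): right factor
    return A, B

# Hypothetical 64x64 weight matrix compressed to rank 8.
rng = np.random.default_rng(0)
W = rng.normal(size=(64, 64))
A, B = low_rank_approx(W, r=8)
print(W.size, A.size + B.size)  # 4096 1024 -> 4x fewer parameters
err = np.linalg.norm(W - A @ B) / np.linalg.norm(W)
```

Real methods combine this with quantizing the factors to low bit widths; the speed and accuracy figures in the summary come from the paper, not this sketch.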
deepfakes
The UK government pressures Elon Musk's X platform to address AI-generated deepfakes created by Grok chatbot, marking escalating regulatory scrutiny of synthetic media on social platforms.
deepfake detection
New UNITE framework combines facial, audio, and temporal analysis for comprehensive deepfake detection, moving beyond single-modality approaches that struggle with advanced synthetic media.
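Multi-modal detectors typically combine per-modality scores rather than trusting one signal; a minimal late-fusion sketch (the weights, score values, and function are hypothetical, not UNITE's actual architecture):

```python
def fuse_scores(scores, weights):
    """Weighted average of per-modality fake-probability scores."""
    total = sum(weights.values())
    return sum(scores[m] * w for m, w in weights.items()) / total

# Hypothetical detector outputs for one video clip.
scores = {"face": 0.91, "audio": 0.40, "temporal": 0.75}
weights = {"face": 0.5, "audio": 0.2, "temporal": 0.3}
print(round(fuse_scores(scores, weights), 3))  # 0.76
```

Fusing modalities means a fake that defeats one detector (say, a clean voice clone) can still be flagged by facial or temporal inconsistencies.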