LLM safety
Selective Geometry Control: A New Approach to LLM Safety
New research proposes geometric methods to enhance LLM safety alignment robustness, offering potential improvements for AI systems that moderate synthetic media and deepfake content.