LLM Alignment
ECLIPTICA: New Framework Enables Switchable LLM Alignment
Researchers introduce ECLIPTICA, a framework that uses Contrastive Instruction-Tuned Alignment (CITA) to switch an LLM dynamically between aligned and unaligned behaviors for safety research.