content moderation
ML Sampling + LLM Labeling: A New Framework for Content Moderatio
New research proposes combining ML-assisted sampling with LLM labeling to measure policy-violating content at scale, offering a methodological breakthrough for detecting synthetic media and deepfakes.