Computer Science - Computation and Language

HUMANLM: Simulating Users with State Alignment Beats Response Imitation

Shirley Wu

Jan 1, 2026

Vicarious Offense and Noise Audit of Offensive Speech Classifiers

Testing 9 different AI content moderation systems on 92 million YouTube comments reveals wildly inconsistent results, while human annotators show strong political bias—proving that …

Tharindu Cyril Weerasooriya

Feb 1, 2023