Christopher M. Homan

LPI-RIT at LeWiDi-2025: Improving Distributional Predictions via Metadata and Loss Reweighting with DisCo

Mandira Sawkar

• Nov 1, 2025 • 1 min read

ARTICLE: Annotator Reliability Through In-Context Learning

Using LLMs to identify high-quality human annotators by checking if their labels are consistent with AI predictions—helping build better training data while preserving diverse …

Sujan Dutta

• Mar 1, 2025 • 1 min read

Prompt Engineering

ProRefine: Inference-Time Prompt Refinement with Textual Feedback

ProRefine automatically improves AI prompts during inference by having one AI agent give feedback to refine another agent's prompts—boosting accuracy by 3-37% and helping smaller …

Deepak Pandita

• Jan 1, 2025 • 1 min read

Vicarious Annotation

Rater Cohesion and Quality from a Vicarious Perspective

Asking people to predict how others with different political views would label content reveals hidden biases and improves data quality for content moderation AI.

Deepak Pandita

• Nov 1, 2024 • 1 min read

Offensive Speech Detection

Vicarious Offense and Noise Audit of Offensive Speech Classifiers: Unifying Human and Machine Disagreement on What is Offensive

We ran a massive experiment: 9 different AI content moderation systems analyzed 92 million YouTube comments about US politics. The results were shocking—different AI systems …

Tharindu Cyril Weerasooriya

• Dec 2, 2023 • 1 min read

Label Distribution Learning

Subjective Crowd Disagreements for Subjective Data: Uncovering Meaningful CrowdOpinion with Population-level Learning

CrowdOpinion uses unsupervised learning to group similar content and predict the full range of human opinions about it, rather than forcing everyone into a single 'correct' …

Tharindu Cyril Weerasooriya

• Jul 1, 2023 • 1 min read

Label Distribution Learning

Disagreement Matters: Preserving Label Diversity by Jointly Modeling Item and Annotator Label Distributions with DisCo

Tharindu Cyril Weerasooriya

• Jul 1, 2023 • 1 min read

Machine Translation

Findings from the Bambara - French Machine Translation Competition (BFMT 2023)

Ninoh Agostinho Da Silva

• May 1, 2023 • 1 min read

Computer Science - Computation and Language

Vicarious Offense and Noise Audit of Offensive Speech Classifiers

Testing 9 different AI content moderation systems on 92 million YouTube comments reveals wildly inconsistent results, while human annotators show strong political bias—proving that …

Tharindu Cyril Weerasooriya

• Feb 1, 2023 • 1 min read

Label Distribution Learning

Improving Label Quality by Jointly Modeling Items and Annotators

Tharindu Cyril Weerasooriya

• Jan 1, 2021 • 1 min read