Lakra on AI Content Moderation

Rudraksh Lakra (O. P. Jindal Global University, Jindal Global Law School (JGLS); Independent Research) has posted Deciphering AI-Powered Content Moderation: Approaches, Constraints, and Future Horizons (ORF, AI F4: Facts, Fiction, Fears and Fantasies Series) on SSRN. Here is the abstract:

Due to the sheer scale of content on social media platforms, content moderation practices have shifted from traditional community moderation methods to increasingly rely on AI-powered automated moderation tools. This rise of AI-based content moderation has been subject to both fantasization and exaggerated apprehensions. The allure of AI-based content moderation for governments is that they view it as a silver bullet that can swiftly tackle the complex challenges in this domain. However, at the same time, there have been exaggerated fears about the inefficacy of AI-powered content moderation and its potential use as a tool to stifle dissenting voices. In reality, the effectiveness of AI-based content moderation falls somewhere between these extremes.

This article aims to demystify the workings of AI-based moderation. For this purpose, the article offers a technical explanation of the different approaches to AI-based content moderation, including their current capabilities, effectiveness, and areas of limitation. It explores the diverse perspectives presented in the existing literature. The objective is to pinpoint gaps deserving of further research and delve into their implications for shaping policy responses. The final section provides a high-level guidance on the path forward.