New research has found that AI-powered content moderation systems from Google, OpenAI, Anthropic, and DeepSeek don’t always come to the same conclusions about bad language on the internet. New research has found that AI-powered content moderation systems from Google, OpenAI, Anthropic, and DeepSeek don’t always come to the same conclusions about bad language on the internet.
Should AI moderate online hate speech?
Get alerts for these topics