Stay on top of the latest changes affecting your Trust & Safety operations.
This changelog is crowdsourced, feel free to suggest missing items.
November 2023 | ||
Australia is releasing new online safety standards to tackle terror and CSAM, including deepfakes created using generative AI. Read more or even more | Australia | RegulatoryChild SafetyExtremismMisinformationViolence |
TikTok is taking action to remove videos promoting terrorism from the platform. Read more | WorldTikTok | ProductExtremismViolence |
June 2023 | ||
The Global Project Against Hate and Extremism (GPAHE) is releasing a database compiling more than 300 hate and far-right extremist symbols. Read more or even more | World | ProductExtremismHate |
March 2023 | ||
Meta's oversight board is reviewing the moderation of the Arabic word "shaheed", meaning "martyr" in English, the word associated with most content removals. Read more | WorldMeta | ProductExtremism |
January 2023 | ||
The Supreme Court is taking up Section 230 for two cases to be heard in February, involving social media's relationship to terrorist activity. Read more | US | RegulatoryExtremism |
Meta's oversight board is announcing the removal of the Ukrainian far-right military group Azov Regiment from its list of dangerous individuals and organizations. Read more | WorldMeta | ProductExtremism |
Google is developing a free moderation tool for terrorist material identification and removal for smaller platforms. Read more | WorldGoogle | ProductExtremism |
December 2022 | ||
Meta is launching HMA, a new free tool helping platforms identify and remove violating content. Read more or even more | WorldMeta | ProductExtremism |
October 2022 | ||
Singapore is introducing a bill into Parliament to fight egregious and harmful online content. Read more | Singapore | RegulatoryAdultChild SafetyExtremismSelf-harm |
September 2022 | ||
YouTube is announcing updated content moderation policies to prohibit violent extremist content. Read more | WorldYouTube | ProductExtremism |
California is requiring platforms to report how they moderate hate speech, extremism, harassment and other objectionable behaviors. Read more | US | RegulatoryExtremismHarassmentHate |
See missing items? You can add them here.
This is a guide to detecting, moderating and handling insults, personal attacks and identity attacks in texts and images.
This is a complete guide to keyword filtering to text moderation through keyword filtering and keyword lists in 2024. Learn about best practices, trends and use-cases like profanity, toxicity filtering, obfuscation, languages and more.