Trust & Safety Changelog

Stay on top of the latest changes affecting your Trust & Safety operations.
This changelog is crowdsourced, feel free to suggest missing items.

Filters: Regulatory Product Geos: US EU UK China RoW

Adult content Child Safety Extremism Harassment Hate Misinformation Privacy Self-Harm Substances Violence

September 2023
Meta's Oversight Board is urging the platform to improve the distinction between hate speech and criticism of hate speech. Read more	WorldMeta	ProductHate
August 2023
ADL is urging Meta’s Oversight Board to take action to improve addressing Holocaust denial and distortion. Read more	WorldMeta	ProductHateMisinformation
Meta's Oversight Board is announcing Holocaust Denial as a new case for consideration, inviting people and organizations to submit public comments. Read more	WorldMeta	ProductHateMisinformation
Meta's Oversight Board is urging the company to have stricter rules bannning gender-based violence. Read more	WorldMeta	ProductHate
June 2023
The Global Project Against Hate and Extremism (GPAHE) is releasing a database compiling more than 300 hate and far-right extremist symbols. Read more or even more	World	ProductExtremismHate
May 2023
The UK is proposing an amendment to the Online Safety Bill, requiring platforms to prevent online abuse and violence against women. Read more	UK	RegulatoryHateViolence
April 2023
Twitter is removing a policy prohibiting the targeted deadnaming or misgendering of transgender people from its moderation guidelines. Read more	WorldTwitter	ProductHate
Germany is accusing Twitter of repeatedly failing to comply with the NetzDG, a social media hate speech takedowns law. Read more	GermanyTwitter	Regulatory Hate
March 2023
UNESCO is launching a National Coalition on Freedom of Expression and Content Moderation in Kenya. Read more	Kenya	RegulatoryHateMisinformation
Twitter is now prohibiting "wishes of harm" in its new violent speech policy, banning users expressing desire for harm. Read more	WorldTwitter	ProductHateViolence
October 2022
Spotify is acquiring Kinzen, a firm specialized in identifying harmul audio content. Read more	WorldSpotify	ProductHateMisinformation
September 2022
California is requiring platforms to report how they moderate hate speech, extremism, harassment and other objectionable behaviors. Read more	US	RegulatoryExtremismHarassmentHate
August 2022
OpenAI is introducing a content moderation endpoint assessing whether the content is sexual, hateful or promoting self-harm. Read more	WorldOpenAI	ProductAdultHateSelf-harm

See missing items? You can add them here.

Detecting and moderating AI-generated content with technology

Learn about the evolution of GenAI models and the best strategies to detect AI-generated content.

Sightengine Ranks #1 in AI-Media Detection Accuracy: Insights from an Independent Benchmark

Learn how Sightengine performed in an independent AI-media detection benchmark, outperforming competitors with advanced methodologies.

Products

September 2023

August 2023

June 2023

May 2023

April 2023

March 2023

October 2022

September 2022

August 2022

Read more

Detecting and moderating AI-generated content with technology

Sightengine Ranks #1 in AI-Media Detection Accuracy: Insights from an Independent Benchmark

Head back to the Knowledge Center