Content Moderation / Guide

Trust & Safety changelog

This changelog is in Beta. As we iterate to make it as useful as possible, let us know if you have feedback.

Stay on top of the latest changes affecting your Trust & Safety operations.
This changelog is crowdsourced, feel free to suggest missing items.

Filters: RegulatoryProduct     Geos: USEUUKChinaRoW

Adult contentChild SafetyExtremismHarassmentHateMisinformationPrivacySelf-HarmSubstancesViolence

March 2023

UNESCO is launching a National Coalition on Freedom of Expression and Content Moderation in Kenya. Read more Kenya RegulatoryHateMisinformation
Twitter is now prohibiting "wishes of harm" in its new violent speech policy, banning users expressing desire for harm. Read more WorldTwitter ProductHateViolence

October 2022

Spotify is acquiring Kinzen, a firm specialized in identifying harmul audio content. Read more WorldSpotify ProductHateMisinformation

September 2022

California is requiring platforms to report how they moderate hate speech, extremism, harassment and other objectionable behaviors. Read more US RegulatoryExtremismHarassmentHate

August 2022

OpenAI is introducing a content moderation endpoint assessing whether the content is sexual, hateful or promoting self-harm. Read more WorldOpenAI ProductAdultHateSelf-harm

See missing items? You can add them here.
Want to keep an eye on what's going on? Consider subscribing to Ben Whitelaw's newsletter.

Read more

Cookies help us deliver our services. By using our services, you agree to our use of cookies. Learn more

OK