Stay on top of the latest changes affecting your Trust & Safety operations.
This changelog is crowdsourced, feel free to suggest missing items.
August 2024 | ||
TikTok is launching its Sub-Saharan Africa Safety Advisory Council to protect children from threats. Read more | WorldTikTok | ProductChild Safety |
July 2024 | ||
Meta's Oversight Board is asking the platform to refine its policies around AI-generated explicit images. Read more | WorldMeta | ProductAI |
June 2024 | ||
Snapchat is rolling out new safety features aimed at protecting teens from unwanted contacts. Read more | WorldSnapchat | ProductChild Safety |
The United Nations are launching global principles to combat online hate and lies. Read more | World | Regulatory |
YouTube is releasing a new feature allowing users to remove AI content that mimics their face or voice. Read more | WorldYouTube | ProductAI |
WeChat is requiring all creators on the platform to disclose whether a published post was generated using AI. Read more | WorldWeChat | ProductAI |
May 2024 | ||
Twitch is disbanding its Safety Advisory Council and is replacing it with Twitch Ambassadors. Read more | WorldTwitch | Product |
OpenAI is creating a safety and security committee led by senior executives. Read more | WorldOpenAI | Product |
TikTok is introducing "Content Credentials", a digital watermark to label AI-created images and videos. Read more or even more | WorldTikTok | ProductAI |
April 2024 | ||
TikTok is updating its guidelines to restrict weight loss content. Read more | WorldTikTok | ProductSubstances |
Tinder is adding a new safety feature to the app, so online daters can let friends and families know where they are. Read more | WorldTinder | Product |
Meta is testing features that blur messages containing nudity to safeguard teens on Instagram. Read more | WorldMeta | ProductChild Safety |
Meta is announcing an update to its AI labeling policy, expanding its definition of “manipulated media” to go beyond AI-generated videos. Read more or even more | WorldMeta | ProductAI |
March 2024 | ||
TikTok is unveiling a global Youth Council,aiming at improving younger user safety. Read more | WorldTikTok | ProductChild Safety |
January 2024 | ||
Substack is implementing a new “report” button in its app, allowing readers to flag posts and publications directly. Read more or even more | WorldSubstack | Product |
Meta is restricting teens from viewing content that deals with topics like suicide, self-harm, and eating disorders. Read more | WorldMeta | ProductChild SafetySelf-harm |
December 2023 | ||
Meta is rolling out end-to-end encryption for calls and messages across Facebook and Messenger. Read more | WorldMeta | Product |
Meta is planning to disable its cross-messaging feature between Facebook and Instagram to comply with the Digital Market Act. Read more or even more | WorldMeta | Product |
November 2023 | ||
Meta is updating its political advertising policies to cover AI-generated imaged and videos. Read more | WorldMeta | ProductMisinformation |
TikTok is taking action to remove videos promoting terrorism from the platform. Read more | WorldTikTok | ProductExtremismViolence |
Meta's Oversight Board is reviewing the app's handling of a video showing an unveiled woman in Iran. Read more | WorldMeta | Product |
Google is launching Notes, an experimental feature that allows users to share human insights on search results. Read more | WorldGoogle | Product |
YouTube is announcing a series of policy changes aiming to inform viewers when content has been generated by AI. Read more | WorldYouTube | RegulatoryMisinformation |
YouTube is launching new features to prevent harmful content exposure for teens. Read more | WorldYouTube | ProductChild Safety |
October 2023 | ||
X is announcing that posts corrected by Community Notes will now become ineligible for revenue share. Read more | World | ProductMisinformation |
Meta is declining its Oversight Board's advice from August 2023 to tighten oversight of drug-related posts. Read more | WorldMeta | ProductSubstances |
Discord is unveiling a system called “Teen Safety Assist” that will warn minors when they get a suspicious message. Read more or even more | WorldDiscord | ProductChild Safety |
Meta and TikTok are announcing they took action to counter misinformation following the attack by Hamas on Israel. Read more or even more | WorldTikTok | ProductMisinformation |
Snapchat is facing accusations over its failure to assess privacy risks of its generative AI chatbot "My AI". Read more | WorldSnapchat | ProductPrivacy |
Meta's Oversight Board is planning to open a case involving political deepfakes. Read more or even more | WorldMeta | ProductMisinformation |
September 2023 | ||
X is disabling a tool allowing users to report electoral fake news. Read more | World | ProductMisinformation |
The Mental Health Coalition is launching the Safe Online Standards for Kids' Mental Health (S.O.S) initiative to create a rating system across platforms. Read more | World | ProductChild Safety |
Meta's Oversight Board is urging the platform to improve the distinction between hate speech and criticism of hate speech. Read more | WorldMeta | ProductHate |
Snapchat is announcing new safety features such as In-app Warnings to protect teens from potential online risks. Read more | WorldSnapchat | ProductChild Safety |
August 2023 | ||
ADL is urging Meta’s Oversight Board to take action to improve addressing Holocaust denial and distortion. Read more | WorldMeta | ProductHateMisinformation |
Meta's Oversight Board is announcing Holocaust Denial as a new case for consideration, inviting people and organizations to submit public comments. Read more | WorldMeta | ProductHateMisinformation |
Meta is revising its Adult Sexual Exploitation policy, permitting sharing of content involving non-consensual sexual touching if posted with the aim of raising awareness. Read more | WorldMeta | ProductAdult |
Snap is announcing that its users will soon be able to opt out of content personalization. Read more | WorldSnapchat | Product |
Instagram's users users will now be able to access features without seeing content that's been ranked by Meta's recommendation algorithms. Read more | WorldMeta | Product |
X is announcing it will remove a protective feature letting users block other accounts, except for direct messages. Read more | World | Product |
Google is launching its new Transparency Center on which users can learn more about policies, reporting inappropriate content or appealing a ban. Read more | WorldGoogle | Product |
YouTube is updating its policies to tackle medical misinformation and remove harmful cancer claims. Read more | WorldYouTube | ProductMisinformation |
Meta's Oversight Board is urging the company to have stricter rules bannning gender-based violence. Read more | WorldMeta | ProductHate |
July 2023 | ||
Seven big tech companies are agreeing to commit to new standards in security, trust and safety. Read more | World | Product |
Xbox is adding a reactive voice chat moderation, allowing players to capture and submit 60-second audio clips of inappropriate voice chat messages. Read more | WorldMicrosoft | Product |
Discord is expanding its policies to address generative artificial intelligence that can create fake content and the sexualization of children. Read more | WorldDiscord | ProductChild Safety |
June 2023 | ||
TikTok is launching a new feature enabling parents to filter out videos they don't want their children to see. Read more | WorldTikTok | ProductChild Safety |
TikTok is developing a youth council to build safety tools that are more effective for teenagers. Read more | WorldTikTok | ProductChild Safety |
Meta is rolling back its measures to curb the spread of Covid misinformation. Read more | WorldMeta | ProductMisinformation |
Meta is implementing new updates requiring advertisers to designate the beneficiary and the payer for their ads. Read more | WorldMeta | Product |
The Global Project Against Hate and Extremism (GPAHE) is releasing a database compiling more than 300 hate and far-right extremist symbols. Read more or even more | World | ProductExtremismHate |
Apple is annoucing that a new feature will soon warn users when receiving unsolicited nudes. Read more | WorldApple | ProductAdult |
YouTube is announcing it will no longer remove videos with false claims of fraud in the 2020 presidential election. Read more | WorldYouTube | ProductMisinformation |
May 2023 | ||
Twitter is launching Community Notes for images in posts to address misleading images and emphasize crowdsourced moderation. Read more | WorldTwitter | ProductMisinformation |
Mozilla is launching a new Mastodon server to test content moderation that is different to other social media platforms, with no focus on free speech or ‘neutrality’. Read more | WorldMozilla | Product |
April 2023 | ||
TikTok is changing its misinformation policy to ban all climate change denial content on its platform. Read more | WorldTikTok | ProductMisinformation |
Meta's Oversight Board is asking the social media company to keep its covid misinformation policy and to be more transparent when removing content. Read more | WorldMeta | ProductMisinformation |
YouTube is updating its guidelines for eating disorder-related content, banning content including extreme calorie counting or purging after eating. Read more | WorldYouTube | ProductSelf-harm |
China is pledging to intensify its battle against "illegal" political content by boosting its system of tip-offs. Read more | China | Regulatory |
Twitter is removing a policy prohibiting the targeted deadnaming or misgendering of transgender people from its moderation guidelines. Read more | WorldTwitter | ProductHate |
Twitter is planning to add visible labels on tweets that have been identified as potentially violating its policies. Read more | WorldTwitter | Product |
China and the US are both requesting for public comment on accountability measures for advanced AI systems such as ChatGPT. Read more | ChinaUS | Regulatory |
Snapchat is launching new tools to improve its AI chatbot, including an age filter ensuring it responds according to the user's age. Read more | WorldSnapchat | ProductChild Safety |
Facebook is planning to roll out new policies making users able to have greater control over "demoted" and fact-checked content. Read more | WorldMeta | Product |
March 2023 | ||
Twitter is opening some source code to public inspection, including the algorithm used to recommend tweets to users. Read more | WorldTwitter | Product |
Meta is rolling out a new system to separate ads from harmful or controversial content. Read more | WorldMeta | Product |
Teleperformance is resuming its full-service content moderation services, including moderation of highly egregious content. Read more | WorldTeleperformance | Product |
Yubo and the AFNOR Group are forming a working group to create a new safety standard on the prevention of risks and protection of minors on social networks. Read more or even more | WorldAFNORYubo | ProductChild Safety |
TikTok is unveiling its updated community guidelines that will focus on improving content moderation on the platform and take effect on April 21st. Read more | WorldTikTok | ProductChild SafetyMisinformation |
Snapchat is publishing guidelines in its Family Center, detailing how content gets algorithmically recommended to minors. Read more | WorldSnapchat | ProductChild Safety |
Meta is introducing a new way for users to authenticate their account with a missed call. Read more | WorldMeta | ProductPrivacy |
Meta's oversight board is reviewing the moderation of the Arabic word "shaheed", meaning "martyr" in English, the word associated with most content removals. Read more | WorldMeta | ProductExtremism |
Whatsapp is agreeing to be more transparent on changes to its terms of service according to the European Commission. Read more or even more | WorldMeta | Product |
Facebook is revamping its "cross-check" moderation system after facing criticism for applying different review processes for VIP vs regular users. Read more | WorldMeta | Product |
Twitter is now prohibiting "wishes of harm" in its new violent speech policy, banning users expressing desire for harm. Read more | WorldTwitter | ProductHateViolence |
TikTok is setting a new default 60-minute daily screen time limit for minors. Read more | WorldTikTok | ProductChild Safety |
February 2023 | ||
Meta is reforming its penalty system "Facebook Jail", saying users will now be receiving a warning first for most violations. Read more | WorldMeta | Product |
Meta's oversight board is announcing it will now review more types of content moderation cases and publish some decisions on an expedited basis. Read more | WorldMeta | Product |
Meta is rolling out a new version of its ad-matching tool, providing more information about how users activities are feeding ML models. Read more | WorldMeta | ProductPrivacy |
Meta is launching new comment moderation tools allowing creators on Facebook to view moderation statistics and manage conversations. Read more | WorldMeta | Product |
Google is introducing a blur feature helping users avoiding explicit images while using the search engine. Read more | WorldGoogle | ProductAdultViolence |
TikTok is opening transparency and accountability centers to visitors. Read more | WorldTikTok | Product |
TikTok is rolling out a revamped account enforcement system, including a new strike system and features dealing with recommendations. Read more | WorldTikTok | Product |
January 2023 | ||
Twitter is planning to limit permanent suspensions of accounts breaking its rules, adding that any user will be able to appeal an account suspension. Read more | WorldTwitter | Product |
Meta's oversight board is announcing the removal of the Ukrainian far-right military group Azov Regiment from its list of dangerous individuals and organizations. Read more | WorldMeta | ProductExtremism |
Meta's oversight board is sharing their decision to redefine Facebook and Instagram's community rules regarding nudity in a less discriminatory way. Read more or even more | WorldMeta | ProductAdult |
China is implementing new rules on generative AI and deepfakes, requiring specific labeling and banning their use for fake news generation. Read more | China | RegulatoryMisinformation |
The United Nation’s 2022 Internet Governance Forum (IGF) is discussing internet fragmentation among democracies vs. driven by authoritarian states. Read more | World | Regulatory |
TikTok is announcing creators are now able to restrict their videos to adult viewers. Read more | WorldTikTok | ProductChild Safety |
Google is developing a free moderation tool for terrorist material identification and removal for smaller platforms. Read more | WorldGoogle | ProductExtremism |
December 2022 | ||
Meta is launching HMA, a new free tool helping platforms identify and remove violating content. Read more or even more | WorldMeta | ProductExtremism |
Apple is cancelling its plan to scan photos stored in iCloud to detect CSAM. Read more | WorldApple | ProductChild Safety |
Meta's oversight board is suggesting that Meta's cross check programme is more commercially driven than commited to human rights. Read more | WorldMeta | Product |
TikTok and Bumble are joining Meta to stop revenge porn by blocking images from StopNCII.org's bank of hashes. Read more or even more | WorldTikTokBumbleMeta | ProductAdult |
November 2022 | ||
Naver Z is introducing its Safety Advisory Council providing expertise on their policy and features. Read more or even more | WorldNaver | Product |
Twitter is announcing its Covid misinformation policy is no longer enforced. Read more or even more | WorldTwitter | ProductMisinformation |
Teleperformance is announcing it will no longer accept any new highly egregious content moderation work. Read more | WorldTeleperformance | Product |
October 2022 | ||
TikTok is introducing new and updated features and policies for its LIVE community. Read more | WorldTikTok | ProductChild Safety |
Twitter is reviewing its policies around permanently banning users. Read more | WorldTwitter | Product |
Spotify is acquiring Kinzen, a firm specialized in identifying harmul audio content. Read more | WorldSpotify | ProductHateMisinformation |
September 2022 | ||
Tumblr is creating community labels allowing users to avoid seeing unwanted content. Read more | WorldTumblr | ProductAdultSubstancesViolence |
Twitter is opening the Twitter Moderation Research Consortium (TMRC) to researchers. Read more | WorldTwitter | Product |
Instagram is developing a feature protecting users from receiving unsolicited nude photos. Read more | WorldMeta | ProductAdult |
Facebook is experimenting with asking 250 users to help moderate climate speech. Read more | WorldMeta | ProductMisinformation |
YouTube is announcing updated content moderation policies to prohibit violent extremist content. Read more | WorldYouTube | ProductExtremism |
Twitter is expanding its fact-checking feature Birdwatch, allowing users to add additional context to tweets. Read more | WorldTwitter | ProductMisinformation |
August 2022 | ||
OpenAI is introducing a content moderation endpoint assessing whether the content is sexual, hateful or promoting self-harm. Read more | WorldOpenAI | ProductAdultHateSelf-harm |
TikTok is said to be training its moderators to detect CSAM using graphic images and videos as a reference guide. Read more | WorldTikTok | ProductChild Safety |
See missing items? You can add them here.
This is a guide to detecting, moderating and handling sexual abuse and unsolicited sex in texts and images.
Results and insights from our AI or not game: how well humans identify AI images, when they get fooled and what we can learn from this.