How does Text Moderation work?

Our Text Moderation solution is entirely automated. No human moderators view or rate your content, which allows for very fast turnaround times, high scalability and keeps your content private.

With Text Moderation, you simply submit any type of text (message, comment, description, review...) to our API. The API instantly responds with the moderation details. Any objectionable content found will be flagged and described to help you block, modify or review it.
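To illustrate the "block, modify or review" step, here is a minimal Python sketch of how a client might act on a moderation response. The response shape (`flags`, `category`, `start`, `end`), the `BLOCK_CATEGORIES` set and both helper functions are assumptions made up for this example; the actual API fields may differ, so check the API reference before relying on any of these names.

```python
from typing import Any

# Hypothetical response shape, for illustration only.
# The real API's field names and structure may differ.
EXAMPLE_RESPONSE = {
    "status": "success",
    "flags": [
        {"category": "profanity", "match": "$#!t", "start": 5, "end": 9},
    ],
}

# Assumption: categories that should never be published.
BLOCK_CATEGORIES = {"profanity"}


def decide(response: dict[str, Any]) -> str:
    """Return 'allow', 'review' or 'block' for one moderation response."""
    flags = response.get("flags", [])
    if not flags:
        return "allow"
    if any(f["category"] in BLOCK_CATEGORIES for f in flags):
        return "block"
    # Flagged, but not in a hard-block category: send to manual review.
    return "review"


def mask(text: str, response: dict[str, Any], fill: str = "*") -> str:
    """Replace every flagged span with fill characters (the 'modify' option)."""
    chars = list(text)
    for f in response.get("flags", []):
        for i in range(f["start"], min(f["end"], len(chars))):
            chars[i] = fill
    return "".join(chars)
```

In this sketch, `decide` picks an action per message and `mask` shows one way to modify a message instead of rejecting it outright.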

Detection strength

Our Text Moderation is a lot stronger than word-based filters. It uses advanced language analysis to detect objectionable content, even when users specifically attempt to circumvent your filters.

As an example, for each word we look up millions of variations that might be used to evade filtering, while ignoring situations that would generate false positives. Here is a partial list of the situations that we cover:

druuugggggggs
Repetitions

Characters being repeated to avoid basic word filtering

$#!t
Grawlix

Replacement of characters with typographical symbols

B__* 0 __ 0 -- B__s
Insertions

Adding spaces, punctuation and more within words

🅓rͬu̸🄶s̼
Obfuscation and Special characters

Unusual non-ASCII characters used to evade basic word filters

phok yu
Spelling mistakes & Phonetic Variations

Changing word spellings while retaining their original meaning or pronunciation

|)R|_|G5
Leet speak

Replacing some alphabetical characters with a combination of punctuation, digits and letters

123FuckBlablah
Smart embeddings

Catching profanity embedded within longer strings, while ignoring potential false positives such as "bassguitar" or "amass"
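To make the situations above concrete, here is a toy Python sketch of the normalization approach that basic filters often use to handle repetitions, insertions, simple leet speak, Unicode look-alikes and embedded words. It is not how the Text Moderation service works internally; the `LEET` table, the word lists and the function names are all assumptions chosen for illustration, and a neutral placeholder word stands in for real profanity.

```python
import re
import unicodedata

# Illustrative single-character substitutions only. Real leet speak also uses
# multi-character forms such as "|)" for "d", which this sketch does not handle.
LEET = str.maketrans({
    "0": "o", "1": "i", "3": "e", "4": "a",
    "5": "s", "7": "t", "$": "s", "@": "a",
})


def normalize(text: str) -> str:
    """Reduce a token to a canonical lowercase a-z form."""
    # Compatibility decomposition maps many look-alikes (e.g. fullwidth
    # letters) to ASCII and splits accents into separate combining marks.
    # Some symbols have no decomposition and would need an explicit mapping.
    decomposed = unicodedata.normalize("NFKD", text)
    stripped = "".join(c for c in decomposed if not unicodedata.combining(c))
    lowered = stripped.lower().translate(LEET)
    # Drop inserted spaces, punctuation and leftover digits.
    letters = re.sub(r"[^a-z]", "", lowered)
    # Collapse runs of the same character: "druuuggs" becomes "drugs".
    return re.sub(r"(.)\1+", r"\1", letters)


# Placeholder lists, stored in normalized form so that words containing
# double letters still compare correctly after repeats are collapsed.
BLOCKLIST = {normalize(w) for w in ("drugs",)}
ALLOWLIST = {normalize(w) for w in ("drugstore",)}


def is_flagged(token: str) -> bool:
    """Flag a token if a blocked word appears inside its normalized form."""
    norm = normalize(token)
    if norm in ALLOWLIST:  # known benign word that embeds a blocked one
        return False
    return any(word in norm for word in BLOCKLIST)
```

A sketch like this also shows why simple normalization is not enough on its own: collapsing repeats and substring matching create new false positives, and an allowlist only covers the benign words someone thought to add.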
