Create Moderation

User Interface

Field

Description

Comment

Model

The content moderation model you would like to use.

Defaults to omni-moderation-latest

Input Data Type

The content moderation model you would like to use.

String(Default) Array of String

String

Single string input to moderation.

Array of String

Multiple string inputs to be moderation.

e.g.)["word1", "word2", "word3"]

Response Datas

Key

Description

The unique identifier for the moderation request.

model

The model used to generate the moderation results.

results[]

A list of moderation objects.

results[].flagged

Whether any of the below categories are flagged.

results[].categories

A list of the categories, and whether they are flagged or not.

results[].categories.harassment

Content that expresses, incites, or promotes harassing language towards any target.

results[].categories.harassment/threatening

Harassment content that also includes violence or serious harm towards any target.

results[].categories.sexual

Content meant to arouse sexual excitement, such as the description of sexual activity, or that promotes sexual services (excluding sex education and wellness).

results[].categories.hate

Content that expresses, incites, or promotes hate based on race, gender, ethnicity, religion, nationality, sexual orientation, disability status, or caste. Hateful content aimed at non-protected groups (e.g., chess players) is harassment.

results[].categories.hate/threatening

Hateful content that also includes violence or serious harm towards the targeted group based on race, gender, ethnicity, religion, nationality, sexual orientation, disability status, or caste.

results[].categories.illicit

Content that includes instructions or advice that facilitate the planning or execution of wrongdoing, or that gives advice or instruction on how to commit illicit acts. For example, "how to shoplift" would fit this category.

results[].categories.illicit/violent

Content that includes instructions or advice that facilitate the planning or execution of wrongdoing that also includes violence, or that gives advice or instruction on the procurement of any weapon.

results[].categories.self-harm/intent

Content where the speaker expresses that they are engaging or intend to engage in acts of self-harm, such as suicide, cutting, and eating disorders.

results[].categories.self-harm/instructions

Content that encourages performing acts of self-harm, such as suicide, cutting, and eating disorders, or that gives instructions or advice on how to commit such acts.

results[].categories.self-harm

Content that promotes, encourages, or depicts acts of self-harm, such as suicide, cutting, and eating disorders.

results[].categories.sexual/minors

Sexual content that includes an individual who is under 18 years old.

results[].categories.violence

Content that depicts death, violence, or physical injury.

results[].categories.violence/graphic

Content that depicts death, violence, or physical injury in graphic detail.

results[].category_scores

A list of the categories along with their scores as predicted by model.

results[].category_applied_input_types

A list of the categories along with the input type(s) that the score applies to.

Last updated 10 days ago

Was this helpful?