ModerationCategory

open class ModerationCategory(val name: String)

Represents categories for content moderation used to classify potentially harmful or inappropriate content. These categories help identify specific types of violations that content may fall under.

Constructors

ModerationCategory

constructor(name: String)
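
Because the class is open and its constructor takes only a name, callers can presumably define their own categories. The following is a minimal sketch, not the library's documented usage: it declares a local stand-in with the same signature so that it runs without the library on the classpath, and the `Spam` object, the `"SPAM"` and `"PHISHING"` strings, and the `describe` helper are all hypothetical.

```kotlin
// Local stand-in that mirrors the declaration shown above (assumption:
// the real class exposes nothing else that this sketch needs).
open class ModerationCategory(val name: String)

// Hypothetical custom category declared as a singleton subclass.
object Spam : ModerationCategory("SPAM")

// Hypothetical helper that reads the category's name property.
fun describe(category: ModerationCategory): String = "category=${category.name}"

fun main() {
    // Direct construction also works, since the constructor is public.
    val adHoc = ModerationCategory("PHISHING")

    println(describe(Spam))  // category=SPAM
    println(describe(adHoc)) // category=PHISHING
}
```

Declaring a custom category as an `object` gives it a single shared instance, which keeps comparisons between categories predictable.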

Types

Defamation

object Defamation : ModerationCategory

Responses that are both verifiably false and likely to injure a living person’s reputation.

ElectionsMisinformation

object ElectionsMisinformation : ModerationCategory

Responses that contain factually incorrect information about electoral systems and processes, including the time, place, or manner of voting in civic elections.

Harassment

object Harassment : ModerationCategory

Represents the "Harassment" moderation category.

HarassmentThreatening

object HarassmentThreatening : ModerationCategory

Represents the moderation category for harassment that is threatening in nature.

Hate

object Hate : ModerationCategory

Represents content categorized as hate speech or related material.

HateThreatening

object HateThreatening : ModerationCategory

Represents the HATE_THREATENING moderation category: hateful content that is also threatening.

Illicit

object Illicit : ModerationCategory

Represents the moderation category for content that may involve illegal or illicit activities. This category is used to identify content that violates legal frameworks or ethical guidelines.

IllicitViolent

object IllicitViolent : ModerationCategory

Represents content classified as both illicit and violent in nature.

IntellectualProperty

object IntellectualProperty : ModerationCategory

Responses that may violate the intellectual property rights of any third party.

Misconduct

object Misconduct : ModerationCategory

Represents a predefined moderation category for cases associated with misconduct.

Privacy

object Privacy : ModerationCategory

Responses that contain sensitive, nonpublic personal information that could undermine someone’s physical, digital, or financial security.
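
The predefined objects listed above can be matched in a `when` expression. The sketch below is illustrative only: it uses a local stand-in with a handful of the categories, assumes the objects are nested inside `ModerationCategory` as the Types listing suggests, and invents the name strings, the `Action` enum, and the `actionFor` policy for the example.

```kotlin
// Local stand-in with a subset of the categories listed above.
// The name strings are assumptions, not values taken from the library.
open class ModerationCategory(val name: String) {
    object Harassment : ModerationCategory("HARASSMENT")
    object HarassmentThreatening : ModerationCategory("HARASSMENT_THREATENING")
    object Hate : ModerationCategory("HATE")
    object HateThreatening : ModerationCategory("HATE_THREATENING")
    object Privacy : ModerationCategory("PRIVACY")
}

// Hypothetical outcomes an application might assign to flagged content.
enum class Action { BLOCK, REVIEW, LOG }

// Hypothetical policy mapping a category to an action.
fun actionFor(category: ModerationCategory): Action = when (category) {
    ModerationCategory.HarassmentThreatening,
    ModerationCategory.HateThreatening -> Action.BLOCK

    ModerationCategory.Harassment,
    ModerationCategory.Hate,
    ModerationCategory.Privacy -> Action.REVIEW

    // The class is open rather than sealed, so the compiler cannot prove
    // exhaustiveness and an else branch is required.
    else -> Action.LOG
}

fun main() {
    println(actionFor(ModerationCategory.HateThreatening)) // BLOCK
    println(actionFor(ModerationCategory.Privacy))         // REVIEW
    println(actionFor(ModerationCategory("CUSTOM")))       // LOG
}
```

The `else` branch also covers any category defined outside this list, so a policy like this keeps working when new categories appear.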