- Hate Speech: Statements that demean, dehumanize, or attack individuals or groups based on identity factors like race, gender, or religion.
- Offensive Content: Vulgar, abusive, or overly profane language used to provoke or insult.
- Sexual Content: Sexually explicit statements, or sexual remarks that are offensive or unsuitable for the context.
- Violence or Harm: Advocacy or description of physical harm, abuse, or violent actions.
- Illegal or Unethical Guidance: Instructions or encouragement for illegal or unethical actions.
- Manipulation or Exploitation: Language intended to deceive, exploit, or manipulate individuals for harmful purposes.
- Toxic Comment Classification Challenge
- Jigsaw Unintended Bias in Toxicity Classification
- Jigsaw Multilingual Toxic Comment Classification
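
As a rough illustration, the six categories defined above could be encoded as a label schema for annotation or classifier output. This is only a minimal sketch: the names `HarmCategory` and `Annotation`, the string values, and the choice of a multi-label (rather than single-label) scheme are all assumptions made for the example, not something the source or the listed datasets prescribe.

```python
from dataclasses import dataclass, field
from enum import Enum
from typing import List, Set


class HarmCategory(Enum):
    """Hypothetical labels mirroring the six categories listed above."""
    HATE_SPEECH = "hate_speech"
    OFFENSIVE_CONTENT = "offensive_content"
    SEXUAL_CONTENT = "sexual_content"
    VIOLENCE_OR_HARM = "violence_or_harm"
    ILLEGAL_OR_UNETHICAL_GUIDANCE = "illegal_or_unethical_guidance"
    MANIPULATION_OR_EXPLOITATION = "manipulation_or_exploitation"


@dataclass
class Annotation:
    """One text with zero or more category labels (multi-label by assumption)."""
    text: str
    labels: Set[HarmCategory] = field(default_factory=set)

    def is_flagged(self) -> bool:
        # A text counts as flagged if it carries at least one category.
        return bool(self.labels)

    def to_multi_hot(self) -> List[int]:
        # One 0/1 entry per category, in the enum's definition order.
        return [int(category in self.labels) for category in HarmCategory]


example = Annotation(
    text="placeholder text",
    labels={HarmCategory.HATE_SPEECH, HarmCategory.VIOLENCE_OR_HARM},
)
print(example.is_flagged())
print(example.to_multi_hot())
```

A multi-hot vector is used here because a single statement can fall under several categories at once, for example hate speech that also advocates violence.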