
Our classifier is based on Detoxify's unbiased-toxic-roberta model, trained on the Jigsaw Unintended Bias in Toxicity Classification dataset.
We chose this model for its high accuracy and its reduced false-positive rate on content from marginalized groups. We calibrate our thresholds using guidance from the Google Publisher Policies.
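The scoring step can be sketched as follows. This is a minimal illustration, not our production code: the category names match the outputs of Detoxify's unbiased model, but the threshold values here are placeholders, not our calibrated settings.

```python
# Illustrative per-category threshold check. Category names follow
# Detoxify's unbiased model output; the threshold values are assumptions.
THRESHOLDS = {
    "toxicity": 0.5,
    "severe_toxicity": 0.2,
    "identity_attack": 0.3,
    "insult": 0.5,
    "threat": 0.2,
    "sexual_explicit": 0.4,
}

def flagged_categories(scores: dict) -> list:
    """Return the categories whose score meets or exceeds its threshold."""
    return [cat for cat, t in THRESHOLDS.items() if scores.get(cat, 0.0) >= t]

# Mocked model output; a real call would look like
# Detoxify("unbiased").predict(text) and return a dict of category scores.
scores = {"toxicity": 0.62, "identity_attack": 0.05, "threat": 0.01}
print(flagged_categories(scores))  # ["toxicity"]
```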
Each page receives a composite safety grade based on the most concerning content on the page.
Note: Pages without completed safety review default to Grade D to protect buyers.
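A sketch of how a "worst category wins" composite grade could work, including the default-to-D behavior for unreviewed pages. The grade boundaries below are illustrative assumptions, not our actual cutoffs.

```python
# Illustrative composite grading: the grade is driven by the single most
# concerning category score. Boundary values are assumptions.
def safety_grade(scores):
    if scores is None:            # no completed safety review
        return "D"                # default to Grade D to protect buyers
    worst = max(scores.values(), default=0.0)
    if worst < 0.1:
        return "A"
    if worst < 0.3:
        return "B"
    if worst < 0.6:
        return "C"
    return "D"

print(safety_grade(None))                               # "D"
print(safety_grade({"toxicity": 0.05}))                 # "A"
print(safety_grade({"toxicity": 0.45, "insult": 0.2}))  # "C"
```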
Urban Dictionary applies two distinct moderation layers to every piece of content — one for publication, and another for advertiser safety. This ensures ads never appear next to content that violates your risk tolerance.
- ✓ Only Grade A pages
- ✓ Ideal for family-safe or regulated brands

- ✓ Grades A & B
- ✓ Suitable for most mainstream advertisers

- ✓ Grades A–C
- ✓ Broadest reach, suitable for mature brands
Custom controls
Set category-level thresholds (e.g. exclude only identity-based attacks or sexual content) to tailor safety enforcement.
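Combining a tier with category-level exclusions could look like the sketch below. The tier names, grade sets, and page-record shape are all illustrative assumptions.

```python
# Hypothetical eligibility check combining a grade tier with
# category-level exclusions. Tier names and grade sets are assumptions.
ALLOWED = {
    "strict": {"A"},
    "standard": {"A", "B"},
    "broad": {"A", "B", "C"},
}

def eligible(page_grade, flagged, tier, excluded_categories):
    """A page is eligible when its grade is allowed by the chosen tier
    and none of its flagged categories are explicitly excluded."""
    return page_grade in ALLOWED[tier] and not (set(flagged) & set(excluded_categories))

# A Grade B page flagged only for obscenity passes a standard-tier buyer
# who excludes identity-based attacks; one flagged for identity_attack fails.
print(eligible("B", {"obscene"}, "standard", {"identity_attack"}))          # True
print(eligible("B", {"identity_attack"}, "standard", {"identity_attack"}))  # False
```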
✓ Open methodology
- Model details, thresholds, and scoring code are available for audit.
✓ Bias mitigation
- We use fairness-optimized models and conduct regular audits to reduce disproportionate impact on minority content.
✓ Full control
- CSV exports, dashboard tools, and override options let you tune your brand safety settings.
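Working with a grade export might look like the sketch below. The column names (`url`, `grade`) are assumptions about the export format, and the sample rows are fabricated for illustration only.

```python
import csv
import io

# Illustrative filtering of a brand-safety CSV export down to the pages a
# Grades A & B buyer would accept. Column names and rows are assumptions.
sample = io.StringIO("url,grade\n/define/foo,A\n/define/bar,C\n")
rows = [r for r in csv.DictReader(sample) if r["grade"] in {"A", "B"}]
print([r["url"] for r in rows])  # ["/define/foo"]
```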
Have a question about our brand safety policies?
Get detailed information about our moderation system, thresholds, and implementation.