We use an industry-standard content moderation API that powers safety systems across major platforms. Each definition receives a probability score (0.0–1.0) for every category.
Every page is classified as either SFW (Safe For Work) or NSFW (Not Safe For Work). No ambiguity, no surprises.
Note: Pages without completed safety review default to NSFW to protect advertisers.
The majority of our content is classified as SFW
Each page is evaluated independently. When we mark content as advertiser-safe, that specific page meets all standards.
Urban Dictionary applies two distinct moderation layers to every piece of content — one for publication, and another for advertiser safety.
Industry-standard scoring
- We use the same content moderation API trusted by major platforms worldwide for content safety.
Binary classification
- Simple SFW/NSFW classification. No complicated tiers or ambiguous grades. Safe content or blocked content.
Conservative thresholds
- We err on the side of caution. We'd rather block borderline content than risk your brand.
Have a question about our brand safety policies?
Get detailed information about our moderation system, thresholds, and implementation.