Data Licensing
The largest living archive of how people actually talk.
We’ve been recording the unwritten English language since 1999. Every entry is written and ranked by people who use the words. Nothing here was scraped or auto-generated.
What’s available
The corpus
Millions of definitions with examples, going back to 1999. Each entry comes with author, timestamp, vote counts, and where it ranked in the community.
The live feed
New definitions, edits, votes, and searches as they happen. The earliest signal that a word is entering circulation.
Cultural signals
Popularity of searches and definitions across recent activity, at fine resolution. For the deeper historical trail, use the corpus itself: every entry is timestamped, so you can see when a term first surfaced.
Safety and quality metadata
Every entry is scored across 13 brand-safety categories by an ML pipeline that gets retrained continuously. The scores can ship alongside the content.
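To make the corpus fields concrete, here is a minimal Python sketch of how a licensee might model an entry and look up when a term first surfaced. It is an illustration only, not the delivered schema: the class name, every field name, the `first_surfaced` helper, and the sample rows (including their timestamps and vote counts) are assumptions made up for this example.

```python
from dataclasses import dataclass
from datetime import datetime, timezone
from typing import Iterable, Optional


@dataclass(frozen=True)
class Entry:
    """One definition record. All field names here are hypothetical."""
    term: str
    definition: str
    author: str
    created_at: datetime  # when the entry was written
    upvotes: int
    downvotes: int
    rank: int             # assumed: position among definitions of the term, 1 = top


def first_surfaced(entries: Iterable[Entry], term: str) -> Optional[datetime]:
    """Return the earliest timestamp recorded for `term`, or None if absent."""
    wanted = term.casefold()
    stamps = [e.created_at for e in entries if e.term.casefold() == wanted]
    return min(stamps) if stamps else None


# Made-up rows for illustration; these are not real corpus data.
SAMPLE = [
    Entry("rizz", "placeholder text", "user_a",
          datetime(2022, 3, 14, tzinfo=timezone.utc), 120, 8, 1),
    Entry("rizz", "placeholder text", "user_b",
          datetime(2021, 6, 1, tzinfo=timezone.utc), 40, 5, 2),
    Entry("glaze", "placeholder text", "user_c",
          datetime(2023, 1, 9, tzinfo=timezone.utc), 75, 3, 1),
]

if __name__ == "__main__":
    print(first_surfaced(SAMPLE, "rizz"))
```

Matching on a case-folded term is a design choice for the sketch; a real pipeline would follow whatever normalization the delivered data uses.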
Why this corpus is hard to substitute

Most language on the open web is formal: news, encyclopedias, product reviews, support forums. The vocabulary people use when they’re not being watched is mostly missing from training sets.

Urban Dictionary is the exception. The community has documented this layer of language since 1999, in plain text, with definitions written by the people who use the words.

A model trained on it will know what “rizz” or “glaze” or “menty b” actually mean. A model that wasn’t, won’t.

Useful for
Language model training and fine-tuning
Vernacular coverage for general-purpose models, and grounding data for models that need to handle slang. Bulk corpus access or ongoing-update subscriptions.
Retrieval and answer-engine context
Licensed access for retrieval systems that need to answer questions about slang and culture at scale.
Linguistic and cultural research
How vernacular changes by region and over time. Research-tier terms available for academic users.
Trend and brand intelligence
Search and definition signals as an early indicator of cultural moments before they reach mainstream measurement.
How licensing works

The four assets above can be licensed separately or together. Commercial and research tiers are priced differently.

Delivery can be a one-time corpus dump, scheduled updates, or a live feed connection. Exclusivity is available for the live feed and for early access to new entries. The historical corpus is licensed non-exclusively.

Tell us what you’re trying to do. We’ll come back with terms.

© 2026 Urban Dictionary LLC

Get in touch

We'll get back to you within 24 hours.