scientists remove Urban Dictionary database
IBM's Watson Memorized the Entire 'Urban Dictionary,' Then His Overlords Had to Delete It - Alexis C. Madrigal - The Atlantic
[ame="http://www.youtube.com/watch?v=WFR3lOm_xhE"]IBM's Watson supercomputer destroys all humans in Jeopardy - YouTube[/ame]
IBM's Watson Memorized the Entire 'Urban Dictionary,' Then His Overlords Had to Delete It - Alexis C. Madrigal - The Atlantic
And so, when IBM's famous artificial intelligence, Watson, he/she/it of Jeopardy-winning fame, was in development, its head researcher had a great idea. Humans created this repository of slang, The Urban Dictionary. For example, today on the site, we learn that 'healthy gas' is "the gas (fart) produced from a person who has eaten healthy foods like cabbage, beans, broccolli, grains, or other high fiber, high carbohydrate foods."
Brown realized that this formalization of informal language might be a great way for Watson to understand the way real people communicate. So, he and his team, fed the whole thing into their AI.
But one problem. Informal language has a tendency to be dirty, nasty language. Its insults and cuss words, new names for gross old things, old names for gross new things, etc. And so, we learn from Fortune's Michal Lev-Ram, they had to delete all that human messiness from Watson's memory.
Watson couldn't distinguish between polite language and profanity -- which the Urban Dictionary is full of. Watson picked up some bad habits from reading Wikipedia as well. In tests it even used the word "bullshyt" in an answer to a researcher's query.
Ultimately, Brown's 35-person team developed a filter to keep Watson from swearing and scraped the Urban Dictionary from its memory.
[ame="http://www.youtube.com/watch?v=WFR3lOm_xhE"]IBM's Watson supercomputer destroys all humans in Jeopardy - YouTube[/ame]