
IBM Researchers Purge Urban Dictionary Data From Watson’s Memory Because It Learned To Swear


Watson, the IBM supercomputer best known for crushing “Jeopardy!” contestants at their own game, briefly went from “smart” to “smart ass” with the help of the Urban Dictionary.

According to Eric Brown, an IBM research assistant and the “brains” behind Watson, he and his 35-person team wanted to get IBM’s supercomputer to sound more like a real human. In Brown’s mind, what better way to learn the intricacies of informal human communication and conversation than having Watson memorize the Urban Dictionary?

The Urban Dictionary defines itself as “a place formerly used to find out about slang, and now a place that teens with no life use as a burn book to whine about celebrities, their friends, etc., let out their sexual frustrations, show off their opinions, troll, and babble about things.”

However, the Urban Dictionary has a few useful definitions, including Internet abbreviations like OMG, and slang that humans use every day, such as calling someone a “hot mess.” Brown believed Watson could be more human if it could learn these kinds of language complexities, so in 2011, shortly after Watson’s reign as “Jeopardy!” champ, Brown taught Watson the Urban Dictionary.

What could have been another landmark for Watson, the ability to take part in and enjoy a full conversation in natural, informal human language, turned out to be a step in the wrong direction.

Watson may have learned the Urban Dictionary, but it never learned the all-important axiom, “There’s a time and a place for everything.” Watson simply couldn’t distinguish polite discourse from profanity.

Unfortunately, Watson learned all of the Urban Dictionary’s bad habits, including throwing overly crass language into its responses at random points; it reportedly even used the word “bullshit” in an answer to one researcher’s question. Brown told Forbes that Watson picked up similarly bad habits from reading Wikipedia.

Brown and his team removed the Urban Dictionary from Watson’s vocabulary and developed a smart filter to keep Watson from swearing in the future. It is yet another piece of evidence that robots will not replace genuine human interaction in the foreseeable future.

[More.](http://www.ibtimes.com/ibms-watson-gets-swear-filter-after-learning-urban-dictionary-1007734)

Dirty Robot Artwork: TM Tim Lahan
