Sciweavers

LREC
2008

A Multi-Lingual Dictionary of Dirty Words

14 years 1 months ago
A Multi-Lingual Dictionary of Dirty Words
We present a multi-lingual dictionary of dirty words. We have collected about 3,200 dirty words in several languages and built a database of these. The language with the most words in the database is English, though there are several hundred dirty words in for instance Japanese too. Words are classified into their general meaning, such as what part of the human anatomy they refer to. Words can also be assigned a nuance label to indicate if it is a cute word used when speaking to children, a very rude word, a clinical word etc. The database is available online and will hopefully be enlarged over time. It has already been used in research on for instance automatic joke generation and emotion detection.
Jonas Sjöbergh, Kenji Araki
Added 29 Oct 2010
Updated 29 Oct 2010
Type Conference
Year 2008
Where LREC
Authors Jonas Sjöbergh, Kenji Araki
Comments (0)