EMNLP
2007

Building Lexicon for Sentiment Analysis from Massive Collection of HTML Documents

Download www.aclweb.org

Recognizing polarity requires a list of polar words and phrases. To build such a lexicon automatically, many studies have investigated (semi-)unsupervised methods of learning the polarity of words and phrases. In this paper, we explore the use of structural clues that can extract polar sentences from Japanese HTML documents, and build a lexicon from the extracted polar sentences. The key idea is to develop the structural clues so that they achieve extremely high precision at the cost of recall. To compensate for the low recall, we used a massive collection of HTML documents. Thus, we could prepare a sufficiently large polar sentence corpus.
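
The pipeline in the abstract (extract polar sentences with a high-precision structural clue, then derive a lexicon from them) can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration, not the authors' method: the abstract does not say which structural clues are used, so the sketch assumes a hypothetical clue (a table row whose first cell is a heading such as "Pros" or "Cons"), uses English text and a regex tokenizer instead of Japanese morphological analysis, and scores words with a simple smoothed log-ratio. The names `POSITIVE_CUES`, `NEGATIVE_CUES`, `PolarRowParser`, `extract_polar_sentences`, and `build_lexicon` are all hypothetical.

```python
import math
import re
from collections import Counter
from html.parser import HTMLParser

# Hypothetical cue headings (assumption): a row headed by one of these
# is taken to contain polar sentences. Other rows are ignored, which
# trades recall for precision.
POSITIVE_CUES = {"pros", "good points"}
NEGATIVE_CUES = {"cons", "bad points"}


class PolarRowParser(HTMLParser):
    """Collects (sentence, label) pairs from table rows with a cue heading."""

    def __init__(self):
        super().__init__()
        self.cells = []       # text of each cell in the current row
        self.in_cell = False
        self.sentences = []   # extracted (sentence, +1 or -1) pairs

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self.cells = []
        elif tag in ("th", "td"):
            self.in_cell = True
            self.cells.append("")

    def handle_data(self, data):
        if self.in_cell and self.cells:
            self.cells[-1] += data

    def handle_endtag(self, tag):
        if tag in ("th", "td"):
            self.in_cell = False
        elif tag == "tr" and len(self.cells) >= 2:
            head = self.cells[0].strip().lower()
            label = 1 if head in POSITIVE_CUES else -1 if head in NEGATIVE_CUES else 0
            if label:
                for text in self.cells[1:]:
                    if text.strip():
                        self.sentences.append((text.strip(), label))
            self.cells = []


def extract_polar_sentences(html):
    """Return the polar sentences matched by the assumed structural clue."""
    parser = PolarRowParser()
    parser.feed(html)
    parser.close()
    return parser.sentences


def build_lexicon(sentences, min_count=1):
    """Score each word by a smoothed log-ratio of positive to negative sentence counts."""
    pos, neg = Counter(), Counter()
    for text, label in sentences:
        words = set(re.findall(r"[a-z']+", text.lower()))
        (pos if label > 0 else neg).update(words)
    lexicon = {}
    for word in set(pos) | set(neg):
        if pos[word] + neg[word] >= min_count:
            lexicon[word] = math.log((pos[word] + 1) / (neg[word] + 1))
    return lexicon
```

With a page containing a "Pros" row and a "Cons" row, `build_lexicon(extract_polar_sentences(page))` gives positive scores to words seen only in the "Pros" sentences and negative scores to words seen only in the "Cons" sentences. A narrow clue like this misses most polar sentences on any single page, which is why a very large document collection is needed to obtain a corpus of useful size.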

Nobuhiro Kaji, Masaru Kitsuregawa

EMNLP 2007 | Natural Language Processing | Polar Sentence | Polar Words | Structural Clues |

Related Content

» Learning the lexicon from raw texts for open-vocabulary Korean word recognition

» A tool set for the quick and efficient exploration of large document collections

» UTDallas at TREC 2008 Blog Track

» Parsing N-Best Lists of Handwritten Sentences

» Elimination of junk document surrogate candidates through pattern recognition

» A machine learning based approach for table detection on the web

Post Info

Added	29 Oct 2010
Updated	29 Oct 2010
Type	Conference
Year	2007
Where	EMNLP
Authors	Nobuhiro Kaji, Masaru Kitsuregawa
