Sciweavers

187 search results - page 9 / 38
» Clairlib: A Toolkit for Natural Language Processing, Informa...
SAC
2006
ACM
14 years 1 month ago
Light stemming approaches for the French, Portuguese, German and Hungarian languages
This paper describes and evaluates various general stemming approaches for the French, Portuguese (Brazilian), German and Hungarian languages. Based on the CLEF test-collections, ...
Jacques Savoy
CICLING
2009
Springer
14 years 8 months ago
Business Specific Online Information Extraction from German Websites
This paper presents a system that uses the domain name of a German business website to locate its information pages (e.g. company profile, contact page, imprint) and then identifi...
Yeong Su Lee, Michaela Geierhos
WSDM
2010
ACM
14 years 5 months ago
Adapting Information Bottleneck Method for Automatic Construction of Domain-oriented Sentiment Lexicon
Domain-oriented sentiment lexicons are widely used for fine-grained sentiment analysis on reviews; therefore, the automatic construction of a domain-oriented sentiment lexicon is a f...
Songbo Tan, Weifu Du, Xiaochun Yun, Xueqi Cheng
CIKM
2006
Springer
13 years 11 months ago
Multi-task text segmentation and alignment based on weighted mutual information
Text segmentation is important for text analysis, while text alignment determines shared sub-topics among similar documents. Multi-task text segmentation and alignment is the...
Bingjun Sun, Ding Zhou, Hongyuan Zha, John Yen
CIKM
2010
Springer
13 years 6 months ago
Third workshop on exploiting semantic annotations in information retrieval (ESAIR): CIKM 2010 workshop
There is an increasing amount of structure on the Web as a result of modern Web languages, user tagging and annotation, and emerging robust NLP tools. These meaningful, semantic, ...
Jaap Kamps, Jussi Karlgren, Ralf Schenkel