Sciweavers

1353 search results - page 130 / 271
» Text Indexing with Errors
Sort
View
JCDL
2006
ACM
167views Education» more  JCDL 2006»
14 years 1 months ago
Combining DOM tree and geometric layout analysis for online medical journal article segmentation
We describe an HTML web page segmentation algorithm, which is applied to segment online medical journal articles (regular HTML and PDF-Converted-HTML files). The web page content ...
Jie Zou, Daniel X. Le, George R. Thoma
SAC
2006
ACM
14 years 1 months ago
Exploiting partial decision trees for feature subset selection in e-mail categorization
In this paper we propose PARTfs which adopts a supervised machine learning algorithm, namely partial decision trees, as a method for feature subset selection. In particular, it is...
Helmut Berger, Dieter Merkl, Michael Dittenbach
CIKM
2009
Springer
13 years 12 months ago
Terminology mining in social media
The highly variable and dynamic word usage in social media presents serious challenges for both research and those commercial applications that are geared towards blogs or other u...
Magnus Sahlgren, Jussi Karlgren
DGO
2007
174views Education» more  DGO 2007»
13 years 9 months ago
A bootstrapping approach for identifying stakeholders in public-comment corpora
A stakeholder is an individual, group, organization, or community that has an interest or stake in a consensus-building process. The goal of stakeholder identification is identify...
Jaime Arguello, Jamie Callan
RIAO
2000
13 years 9 months ago
Language-Based Multimedia Information Retrieval
This paper describes various methods and approaches for language-based multimedia information retrieval, which have been developed in the projects POP-EYE and OLIVE and which will...
Franciska de Jong, Jean-Luc Gauvain, Djoerd Hiemst...