Sciweavers

» Mining Relevant Text from Unlabelled Documents
SIGIR
2010
ACM
13 years 11 months ago
Combining coregularization and consensus-based self-training for multilingual text categorization
We investigate the problem of learning document classifiers in a multilingual setting, from collections where labels are only partially available. We address this problem in the ...
Massih-Reza Amini, Cyril Goutte, Nicolas Usunier
DMKD
2000
ACM
13 years 11 months ago
Combining Strategies for Extracting Relations from Text Collections
Text documents often contain valuable structured data that is hidden in regular English sentences. This data is best exploited if available as a relational table that we could use...
Eugene Agichtein, Eleazar Eskin, Luis Gravano
ACL
2006
13 years 8 months ago
A DOM Tree Alignment Model for Mining Parallel Data from the Web
This paper presents a new web mining scheme for parallel data acquisition. Based on the Document Object Model (DOM), a web page is represented as a DOM tree. Then a DOM tree align...
Lei Shi, Cheng Niu, Ming Zhou, Jianfeng Gao
KDD
1997
ACM
13 years 11 months ago
Discovering Trends in Text Databases
We describe a system we developed for identifying trends in text documents collected over a period of time. Trends can be used, for example, to discover that a company is shifting...
Brian Lent, Rakesh Agrawal, Ramakrishnan Srikant
ISI
2007
Springer
14 years 1 month ago
Mining Higher-Order Association Rules from Distributed Named Entity Databases
The burgeoning amount of textual data in distributed sources combined with the obstacles involved in creating and maintaining central repositories motivates the need for effective ...
Shenzhi Li, Christopher D. Janneck, Aditya P. Bela...