Sciweavers

41 search results - page 7 / 9
» Corpus Based Unsupervised Labeling of Documents
KDD
2010
ACM
13 years 10 months ago
Fast query execution for retrieval models based on path-constrained random walks
Many recommendation and retrieval tasks can be represented as proximity queries on a labeled directed graph, with typed nodes representing documents, terms, and metadata, and labe...
Ni Lao, William W. Cohen
SIGIR
2010
ACM
13 years 10 months ago
Combining coregularization and consensus-based self-training for multilingual text categorization
We investigate the problem of learning document classifiers in a multilingual setting, from collections where labels are only partially available. We address this problem in the ...
Massih-Reza Amini, Cyril Goutte, Nicolas Usunier
SIGIR
2005
ACM
14 years 4 days ago
Multi-label informed latent semantic indexing
Latent semantic indexing (LSI) is a well-known unsupervised approach for dimensionality reduction in information retrieval. However, if the output information (i.e. category labels...
Kai Yu, Shipeng Yu, Volker Tresp
PVLDB
2008
13 years 6 months ago
WebTables: exploring the power of tables on the web
The World-Wide Web consists of a huge number of unstructured documents, but it also contains structured data in the form of HTML tables. We extracted 14.1 billion HTML tables from...
Michael J. Cafarella, Alon Y. Halevy, Daisy Zhe Wa...
LREC
2010
13 years 8 months ago
Automatic Discovery of Semantic Relations using MindNet
Information extraction deals with extracting entities (such as people, organizations or locations) and named relations between entities (such as "People born-in Country")...
Zareen Syed, Evelyne Viegas, Savas Parastatidis