Search Sciweavers | Sciweavers

538 search results - page 15 / 108

» Mining Relevant Text from Unlabelled Documents

click to vote

SIGIR
2010
ACM

137views Information Technology» more SIGIR 2010»

Combining coregularization and consensus-based self-training for multilingual text categorization

13 years 11 months ago

Download webia.lip6.fr

We investigate the problem of learning document classiﬁers in a multilingual setting, from collections where labels are only partially available. We address this problem in the ...

Massih-Reza Amini, Cyril Goutte, Nicolas Usunier

claim paper

Read More »

click to vote

DMKD
2000
ACM

110views Data Mining» more DMKD 2000»

Combining Strategies for Extracting Relations from Text Collections

13 years 11 months ago

Download www.cs.columbia.edu

Text documents often contain valuable structured data that is hidden in regular English sentences. This data is best exploited if available as a relational table that we could use...

Eugene Agichtein, Eleazar Eskin, Luis Gravano

claim paper

Read More »

click to vote

ACL
2006

141views Computational Linguistics» more ACL 2006»

A DOM Tree Alignment Model for Mining Parallel Data from the Web

13 years 8 months ago

Download research.microsoft.com

This paper presents a new web mining scheme for parallel data acquisition. Based on the Document Object Model (DOM), a web page is represented as a DOM tree. Then a DOM tree align...

Lei Shi, Cheng Niu, Ming Zhou, Jianfeng Gao

claim paper

Read More »

click to vote

KDD
1997
ACM

120views Data Mining» more KDD 1997»

Discovering Trends in Text Databases

13 years 11 months ago

Download rakesh.agrawal-family.com

We describe a system we developed for identifying trends in text documents collected over a period of time. Trends can be used, for example, to discover that a company is shifting...

Brian Lent, Rakesh Agrawal, Ramakrishnan Srikant

claim paper

Read More »

click to vote

ISI
2007
Springer

228views Security Privacy» more ISI 2007»

Mining Higher-Order Association Rules from Distributed Named Entity Databases

14 years 1 months ago

Download www.dimacs.rutgers.edu

The burgeoning amount of textual data in distributed sources combined with the obstacles involved in creating and maintaining central repositories motivates the need for effective ...

Shenzhi Li, Christopher D. Janneck, Aditya P. Bela...

claim paper

Read More »

« Prev « First page 15 / 108 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers