Search Sciweavers | Sciweavers

466 search results - page 19 / 94

» Scalable Feature Extraction from Noisy Documents

193

Voted

PAKDD
2010
ACM

167views Data Mining» more PAKDD 2010»

Resource-Bounded Information Extraction: Acquiring Missing Feature Values on Demand

15 years 11 months ago

Download www.cs.umass.edu

We present a general framework for the task of extracting speciﬁc information “on demand” from a large corpus such as the Web under resource-constraints. Given a database wit...

Pallika Kanani, Andrew McCallum, Shaohan Hu

claim paper

Read More »

299

click to vote

ICDE
2004
IEEE

117views Database» more ICDE 2004»

Probe, Cluster, and Discover: Focused Extraction of QA-Pagelets from the Deep Web

16 years 8 months ago

Download www.cc.gatech.edu

In this paper, we introduce the concept of a QA-Pagelet to refer to the content region in a dynamic page that contains query matches. We present THOR, a scalable and efficient min...

James Caverlee, Ling Liu, David Buttler

claim paper

Read More »

228

click to vote

WWW
2006
ACM

147views Internet Technology» more WWW 2006»

POLYPHONET: an advanced social network extraction system from the web

16 years 8 months ago

Download www2006.org

Social networks play important roles in the Semantic Web: knowledge management, information retrieval, ubiquitous computing, and so on. We propose a social network extraction syst...

Hideaki Takeda, Junichiro Mori, Kôiti Hasida...

claim paper

Read More »

207

click to vote

IJDAR
2002

108views more IJDAR 2002»

Document understanding for a broad class of documents

15 years 7 months ago

Download www.cs.rug.nl

We present a document analysis system able to assign logical labels and extract the reading order in a broad set of documents. All information sources, from geometric features and ...

Marco Aiello, Christof Monz, Leon Todoran

claim paper

Read More »

234

Voted

IPM
2002

106views more IPM 2002»

A feature mining based approach for the classification of text documents into disjoint classes

15 years 7 months ago

Download www.csc.lsu.edu

This paper proposes a new approach for classifying text documents into two disjoint classes. The new approach is based on extracting patterns, in the form of two logical expressio...

Salvador Nieto Sánchez, Evangelos Triantaph...

claim paper

Read More »

« Prev « First page 19 / 94 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers