Search Sciweavers | Sciweavers

232 search results - page 29 / 47

» Query-related data extraction of hidden web documents


TOIS
2008


Classification-aware hidden-web text database selection

15 years 3 months ago

Download archive.nyu.edu

Many valuable text databases on the web have non-crawlable contents that are "hidden" behind search interfaces. Metasearchers are helpful tools for searching over multip...

Panagiotis G. Ipeirotis, Luis Gravano



PVLDB
2008


WebTables: exploring the power of tables on the web

15 years 3 months ago

Download turing.cs.washington.edu

The World-Wide Web consists of a huge number of unstructured documents, but it also contains structured data in the form of HTML tables. We extracted 14.1 billion HTML tables from...

Michael J. Cafarella, Alon Y. Halevy, Daisy Zhe Wa...



PAKDD
2009
ACM

Data Mining

Scalable Web Mining with Newistic

15 years 10 months ago

Download www.horatiumocian.com

Newistic is a web mining platform that collects and analyses documents crawled from the Internet. Although it currently processes news articles, it can be easily adapted ...

Ovidiu Dan, Horatiu Mocian



KDD
2004
ACM

Data Mining

Boosting for Text Classification with Semantic Features

16 years 4 months ago

Download www.aifb.uni-karlsruhe.de

Current text classification systems typically use term stems for representing document content. Semantic Web technologies allow the usage of features on a higher semantic...

Stephan Bloehdorn, Andreas Hotho



KDD
2005
ACM

Data Mining

Web object indexing using domain knowledge

16 years 4 months ago

Download research.microsoft.com

Web object is defined to represent any meaningful object embedded in web pages (e.g. images, music) or pointed to by hyperlinks (e.g. downloadable files). Users usually search for...

Muyuan Wang, Zhiwei Li, Lie Lu, Wei-Ying Ma, Naiya...

