Search Sciweavers | Sciweavers

511 search results - page 77 / 103

» Discovering data dependencies in Web content mining

138

click to vote

WSDM
2010
ACM

204views Data Mining» more WSDM 2010»

Learning URL patterns for webpage de-duplication

15 years 11 months ago

Download www.wsdm-conference.org

Presence of duplicate documents in the World Wide Web adversely aﬀects crawling, indexing and relevance, which are the core building blocks of web search. In this paper, we pres...

Hema Swetha Koppula, Krishna P. Leela, Amit Agarwa...

claim paper

Read More »

142

click to vote

DMKD
1997
ACM

198views Data Mining» more DMKD 1997»

Clustering Based On Association Rule Hypergraphs

15 years 8 months ago

Download glaros.dtc.umn.edu

Clustering in data mining is a discovery process that groups a set of data such that the intracluster similarity is maximized and the intercluster similarity is minimized. These d...

Eui-Hong Han, George Karypis, Vipin Kumar, Bamshad...

claim paper

Read More »

171

click to vote

SIGIR
2009
ACM

175views Information Technology» more SIGIR 2009»

Web derived pronunciations for spoken term detection

15 years 11 months ago

Download symptotic.com

Indexing and retrieval of speech content in various forms such as broadcast news, customer care data and on-line media has gained a lot of interest for a wide range of application...

Dogan Can, Erica Cooper, Arnab Ghoshal, Martin Jan...

claim paper

Read More »

151

click to vote

SSDBM
2008
IEEE

149views Database» more SSDBM 2008»

Query Planning for Searching Inter-dependent Deep-Web Databases

15 years 10 months ago

Download www.cse.ohio-state.edu

Increasingly, many data sources appear as online databases, hidden behind query forms, thus forming what is referred to as the deep web. It is desirable to have systems that can pr...

Fan Wang, Gagan Agrawal, Ruoming Jin

claim paper

Read More »

122

click to vote

SAC
2005
ACM

153views Applied Computing» more SAC 2005»

Automatic extraction of informative blocks from webpages

15 years 10 months ago

Download clgiles.ist.psu.edu

Search engines crawl and index webpages depending upon their informative content. However, webpages — especially dynamically generated ones — contain items that cannot be clas...

Sandip Debnath, Prasenjit Mitra, C. Lee Giles

claim paper

Read More »

« Prev « First page 77 / 103 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers