Search Sciweavers | Sciweavers

35 search results - page 7 / 7

» Document centered approach to text normalization

177

click to vote

HPDC
2003
IEEE

132views Distributed And Parallel Com...» more HPDC 2003»

PlanetP: Using Gossiping to Build Content Addressable Peer-to-Peer Information Sharing Communities

16 years 23 hour ago

Download www.gnunet.org

Abstract. We present PlanetP, a peer-to-peer (P2P) content search and retrieval infrastructure targeting communities wishing to share large sets of text documents. P2P computing is...

Francisco Matias Cuenca-Acuna, Christopher Peery, ...

claim paper

Read More »

185

Voted

WSDM
2010
ACM

204views Data Mining» more WSDM 2010»

Learning URL patterns for webpage de-duplication

16 years 1 months ago

Download www.wsdm-conference.org

Presence of duplicate documents in the World Wide Web adversely aﬀects crawling, indexing and relevance, which are the core building blocks of web search. In this paper, we pres...

Hema Swetha Koppula, Krishna P. Leela, Amit Agarwa...

claim paper

Read More »

182

click to vote

SIGIR
2009
ACM

134views Information Technology» more SIGIR 2009»

Addressing morphological variation in alphabetic languages

16 years 1 months ago

Download web.jhu.edu

The selection of indexing terms for representing documents is a key decision that limits how eﬀective subsequent retrieval can be. Often stemming algorithms are used to normaliz...

Paul McNamee, Charles K. Nicholas, James Mayfield

claim paper

Read More »

207

Voted

SIGIR
2008
ACM

162views Information Technology» more SIGIR 2008»

Multi-document summarization via sentence-level semantic analysis and symmetric matrix factorization

15 years 6 months ago

Download users.cis.fiu.edu

Multi-document summarization aims to create a compressed summary while retaining the main characteristics of the original set of documents. Many approaches use statistics and mach...

Dingding Wang, Tao Li, Shenghuo Zhu, Chris H. Q. D...

claim paper

Read More »

199

Voted

ISI
2007
Springer

228views Security Privacy» more ISI 2007»

Mining Higher-Order Association Rules from Distributed Named Entity Databases

16 years 27 days ago

Download www.dimacs.rutgers.edu

The burgeoning amount of textual data in distributed sources combined with the obstacles involved in creating and maintaining central repositories motivates the need for effective ...

Shenzhi Li, Christopher D. Janneck, Aditya P. Bela...

claim paper

Read More »

« Prev « First page 7 / 7 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers