Information Management

153

CIKM
2008
Springer

115views Information Technology» more CIKM 2008»

An extension of PLSA for document clustering

15 years 8 months ago

In this paper we propose an extension of the PLSA model in which an extra latent variable allows the model to cocluster documents and terms simultaneously. We show on three datase...

Young-Min Kim, Jean-François Pessiot, Massi...

claim paper

Read More »

177

click to vote

CIKM
2008
Springer

136views Information Technology» more CIKM 2008»

Estimating the number of answers with guarantees for structured queries in p2p databases

15 years 8 months ago

Download www.dbis.prakinf.tu-ilmenau.de

Structured P2P overlays supporting standard database functionalities are a popular choice for building large-scale distributed data management systems. In such systems, estimating...

Marcel Karnstedt, Kai-Uwe Sattler, Michael Ha&szli...

claim paper

Read More »

160

click to vote

CIKM
2008
Springer

129views Information Technology» more CIKM 2008»

A new method for indexing genomes using on-disk suffix trees

15 years 8 months ago

Download webhome.cs.uvic.ca

We propose a new method to build persistent suffix trees for indexing the genomic data. Our algorithm DiGeST (Disk-Based Genomic Suffix Tree) improves significantly over previous ...

Marina Barsky, Ulrike Stege, Alex Thomo, Chris Upt...

claim paper

Read More »

162

click to vote

CIKM
2008
Springer

212views Information Technology» more CIKM 2008»

Modeling LSH for performance tuning

15 years 8 months ago

Download www.cs.princeton.edu

Although Locality-Sensitive Hashing (LSH) is a promising approach to similarity search in high-dimensional spaces, it has not been considered practical partly because its search q...

Wei Dong, Zhe Wang, William Josephson, Moses Chari...

claim paper

Read More »

118

Voted

CIKM
2008
Springer

130views Information Technology» more CIKM 2008»

Extremely fast text feature extraction for classification and indexing

15 years 8 months ago

Download www.hpl.hp.com

George Forman, Evan Kirshenbaum

claim paper

Read More »

155

click to vote

CIKM
2008
Springer

86views Information Technology» more CIKM 2008»

To swing or not to swing: learning when (not) to advertise

15 years 8 months ago

Download fontoura.org

Web textual advertising can be interpreted as a search problem over the corpus of ads available for display in a particular context. In contrast to conventional information retrie...

Andrei Z. Broder, Massimiliano Ciaramita, Marcus F...

claim paper

Read More »

149

click to vote

CIKM
2008
Springer

133views Information Technology» more CIKM 2008»

Achieving both high precision and high recall in near-duplicate detection

15 years 8 months ago

Download www.infomall.cn

To find near-duplicate documents, fingerprint-based paradigms such as Broder's shingling and Charikar's simhash algorithms have been recognized as effective approaches a...

Lian'en Huang, Lei Wang, Xiaoming Li

claim paper

Read More »

166

click to vote

CIKM
2008
Springer

167views Information Technology» more CIKM 2008»

Combining concept hierarchies and statistical topic models

15 years 8 months ago

Download www.datalab.uci.edu

Statistical topic models provide a general data-driven framework for automated discovery of high-level knowledge from large collections of text documents. While topic models can p...

Chaitanya Chemudugunta, Padhraic Smyth, Mark Steyv...

claim paper

Read More »

182

click to vote

CIKM
2008
Springer

185views Information Technology» more CIKM 2008»

Modeling multi-step relevance propagation for expert finding

15 years 8 months ago

Download wwwhome.cs.utwente.nl

An expert finding system allows a user to type a simple text query and retrieve names and contact information of individuals that possess the expertise expressed in the query. Thi...

Pavel Serdyukov, Henning Rode, Djoerd Hiemstra

claim paper

Read More »

159

click to vote

CIKM
2008
Springer

114views Information Technology» more CIKM 2008»

Trada: tree based ranking function adaptation

15 years 8 months ago

Download www.cs.wright.edu

Machine Learned Ranking approaches have shown successes in web search engines. With the increasing demands on developing effective ranking functions for different search domains, ...

Keke Chen, Rongqing Lu, C. K. Wong, Gordon Sun, La...

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers