Information Technology

CIKM
2011
Springer

Factorization-based lossless compression of inverted indices

14 years 7 months ago

Many large-scale Web applications that require ranked top-k retrieval are implemented using inverted indices. An inverted index represents a sparse term-document matrix, where non...

George Beskales, Marcus Fontoura, Maxim Gurevich, ...

Learning to aggregate vertical results into web search results

14 years 7 months ago

Download ciir.cs.umass.edu

Aggregated search is the task of integrating results from potentially multiple specialized search services, or verticals, into the Web search results. The task requires predicting...

Jaime Arguello, Fernando Diaz, Jamie Callan

Mining entity translations from comparable corpora: a holistic graph mapping approach

14 years 7 months ago

Download www.postech.ac.kr

This paper addresses the problem of mining named entity translations from comparable corpora, specifically, mining English and Chinese named entity translation. We first observe...

Jinhan Kim, Long Jiang, Seung-won Hwang, Young-In ...

LogSig: generating system events from raw textual logs

14 years 7 months ago

Download users.cs.fiu.edu

Modern computing systems generate large amounts of log data. System administrators or domain experts utilize the log data to understand and optimize system behaviors. Most system ...

Liang Tang, Tao Li, Chang-Shing Perng

Citation chain aggregation: an interaction model to support citation cycling

14 years 7 months ago

Download www.brunel.ac.uk

Timothy Cribbin

Classifying trending topics: a typology of conversation triggers on Twitter

14 years 7 months ago

Download nlp.uned.es

Twitter summarizes the great deal of messages posted by users in the form of trending topics that reflect the top conversations being discussed at a given moment. These trending ...

Arkaitz Zubiaga, Damiano Spina, Víctor Fres...

The impact of author ranking in a library catalogue

14 years 7 months ago

Download staff.science.uva.nl

The field of information retrieval has witnessed over 50 years of research on retrieval methods for metadata descriptions and controlled indexing languages, the prototypical exam...

Jaap Kamps

Joint inference for cross-document information extraction

14 years 7 months ago

Download nlp.cs.qc.cuny.edu

Previous information extraction (IE) systems are typically organized as a pipeline architecture of separated stages which make independent local decisions. When the data grows bey...

Qi Li, Sam Anzaroot, Wen-Pin Lin, Xiang Li, Heng J...

Probabilistic near-duplicate detection using simhash

14 years 7 months ago

Download irl.cs.tamu.edu

This paper offers a novel look at using a dimensionality-reduction technique called simhash [8] to detect similar document pairs in large-scale collections. We show that this algo...

Sadhan Sood, Dmitri Loguinov

Integrating and querying web databases and documents

14 years 7 months ago

Download www2.cs.uh.edu

There exist many interrelated information sources on the Internet that can be categorized into structured (database) and semistructured (documents). A key challenge is to integrat...

Carlos Garcia-Alvarado, Carlos Ordonez
