Information Management

CIKM
2009
Springer

Robust record linkage blocking using suffix arrays

15 years 8 months ago

Record linkage is an important data integration task that has many practical uses for matching, merging and duplicate removal in large and diverse databases. However, a quadratic ...

Timothy de Vries, Hui Ke, Sanjay Chawla, Peter Chr...

Efficient feature weighting methods for ranking

15 years 8 months ago

Download dm.postech.ac.kr

Feature weighting or selection is a crucial process to identify an important subset of features from a data set. Removing irrelevant or redundant features can improve the generali...

Hwanjo Yu, Jinoh Oh, Wook-Shin Han

Classification-based resource selection

15 years 8 months ago

Download www.cs.cmu.edu

In some retrieval situations, a system must search across multiple collections. This task, referred to as federated search, occurs for example when searching a distributed index o...

Jaime Arguello, Jamie Callan, Fernando Diaz

Generating SQL/XML query and update statements

15 years 8 months ago

Download www.matthiasnicola.de

The XML support in relational databases and the SQL/XML language are still relatively new as compared to purely relational databases and traditional SQL. Today, most database user...

Matthias Nicola, Tim Kiefer

Scalable indexing of RDF graphs for efficient join processing

15 years 8 months ago

Download www.win.tue.nl

Current approaches to RDF graph indexing suffer from weak data locality, i.e., information regarding a piece of data appears in multiple locations, spanning multiple data structur...

George H. L. Fletcher, Peter W. Beck

Suffix trees for very large genomic sequences

15 years 8 months ago

Download webhome.cs.uvic.ca

A suffix tree is a fundamental data structure for string searching algorithms. Unfortunately, when it comes to the use of suffix trees in real-life applications, the current metho...

Marina Barsky, Ulrike Stege, Alex Thomo, Chris Upt...

Efficient processing of group-oriented connection queries in a large graph

15 years 8 months ago

Download www.se.cuhk.edu.hk

We study query processing in large graphs, which are a fundamental data model underpinning various social networks and Web structures. Given a set of query nodes, we aim to find the g...

James Cheng, Yiping Ke, Wilfred Ng

Ensembles in adversarial classification for spam

15 years 8 months ago

Download ebiquity.umbc.edu

The standard method for combating spam, either in email or on the web, is to train a classifier on manually labeled instances. As the spammers change their tactics, the performanc...

Deepak Chinavle, Pranam Kolari, Tim Oates, Tim Fin...

Empirical justification of the gain and discount function for nDCG

15 years 8 months ago

Download www.ccs.neu.edu

The nDCG measure has proven to be a popular measure of retrieval effectiveness utilizing graded relevance judgments. However, a number of different instantiations of nDCG exist, d...

Evangelos Kanoulas, Javed A. Aslam

Improving binary classification on text problems using differential word features

15 years 8 months ago

Download ebiquity.umbc.edu

We describe an efficient technique to weigh word-based features in binary classification tasks and show that it significantly improves classification accuracy on a range of proble...

Justin Martineau, Tim Finin, Anupam Joshi, Shamit ...
