Search Sciweavers | Sciweavers

3450 search results - page 641 / 690

» Media Content Analysis

179

click to vote

KDD
2006
ACM

179views Data Mining» more KDD 2006»

Extracting key-substring-group features for text classification

16 years 6 months ago

Download www.comp.nus.edu.sg

In many text classification applications, it is appealing to take every document as a string of characters rather than a bag of words. Previous research studies in this area mostl...

Dell Zhang, Wee Sun Lee

claim paper

Read More »

178

click to vote

KDD
2005
ACM

125views Data Mining» more KDD 2005»

Email data cleaning

16 years 6 months ago

Download research.microsoft.com

Addressed in this paper is the issue of `email data cleaning' for text mining. Many text mining applications need take emails as input. Email data is usually noisy and thus i...

Jie Tang, Hang Li, Yunbo Cao, ZhaoHui Tang

claim paper

Read More »

157

click to vote

KDD
2004
ACM

131views Data Mining» more KDD 2004»

Fast nonlinear regression via eigenimages applied to galactic morphology

16 years 6 months ago

Download www.cs.cmu.edu

Astronomy increasingly faces the issue of massive datasets. For instance, the Sloan Digital Sky Survey (SDSS) has so far generated tens of millions of images of distant galaxies, ...

Brigham Anderson, Andrew W. Moore, Andrew Connolly...

claim paper

Read More »

152

click to vote

KDD
2004
ACM

163views Data Mining» more KDD 2004»

Exploiting dictionaries in named entity extraction: combining semi-Markov extraction processes and data integration methods

16 years 6 months ago

Download www.cs.cmu.edu

We consider the problem of improving named entity recognition (NER) systems by using external dictionaries--more specifically, the problem of extending state-of-the-art NER system...

William W. Cohen, Sunita Sarawagi

claim paper

Read More »

169

click to vote

KDD
2001
ACM

203views Data Mining» more KDD 2001»

Ensemble-index: a new approach to indexing large databases

16 years 6 months ago

Download www.ics.uci.edu

The problem of similarity search (query-by-content) has attracted much research interest. It is a difficult problem because of the inherently high dimensionality of the data. The ...

Eamonn J. Keogh, Selina Chu, Michael J. Pazzani

claim paper

Read More »

« Prev « First page 641 / 690 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers