data sets | Sciweavers

156

ICMLA
2004

96views Machine Learning» more ICMLA 2004»

RAIN: data clustering using randomized interactions between data points

15 years 8 months ago

Abstract-- This paper introduces a generalization of the Gravitational Clustering Algorithm proposed by Gomez et all in [1]. First, it is extended in such a way that not only the G...

Jonatan Gómez, Olfa Nasraoui, Elizabeth Leo...

claim paper

Read More »

160

click to vote

ICAI
2004

157views Artificial Intelligence» more ICAI 2004»

Inductive System Health Monitoring

15 years 8 months ago

Download ti.arc.nasa.gov

- The Inductive Monitoring System (IMS) software was developed to provide a technique to automatically produce health monitoring knowledge bases for systems that are either difficu...

David L. Iverson

claim paper

Read More »

153

click to vote

ICAD
2004

110views Emerging Technology» more ICAD 2004»

A Toolkit for Interactive Sonification

15 years 8 months ago

Download www.icad.org

This paper describes work-in-progress on an Interactive Sonification Toolkit which has been developed in order to aid the analysis of general data sets. The toolkit allows the des...

Sandra Pauletto, Andy Hunt

claim paper

Read More »

195

click to vote

ESANN
2006

187views Neural Networks» more ESANN 2006»

Visualizing gene interaction graphs with local multidimensional scaling

15 years 8 months ago

Download www.dice.ucl.ac.be

Several bioinformatics data sets are naturally represented as graphs, for instance gene regulation, metabolic pathways, and proteinprotein interactions. The graphs are often large ...

Jarkko Venna, Samuel Kaski

claim paper

Read More »

176

click to vote

EMNLP
2006

93views Natural Language Processing» more EMNLP 2006»

Random Indexing using Statistical Weight Functions

15 years 8 months ago

Download www.cs.usyd.edu.au

Random Indexing is a vector space technique that provides an efficient and scalable approximation to distributional similarity problems. We present experiments showing Random Inde...

James Gorman, James R. Curran

claim paper

Read More »

187

click to vote

DMIN
2006

142views Data Mining» more DMIN 2006»

Parallel Hybrid Clustering using Genetic Programming and Multi-Objective Fitness with Density (PYRAMID)

15 years 8 months ago

Download ww1.ucmss.com

Clustering is the process of locating patterns in large data sets. It is an active research area that provides value to scientific as well as business applications. Practical clust...

Junping Sun, William Sverdlik, Samir Tout

claim paper

Read More »

196

Voted

EMNLP
2004

114views Natural Language Processing» more EMNLP 2004»

Automatic Paragraph Identification: A Study across Languages and Domains

15 years 8 months ago

Download ilk.uvt.nl

In this paper we investigate whether paragraphs can be identified automatically in different languages and domains. We propose a machine learning approach which exploits textual a...

Caroline Sporleder, Mirella Lapata

claim paper

Read More »

191

click to vote

SDM
2007
SIAM

118views Data Mining» more SDM 2007»

On Privacy-Preservation of Text and Sparse Binary Data with Sketches

15 years 8 months ago

Download siam.org

In recent years, privacy preserving data mining has become very important because of the proliferation of large amounts of data on the internet. Many data sets are inherently high...

Charu C. Aggarwal, Philip S. Yu

claim paper

Read More »

194

click to vote

SODA
2008
ACM

126views Algorithms» more SODA 2008»

On distributing symmetric streaming computations

15 years 8 months ago

Download webdocs.cs.ualberta.ca

A common approach for dealing with large data sets is to stream over the input in one pass, and perform computations using sublinear resources. For truly massive data sets, howeve...

Jon Feldman, S. Muthukrishnan, Anastasios Sidiropo...

claim paper

Read More »

172

click to vote

NAACL
2007

126views Computational Linguistics» more NAACL 2007»

Entity Extraction is a Boring Solved Problem - Or is it?

15 years 8 months ago

Download www.aclweb.org

This paper presents empirical results that contradict the prevailing opinion that entity extraction is a boring solved problem. In particular, we consider data sets that resemble ...

Marc Vilain, Jennifer Su, Suzi Lubar

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers