Sciweavers

766 search results - page 131 / 154
» Clustering high dimensional data using subspace and projecte...
Sort
View
OSDI
2008
ACM
14 years 9 months ago
Improving MapReduce Performance in Heterogeneous Environments
MapReduce is emerging as an important programming model for large-scale data-parallel applications such as web indexing, data mining, and scientific simulation. Hadoop is an open-...
Matei Zaharia, Andy Konwinski, Anthony D. Joseph, ...
NAACL
2003
13 years 10 months ago
Monolingual and Bilingual Concept Visualization from Corpora
e by placing terms in an abstract ‘information space’ based on their occurrences in text corpora, and then allowing a user to visualize local regions of this information space....
Dominic Widdows, Scott Cederberg
ESCIENCE
2006
IEEE
14 years 2 months ago
Grid Approach to Embarrassingly Parallel CPU-Intensive Bioinformatics Problems
Bioinformatics algorithms such as sequence alignment methods based on profile-HMM (Hidden Markov Model) are popular but CPU-intensive. If large amounts of data are processed, a s...
Heinz Stockinger, Marco Pagni, Lorenzo Cerutti, La...
ICMCS
2005
IEEE
111views Multimedia» more  ICMCS 2005»
14 years 2 months ago
Manifold learning, a promised land or work in progress?
ABSTRACT In this paper, we report our experiments using a realworld image dataset to examine the effectiveness of Isomap, LLE and KPCA. The 1,897-image dataset we used consists of ...
Mei-Chen Yeh, I-Hsiang Lee, Gang Wu, Yi Wu, Edward...
SIGIR
2008
ACM
13 years 8 months ago
Multi-document summarization via sentence-level semantic analysis and symmetric matrix factorization
Multi-document summarization aims to create a compressed summary while retaining the main characteristics of the original set of documents. Many approaches use statistics and mach...
Dingding Wang, Tao Li, Shenghuo Zhu, Chris H. Q. D...