Sciweavers

471 search results - page 19 / 95
» MapReduce: Simplified Data Processing on Large Clusters
Sort
View
TSD
2001
Springer
14 years 4 days ago
Finding Semantically Related Words in Large Corpora
The paper deals with the linguistic problem of fully automatic grouping of semantically related words. We discuss the measures of semantic relatedness of basic word forms and descr...
Pavel Smrz, Pavel Rychlý
SIGMOD
2009
ACM
140views Database» more  SIGMOD 2009»
14 years 8 months ago
Distributed data-parallel computing using a high-level programming language
The Dryad and DryadLINQ systems offer a new programming model for large scale data-parallel computing. They generalize previous execution environments such as SQL and MapReduce in...
Michael Isard, Yuan Yu
WWW
2005
ACM
14 years 8 months ago
Three-level caching for efficient query processing in large Web search engines
Large web search engines have to answer thousands of queries per second with interactive response times. Due to the sizes of the data sets involved, often in the range of multiple...
Xiaohui Long, Torsten Suel
GRID
2003
Springer
14 years 27 days ago
Applying Database Support for Large Scale Data Driven Science in Distributed Environments
There is a rapidly growing set of applications, referred to as data driven applications, in which analysis of large amounts of data drives the next steps taken by the scientist, e...
Sivaramakrishnan Narayanan, Ümit V. Ça...
ISMB
1998
13 years 9 months ago
Automated Clustering and Assembly of Large EST Collections
The avMlability of large EST(Expressed Sequence Tag)databases has led to a revolution in the waynew genes are cloned. Difficulties arise, however,due to high error rates and redun...
David P. Yee, Darrell Conklin