Sciweavers

471 search results - page 19 / 95
» MapReduce: Simplified Data Processing on Large Clusters
Sort
View
102
Voted
TSD
2001
Springer
15 years 4 months ago
Finding Semantically Related Words in Large Corpora
The paper deals with the linguistic problem of fully automatic grouping of semantically related words. We discuss the measures of semantic relatedness of basic word forms and descr...
Pavel Smrz, Pavel Rychlý
189
Voted
SIGMOD
2009
ACM
140views Database» more  SIGMOD 2009»
16 years 19 days ago
Distributed data-parallel computing using a high-level programming language
The Dryad and DryadLINQ systems offer a new programming model for large scale data-parallel computing. They generalize previous execution environments such as SQL and MapReduce in...
Michael Isard, Yuan Yu
104
Voted
WWW
2005
ACM
16 years 1 months ago
Three-level caching for efficient query processing in large Web search engines
Large web search engines have to answer thousands of queries per second with interactive response times. Due to the sizes of the data sets involved, often in the range of multiple...
Xiaohui Long, Torsten Suel
GRID
2003
Springer
15 years 5 months ago
Applying Database Support for Large Scale Data Driven Science in Distributed Environments
There is a rapidly growing set of applications, referred to as data driven applications, in which analysis of large amounts of data drives the next steps taken by the scientist, e...
Sivaramakrishnan Narayanan, Ümit V. Ça...
100
Voted
ISMB
1998
15 years 1 months ago
Automated Clustering and Assembly of Large EST Collections
The avMlability of large EST(Expressed Sequence Tag)databases has led to a revolution in the waynew genes are cloned. Difficulties arise, however,due to high error rates and redun...
David P. Yee, Darrell Conklin