Sciweavers

EUROSYS
2013
ACM

TimeStream: reliable stream computation in the cloud

10 years 8 months ago
TimeStream: reliable stream computation in the cloud
TimeStream is a distributed system designed specifically for low-latency continuous processing of big streaming data on a large cluster of commodity machines. The unique characteristics of this emerging application domain have led to a significantly different design from the popular MapReducestyle batch data processing. In particular, we advocate a new abstraction called resilient substitution that caters to the specific needs in this new computation model to handle failure recovery and dynamic reconfiguration in response to load changes. Several real-world applications running on our prototype have been shown to scale robustly with low latency while at the same time maintaining the simple and concise declarative programming model. TimeStream handles an on-line advertising aggregation pipeline at a rate of 700,000 URLs per second with a 2-second delay, while performing sentiment analysis of Twitter data at a peak rate close to 10,000 tweets per second, with approximately 2second d...
Zhengping Qian, Yong He, Chunzhi Su, Zhuojie Wu, H
Added 28 Apr 2014
Updated 28 Apr 2014
Type Journal
Year 2013
Where EUROSYS
Authors Zhengping Qian, Yong He, Chunzhi Su, Zhuojie Wu, Hongyu Zhu, Taizhi Zhang, Lidong Zhou, Yuan Yu, Zheng Zhang
Comments (0)