Buffering and Read-Ahead Strategies for External Mergesort


The elapsed time for external mergesort is normally dominated by I/O time. This paper is focused on reducing I/O time during the merge phase. Three new buffering and readahead strategies are proposed, called equal buffering, extended forecasting and clustering. They exploit the fact that virtually all modern disks perform caching and sequential readahead. The latter two also collect information during run formation (the last key of each run block) which is then used to preplan reading. For random input data, extended forecasting and clustering were found to reduce merge time by 30% compared with traditional double buffering. Clustering exploits any temporal skew in input runs to further reduce the number of seeks.

Authors' current address: Microsoft, One Microsoft Way, Redmond, WA 98052-6399, U.S.A.
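The abstract mentions that the last key of each run block is recorded during run formation and later used to preplan reading. A minimal sketch of that general idea follows, assuming the classic forecasting observation that a run's next block is needed once the last key of its current block has been output. The function name `block_read_order` and the `last_keys` layout are hypothetical, chosen for illustration; this is not the paper's extended forecasting or clustering algorithm, only the precomputed consumption order those strategies could build on.

```python
import heapq


def block_read_order(last_keys):
    """Return (run, block) pairs in the order a merge first needs them.

    last_keys[r][b] is the last (largest) key stored in block b of
    sorted run r, as recorded during run formation (assumed layout).
    """
    # The merge cannot start without the first block of every run.
    order = [(r, 0) for r, keys in enumerate(last_keys) if keys]

    # Block b+1 of run r becomes necessary when block b is exhausted,
    # which happens when the key last_keys[r][b] is output. Processing
    # blocks by ascending last key therefore yields the read order.
    heap = [(keys[0], r, 0) for r, keys in enumerate(last_keys) if len(keys) > 1]
    heapq.heapify(heap)

    while heap:
        _, r, b = heapq.heappop(heap)
        order.append((r, b + 1))
        # Only schedule a successor if block b+1 is not the run's last block.
        if b + 2 < len(last_keys[r]):
            heapq.heappush(heap, (last_keys[r][b + 1], r, b + 1))
    return order


# Three runs with 3, 3 and 2 blocks; values are the blocks' last keys.
example = [[10, 40, 90], [25, 30, 95], [5, 60]]
plan = block_read_order(example)
```

With the whole sequence known before merging begins, a reader could in principle issue reads ahead of need and group requests that hit the same run, which is the kind of preplanning the abstract describes.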

Weiye Zhang, Per-Åke Larson


Database | Equal Buffering | I/O Time | Traditional Double Buffering | VLDB 1998 |


Post Info

Added	06 Aug 2010
Updated	06 Aug 2010
Type	Conference
Year	1998
Where	VLDB
Authors	Weiye Zhang, Per-Åke Larson
