Network-attached disk storage is characterized by independent network attachment and embedded intelligence. For Internet applications, it provides the key functionality of geographical replication and intelligent retrieval of data objects. This paper describes a latency-reducing method based on the relative interconnectivity between data objects. We follow the locality-of-reference principle to partition interrelated data objects onto nearby disk areas or network storage nodes. The method incorporates a clustering algorithm to support smarter placement of related objects and read-ahead group caching. Objects that are associated with one another are clustered in the same group and can be read from disk and cached together. The proposed clustering and caching algorithms do not use floating-point arithmetic, allowing direct and fast implementation on a variety of disk controllers.

Categories and Subject Descriptors: H.3 [Information Storage and Retrieval]: H.3.3 [Information Search and Retrieval]

General Terms: A...
Iliya K. Georgiev, Ivo I. Georgiev
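
The abstract does not give the clustering or caching algorithms themselves, so the following is only a minimal illustrative sketch of the general idea, not the authors' method. It assumes that interconnectivity can be approximated by integer co-access counts, that two objects belong together once their count reaches an integer threshold, and that groups are formed with a simple union-find merge. The names `cluster_objects`, `GroupCache`, `access_sessions`, `threshold`, and `read_object` are all hypothetical. Only integer arithmetic is used, in keeping with the stated no-floating-point constraint.

```python
from collections import defaultdict


def cluster_objects(access_sessions, threshold):
    """Group objects whose integer co-access count reaches `threshold`.

    Assumption: each session is a list of object names accessed together.
    """
    co_access = defaultdict(int)  # (a, b) -> number of shared sessions
    parent = {}                   # union-find forest over object names

    for session in access_sessions:
        items = sorted(set(session))
        for obj in items:
            parent.setdefault(obj, obj)
        for i, a in enumerate(items):
            for b in items[i + 1:]:
                co_access[(a, b)] += 1

    def find(x):
        # Follow parent links to the group root, compressing the path.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    # Merge the groups of every pair that is accessed together often enough.
    for (a, b), count in co_access.items():
        if count >= threshold:
            root_a, root_b = find(a), find(b)
            if root_a != root_b:
                parent[max(root_a, root_b)] = min(root_a, root_b)

    groups = defaultdict(list)
    for obj in parent:
        groups[find(obj)].append(obj)
    return {root: sorted(members) for root, members in groups.items()}


class GroupCache:
    """Read-ahead cache: a miss on one object loads its whole group."""

    def __init__(self, groups, read_object):
        self.group_of = {
            obj: members for members in groups.values() for obj in members
        }
        self.read_object = read_object  # hypothetical per-object disk read
        self.cache = {}
        self.group_fetches = 0          # number of misses that went to disk

    def get(self, obj):
        if obj not in self.cache:
            self.group_fetches += 1
            for member in self.group_of.get(obj, [obj]):
                self.cache[member] = self.read_object(member)
        return self.cache[obj]
```

In this sketch, a request for one member of a group triggers a single fetch that also caches the other members, so later requests for related objects are served from memory. A real controller would additionally bound the cache size and evict whole groups, which is omitted here.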