The Document Object Model (DOM) is a widely used approach for retrieving data from an XML document. If the XML document is very large, however, DOM-based retrieval may run out of main memory while building the associated XML tree. To alleviate this problem, we propose a method that splits the very large XML document into n small XML documents, retrieves data from the XML tree built from each of these small documents, and combines the results from all n XML trees to generate the final result. With this approach, the memory space and processing time required to retrieve data from the very large XML document using DOM are reduced so that they can be managed by a single general-purpose personal computer.

Categories and Subject Descriptors
I.7.1 [Document and Text Processing]: Document management; F.2.2 [Nonnumerical Algorithms and Problems]: Computations on discrete structures...
Seung Min Kim, Suk I. Yoo, Eunji Hong, Tae Gwon Ki
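
The split, retrieve, and combine steps described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not say how the document is split, so the sketch assumes the large document is a root element holding many record-like children, and it streams those records into small documents of a fixed size. The names `split_xml`, `query_chunk`, `retrieve`, `record_tag`, and `chunk_size` are hypothetical, and namespaces and root attributes are ignored for brevity.

```python
import io
import xml.etree.ElementTree as ET
from xml.dom import minidom


def split_xml(source, record_tag, chunk_size):
    """Yield small XML documents (strings) holding up to chunk_size records each.

    Assumption: records are direct children of the root element. The input is
    streamed with iterparse, so the full tree is never held in memory at once.
    """
    root = None
    records = []
    for event, elem in ET.iterparse(source, events=("start", "end")):
        if event == "start" and root is None:
            root = elem  # the first start event is the root element
        elif event == "end" and elem.tag == record_tag:
            records.append(ET.tostring(elem, encoding="unicode"))
            root.clear()  # discard records that were already serialized
            if len(records) == chunk_size:
                yield f"<{root.tag}>{''.join(records)}</{root.tag}>"
                records = []
    if records:  # last, possibly shorter, chunk
        yield f"<{root.tag}>{''.join(records)}</{root.tag}>"


def query_chunk(chunk_xml, tag):
    """Build a DOM tree for one small document and collect the text of <tag> nodes."""
    dom = minidom.parseString(chunk_xml)
    try:
        return [
            node.firstChild.data
            for node in dom.getElementsByTagName(tag)
            if node.firstChild is not None
            and node.firstChild.nodeType == node.TEXT_NODE
        ]
    finally:
        dom.unlink()  # release the small tree before the next one is built


def retrieve(source, record_tag, tag, chunk_size):
    """Combine the per-chunk results into the final result, in document order."""
    results = []
    for chunk in split_xml(source, record_tag, chunk_size):
        results.extend(query_chunk(chunk, tag))
    return results


# Tiny stand-in for a very large document: five <book> records.
sample = "<library>" + "".join(
    f"<book><title>T{i}</title></book>" for i in range(5)
) + "</library>"
titles = retrieve(io.BytesIO(sample.encode("utf-8")), "book", "title", chunk_size=2)
print(titles)  # ['T0', 'T1', 'T2', 'T3', 'T4']
```

In this sketch only one small DOM tree exists at any time, so peak memory depends on `chunk_size` rather than on the size of the whole document. Queries that span record boundaries would need extra handling that the sketch does not attempt.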