Parallel Materialization of the Finite RDFS Closure for Hundreds of Millions of Triples

In this paper, we consider the problem of materializing the complete finite RDFS closure in a scalable manner; this includes those parts of the RDFS closure that are often ignored such as literal generalization and container membership properties. We point out characteristics of RDFS that allow us to derive an embarrassingly parallel algorithm for producing said closure, and we evaluate our C/MPI implementation of the algorithm on a cluster with 128 cores using different-size subsets of the LUBM 10,000-university data set. We show that the time to produce inferences scales linearly with the number of processes, evaluating this behavior on up to hundreds of millions of triples. We also show the number of inferences produced for different subsets of LUBM10k. To the best of our knowledge, our work is the first to provide RDFS inferencing on such large data sets in such low times. Finally, we discuss future work in terms of promising applications of this approach including OWL2RL rules...
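
To make the "embarrassingly parallel" idea concrete, here is a minimal Python sketch. It is not the authors' C/MPI implementation, and the abstract does not spell out the partitioning scheme. The sketch assumes one plausible reading: the small set of schema triples is copied to every worker, the instance triples are split arbitrarily, and each worker applies a few RDFS rules (rdfs2, rdfs3, rdfs7, rdfs9) to its own partition without communicating. All names (`close_partition`, `parallel_closure`, the `ex:` terms) are hypothetical. Literal generalization and container membership properties are left out, and every term is treated as a plain string.

```python
from concurrent.futures import ThreadPoolExecutor

TYPE = "rdf:type"
SUBCLASS = "rdfs:subClassOf"
SUBPROP = "rdfs:subPropertyOf"
DOMAIN = "rdfs:domain"
RANGE = "rdfs:range"
SCHEMA_PREDICATES = {SUBCLASS, SUBPROP, DOMAIN, RANGE}


def transitive_closure(pairs):
    """Return the transitive closure of a set of (a, b) pairs."""
    closure = set(pairs)
    changed = True
    while changed:
        changed = False
        for a, b in list(closure):
            for c, d in list(closure):
                if b == c and (a, d) not in closure:
                    closure.add((a, d))
                    changed = True
    return closure


def close_partition(instance_triples, schema):
    """Apply rdfs7, rdfs2, rdfs3 and rdfs9 to one partition.

    Only the shared schema is consulted, so no other partition is needed
    (assumption: schema triples never appear in the instance data).
    """
    sub_class = transitive_closure({(s, o) for s, p, o in schema if p == SUBCLASS})
    sub_prop = transitive_closure({(s, o) for s, p, o in schema if p == SUBPROP})
    domain = {(s, o) for s, p, o in schema if p == DOMAIN}
    rng = {(s, o) for s, p, o in schema if p == RANGE}

    inferred = set(instance_triples)
    # rdfs7: (s p o), (p subPropertyOf q) => (s q o)
    for s, p, o in list(inferred):
        for sub, sup in sub_prop:
            if p == sub:
                inferred.add((s, sup, o))
    # rdfs2 / rdfs3: domain and range give types to subjects and objects
    for s, p, o in list(inferred):
        for prop, cls in domain:
            if p == prop:
                inferred.add((s, TYPE, cls))
        for prop, cls in rng:
            if p == prop:
                inferred.add((o, TYPE, cls))
    # rdfs9: (s type C), (C subClassOf D) => (s type D)
    for s, p, o in list(inferred):
        if p == TYPE:
            for sub, sup in sub_class:
                if o == sub:
                    inferred.add((s, TYPE, sup))
    return inferred


def parallel_closure(triples, workers=2):
    """Split instance triples across workers; every worker sees the full schema."""
    schema = [t for t in triples if t[1] in SCHEMA_PREDICATES]
    instance = [t for t in triples if t[1] not in SCHEMA_PREDICATES]
    partitions = [instance[i::workers] for i in range(workers)]
    with ThreadPoolExecutor(max_workers=workers) as pool:
        results = pool.map(lambda part: close_partition(part, schema), partitions)
    closure = set(schema)
    for partial in results:
        closure |= partial
    return closure
```

In this sketch the union of the per-partition results does not depend on how the instance triples are split, because each of the four rules joins one instance triple with schema triples only. The threads stand in for the independent processes; the sketch also does not add the inferred schema triples (for example, transitive `rdfs:subClassOf` links) to the output.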

Jesse Weaver, James A. Hendler

Data Sets | Internet Technology | RDFS Closure | SEMWEB 2009 | Finite RDFS Closure

Post Info

Added	27 May 2010
Updated	27 May 2010
Type	Conference
Year	2009
Where	SEMWEB
Authors	Jesse Weaver, James A. Hendler
