The Compression of Indexed Data and Fast Search for Large RDF Graphs

1 January 2018

journal article
Published by Japanese Society for Artificial Intelligence in Transactions of the Japanese Society for Artificial Intelligence

Vol. 33 (2), E-H43_1-H43_1
https://doi.org/10.1527/tjsai.e-h43

Abstract

In the area of the Semantic Web, RDF datastores are required to search for metadata quickly from large scale RDF data, such as Wikidata and DBpedia in the Linked Open Data (LOD). This paper presents compressed index structures and URI dictionaries of RDF data in order to develop a fast in-memory RDF database system (called FROST). Instead of the complete six types of indexes SPO, SOP, PSO, POS, OSP, and OPS in RDF triples, FROST employs the two types of indexes SPO and OPS that enable us to compactly store RDF triples in the memory. Using the compressed index structures, we develop a fast search method in the datastore system FROST that solves SPARQL queries and returns the query answers from RDF graphs. Our experiments show that (i) FROST outperforms the inmemory RDF frameworks Jena and RDF4J with respect to both fast query processing and saved memory, using the datasets and queries of the LUBM (a benchmarking framework for semantic repositories) and BMDB (RDF Store Benchmarks with DBpedia) benchmarks, and (ii) FROST outperforms the on-disk RDF store Virtuoso with respect to fast query processing, using the LUBM benchmark.

Keywords

This publication has 3 references indexed in Scilit:

Exchange and Consumption of Huge RDF Data
Lecture Notes in Computer Science, 2012
LUBM: A benchmark for OWL knowledge base systems
Journal of Web Semantics, 2005
Uniform Resource Identifier (URI): Generic Syntax
Published by RFC Editor ,2005