WARP: Workload-aware replication and partitioning for RDF

K. Hose; R. Schenkel

doi:10.1109/ICDEW.2013.6547414

2013 IEEE 29th International Conference on Data Engineering Workshops (ICDEW 2013)

WARP: Workload-aware replication and partitioning for RDF

Year: 2013, Pages: 1-6

DOI Bookmark: 10.1109/ICDEW.2013.6547414

Authors

K. Hose, Dept. of Comput. Sci., Aalborg Univ., Aalborg, Denmark
R. Schenkel, Max Planck Inst. for Inf., Saarbrucken, Germany

Abstract

With the increasing popularity of the Semantic Web, more and more data becomes available in RDF with SPARQL as a query language. Data sets, however, can become too big to be managed and queried on a single server in a scalable way. Existing distributed RDF stores approach this problem using data partitioning, aiming at limiting the communication between servers and exploiting parallelism. This paper proposes a distributed SPARQL engine that combines a graph partitioning technique with workload-aware replication of triples across partitions, enabling efficient query execution even for complex queries from the workload. Furthermore, it discusses query optimization techniques for producing efficient execution plans for ad-hoc queries not contained in the workload.

Like what you’re reading?

Already a member?

Get this article FREE with a new membership!

Query Execution for RDF Data Using Structure Indexed Vertical Partitioning
2015 IEEE International Parallel and Distributed Processing Symposium Workshop (IPDPSW)
Fast Processing SPARQL Queries on Large RDF Data
2016 IEEE 14th Intl Conf on Dependable, Autonomic and Secure Computing, 14th Intl Conf on Pervasive Intelligence and Computing, 2nd Intl Conf on Big Data Intelligence and Computing and Cyber Science and Technology Congress(DASC/PiCom/DataCom/CyberSciTech)
Efficient RDF Representation and Parallel Join Processing Algorithm on General Purpose Many-Core
2016 International Symposium on Computer, Consumer and Control (IS3C)
RDF Data Storage Techniques for Efficient SPARQL Query Processing Using Distributed Computation Engines
2018 IEEE International Conference on Information Reuse and Integration (IRI)
High Performance Query Processing for Web Scale RDF Data using BSP Style Communication and Balanced Distribution
2017 46th International Conference on Parallel Processing (ICPP)
Job-Optimized Map-Side Join Processing Using MapReduce and HBase with Abstract RDF Data
2015 IEEE / WIC / ACM International Conference on Web Intelligence and Intelligent Agent Technology (WI-IAT)
Adaptive Distributed RDF Graph Fragmentation and Allocation based on Query Workload
IEEE Transactions on Knowledge & Data Engineering
Optimizing Keyword Search Over Federated RDF Systems
IEEE Transactions on Big Data
Optimizing Multi-Query Evaluation in Federated RDF Systems
IEEE Transactions on Knowledge & Data Engineering
S3QLRDF: Property Table Partitioning Scheme for Distributed SPARQL Querying of large-scale RDF data
2020 IEEE International Conference on Smart Data Services (SMDS)

WARP: Workload-aware replication and partitioning for RDF

Authors

Abstract

Related Articles