Towards A Scalable Semantic-based Distributed Approach for SPARQL query evaluation

Over the last two decades, the amount of data which has been created, published and managed using Semantic Web standards and especially via Resource Description Framework (RDF) has been increasing.As a result, efficient processing of such big RDF datasets has become challenging.Indeed, these processes require, both efficient storage strategies and query-processing engines, to be able to scale in terms of data size.In this study, we propose a scalable approach to evaluate SPARQL queries over distributed RDF datasets using a semantic-based partition and is implemented inside the state-of-the-art RDF processing framework: SANSA.An evaluation of the performance of our approach in processing large-scale RDF datasets is also presented.The preliminary results of the conducted experiments show that our approach can scale horizontally and perform well as compared with the previous Hadoop-based system.It is also comparable with the in-memory SPARQL query evaluators when there is less shuffling involved.

Speakers:

Jens Lehmann

University of Bonn
https://www.informatik.uni-bonn.de/de

Gezim Sejdiu

PhD Student / Research Associate

University of Bonn
https://www.informatik.uni-bonn.de/de

PhD Student & Research Associate at the University of Bonn, Smart Data Analytics (SDA). My research interest are in the area of Semantic Web, Big Data and Machine Learning. I am also interested in the area of distributed computing systems (Apache Spark, Apache Flink).

Hajira Jabeen

Dr.

University of Bonn
https://www.informatik.uni-bonn.de/de

Search form

Towards A Scalable Semantic-based Distributed Approach for SPARQL query evaluation

Speakers:

Jens Lehmann

Gezim Sejdiu

Damien Graux

Imran Khan

Ioanna Lytra

Hajira Jabeen

Interested in this talk?