TripleID: A low-overhead representation and querying using GPU for large RDFs

Chantrapornchai C., Choksuchat C., Haidl M., Gorlatch S.

Research article (book contribution) | Peer reviewed

Abstract

Resource Description Framework (RDF) is a commonly used format for semantic web processing. It basically contains strings representing terms and their relationships which can be queried or inferred. RDF is usually a large text file which contains many million relationships. In this work, we propose a framework, TripleID, for processing queries of large RDF data. The framework utilises Graphics Processing Units (GPUs) to search RDF relations. The RDF data is first transformed to the encoded form suitable for storing in the GPU memory. Then parallel threads on the GPU search the required data. We show in the experiments that one GPU on a personal desktop can handle 100 million triple relations, while a traditional RDF processing tool can process up to 10 million triples. Furthermore, we can query sample relations within 0.18 s with the GPU in 7 million triples, while the traditional tool takes at least 6 s for 1.8 million triples.

Details about the publication

PublisherKozielsk S
Book titleBeyond Databases, Architectures and Structures. Advanced Technologies for Data Mining and Knowledge Discovery
Page range400-415
Publishing companySpringer VDI Verlag
Title of seriesCommunications in Computer and Information Science (ISSN: 1865-0929)
Volume of series613
StatusPublished
Release year2016
Language in which the publication is writtenEnglish
ISBN9783319340982
DOI10.1007/978-3-319-34099-9_31
Link to the full texthttps://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=84964766325&origin=inward
KeywordsCUDA; GPU; Parallel processing; Query processing; RDF

Authors from the University of Münster

Gorlatch, Sergei
Professur für Praktische Informatik (Prof. Gorlatch)
Haidl, Michael
Professur für Praktische Informatik (Prof. Gorlatch)