Efficient Parallel Dictionary Encoding for RDF Data

From International Center for Computational Logic
Revision as of 26 October 2014, 19:29 by Long Cheng (page created)

Long Cheng, Spyros Kotoulas, Tomas E. Ward, Georgios Theodoropoulos
Efficient Parallel Dictionary Encoding for RDF Data
Proc. 17th International Workshop on the Web and Databases (WebDB'14), 1519-1527, November 2014. ACM
  • Abstract
    The Semantic Web comprises enormous volumes of semi-structured data elements. For interoperability, these elements are represented by long strings. Such representations are not efficient for the purposes of Semantic Web applications that perform computations over large volumes of information. A typical method for alleviating the impact of this problem is through the use of compression methods that produce more compact representations of the data. The use of dictionary encoding for this purpose is particularly prevalent in Semantic Web database systems. However, centralized implementations present performance bottlenecks, giving rise to the need for scalable, efficient distributed encoding schemes. In this paper, we describe a straightforward but very efficient encoding algorithm and evaluate its performance on a cluster of up to 384 cores and datasets of up to 11 billion triples (1.9 TB). Compared to the state-of-the-art MapReduce algorithm, we demonstrate a speedup of 2.6-7.4x and excellent scalability.
  • Further Information: Link
  • Research Group: Knowledge-Based Systems
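
As a rough illustration of the dictionary encoding idea mentioned in the abstract, the following is a minimal serial sketch in Python. It is not the parallel algorithm described in the paper; the function names, the first-seen ID assignment, and the sample `ex:` terms are all assumptions made for illustration.

```python
from typing import Dict, List, Tuple

Triple = Tuple[str, str, str]
EncodedTriple = Tuple[int, int, int]


def encode_triples(triples: List[Triple]) -> Tuple[List[EncodedTriple], Dict[str, int]]:
    """Replace every RDF term with an integer ID (hypothetical helper).

    IDs are assigned in order of first appearance; this is an assumption,
    real systems may use hashing or partitioned ID ranges instead.
    """
    dictionary: Dict[str, int] = {}
    encoded: List[EncodedTriple] = []
    for s, p, o in triples:
        ids = []
        for term in (s, p, o):
            if term not in dictionary:
                # New term: give it the next unused integer ID.
                dictionary[term] = len(dictionary)
            ids.append(dictionary[term])
        encoded.append((ids[0], ids[1], ids[2]))
    return encoded, dictionary


def decode_triples(encoded: List[EncodedTriple], dictionary: Dict[str, int]) -> List[Triple]:
    """Invert the dictionary to recover the original string terms."""
    reverse = {term_id: term for term, term_id in dictionary.items()}
    return [(reverse[s], reverse[p], reverse[o]) for s, p, o in encoded]


if __name__ == "__main__":
    sample = [
        ("ex:alice", "ex:knows", "ex:bob"),
        ("ex:bob", "ex:knows", "ex:alice"),
    ]
    encoded, dictionary = encode_triples(sample)
    print(encoded)
    print(decode_triples(encoded, dictionary) == sample)
```

Each long string is stored once in the dictionary, and the triples themselves become small integer tuples. A single shared dictionary like this one is the kind of centralized structure the abstract identifies as a bottleneck at scale.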
@inproceedings{CKWT2014,
  author    = {Long Cheng and Spyros Kotoulas and Tomas E. Ward and Georgios
               Theodoropoulos},
  title     = {Efficient Parallel Dictionary Encoding for {RDF} Data},
  booktitle = {Proc. 17th International Workshop on the Web and Databases
               (WebDB'14)},
  publisher = {ACM},
  year      = {2014},
  month     = {November},
  pages     = {1519-1527},
  doi       = {10.1109/HPCC.and.EUC.2013.214}
}