Title Compact Representation of Large RDF Data Sets for Publishing and Exchange
Authors Javier Fernández, Miguel Martínez-Prieto, Claudio Gutierrez
Publication date 2010
Abstract Increasingly huge RDF data sets are being published on the Web. Currently, they use different syntaxes of RDF, contain high levels of redundancy and have a plain indivisible structure. All this leads to fuzzy publications, inefficient management, complex processing and lack of scalability. This paper presents a novel RDF representation (HDT) which takes advantage of the structural properties of RDF graphs for splitting and representing, efficiently, three components of RDF data: Header, Dictionary and Triples structure. On-demand management operations can be implemented on top of HDT representation. Experiments show that data sets can be compacted in HDT by more than fifteen times the current naive representation, improving parsing and processing while keeping a consistent publication scheme. For exchanging, specific compression techniques over HDT improve current compression solutions.
Pages 193-208
Conference name International Semantic Web Conference