View publication
Title | A Taxonomy of Basic Graph Pattern Motifs for Understanding SPARQL Query Logs |
Authors | Jaime Salas, Aidan Hogan |
Publication date | 2023 |
Abstract | Popular SPARQL query endpoints hosted by open knowledge graphs such as Wikidata and DBpedia process hundreds of thousands or even millions of queries per day. Making sense of queries at this scale is challenging. We propose a taxonomy of basic graph patterns (BGPs) in order to induce a hierarchical structure from such patterns found in a large query log. The leaves of this taxonomy are the raw basic graph patterns extracted from each query of the log. Each layer thereafter applies a generalisation step followed by a canonicalisation step, with each layer representing an increasingly coarse partition based on an increasingly more general motif. Generalisations are applied for constant subjects/objects (nodes), constant predicates (edge labels), direction, constant/variable distinction, and homomorphic equivalence. We discuss use-cases, define these generalisation steps, and apply them to induce a taxonomy of BGPs from a subset of the Wikidata query log. |
Pages | 1-11 |
Conference name | Alberto Mendelzon International Workshop on Foundations of Data Management |
Publisher | CEUR Publications |
Reference URL |