Publications

Stats

View publication

Title The Language of Plain SO-tgds: Composition, Inversion and Structural Properties
Authors Marcelo Arenas, Jorge Pérez, Juan Reutter, Cristian Riveros
Publication date September 2013
Abstract The problems of composing and inverting schema mappings
specified
by source-to-target tuple-generating dependencies (st-tgds) have attracted a
lot of attention, as they are of fundamental importance for the development
of Bernstein's metadata management framework. In the case of the
composition operator, a natural semantics has been proposed and the language
of second-order tuple generating dependencies (SO-tgds) has been identified
as the right language to express it. In the case of the inverse operator,
several semantics have been proposed, most notably the maximum recovery, the
only inverse notion that guarantees that every mapping specified by st-tgds
is invertible. Unfortunately, less attention has been paid to combining both
operators, which is the motivation of this paper. More precisely, we start
our investigation by showing that SO-tgds are not good for inversion, as
there exist mappings specified by SO-tgds that are not invertible under any
of the notions of inversion proposed in the literature. To overcome this
limitation, we borrow the notion of CQ-composition, which is a relaxation
obtained by parameterizing the composition of mappings by the class of
conjunctive queries (CQ), and we propose a restriction over the class of
SO-tgds that gives rise to the language of plain SO-tgds. Then we show that
plain SO-tgds are the right language to express the CQ-composition of
mappings given by st-tgds, in the same sense that SO-tgds are the right
language to express the composition of st-tgds, and we prove that every
mapping specified by a plain SO-tgd admits a maximum recovery, thus showing
that plain SO-tgds have a good behavior w.r.t. inversion. Moreover, we show
that the language of plain SO-tgds shares some fundamental structural
properties with the language of st-tgds, but being much more expressive, and
we provide a polynomial-time algorithm to compute maximum recoveries for
mappings specified by plain SO-tgds (which can also be used to compute
maximum recoveries for mappings given by st-tgds). All these results suggest
that the language of plain SO-tgds is a good alternative to be implemented
in data exchange and data integration applications.
Pages 763-784
Volume 79
Journal name Journal of Computer and System Sciences
Publisher Elsevier Science (Amsterdam, The Netherlands)
Reference URL View reference page