Convergence of datalog over (pre-) semirings

M Abo Khamis, HQ Ngo, R Pichler, D Suciu… - Journal of the ACM, 2024 - dl.acm.org
Recursive queries have been traditionally studied in the framework of datalog, a language that restricts recursion to monotone queries over sets, which is guaranteed to converge in polynomial time in the size of the input. But modern big data systems require recursive computations beyond the Boolean space. In this article, we study the convergence of datalog when it is interpreted over an arbitrary semiring. We consider an ordered semiring, define the semantics of a datalog program as a least fixpoint in this semiring, and study the number of steps required to reach that fixpoint, if ever. We identify algebraic properties of the semiring that correspond to certain convergence properties of datalog programs. Finally, we describe a class of ordered semirings on which one can use the semi-naïve evaluation algorithm on any datalog program.
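
As a rough illustration of the least-fixpoint semantics the abstract describes, the sketch below runs naive evaluation of a single-source path program over the tropical semiring (min, +). This is not code from the article: the choice of semiring, the example rule, and the names `plus`, `times`, `naive_fixpoint`, and `max_steps` are all assumptions made for illustration.

```python
import math

# Tropical semiring (min, +), chosen here only as an example:
# semiring "addition" is min (identity +inf), "multiplication" is + (identity 0).
ZERO = math.inf
ONE = 0.0


def plus(a, b):
    return min(a, b)


def times(a, b):
    return a + b


def naive_fixpoint(edges, nodes, source, max_steps=1000):
    """Naive evaluation of the (hypothetical) rule
           T(y) :- [y = source]  plus  SUM_x T(x) times E(x, y)
    starting from the all-ZERO assignment.
    Returns (T, steps), where steps is the number of rule applications
    needed before the assignment stops changing."""
    t = {v: ZERO for v in nodes}
    for step in range(1, max_steps + 1):
        new_t = {}
        for y in nodes:
            acc = ONE if y == source else ZERO
            for (x, z), w in edges.items():
                if z == y:
                    acc = plus(acc, times(t[x], w))
            new_t[y] = acc
        if new_t == t:  # fixpoint reached: this application changed nothing
            return t, step - 1
        t = new_t
    # The iteration need not stabilise in every semiring or on every input.
    raise RuntimeError("no fixpoint within max_steps")


edges = {("a", "b"): 1.0, ("b", "c"): 2.0, ("a", "c"): 5.0}
dist, steps = naive_fixpoint(edges, ["a", "b", "c"], "a")
```

On this three-node input, `dist` ends up as the shortest-path weights from `a` and `steps` is 3. If the edge weights contained a negative cycle reachable from the source, the loop would keep lowering values and hit the `max_steps` guard, which is one concrete way a fixpoint can fail to be reached.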