Occurs check

Algorithm component in computer science

In computer science, the occurs check is a part of algorithms for syntactic unification. It causes unification of a variable V and a structure S to fail if S contains V.
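As a minimal sketch of where the check sits inside a unifier, the following Python code represents variables as `Var` objects, compound terms as tuples `(functor, arg1, ...)`, and bindings as a dictionary. The representation and the names `Var`, `walk`, `occurs` and `unify` are assumptions made for this illustration, not part of any particular Prolog system.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class Var:
    name: str

def walk(t, subst):
    """Follow variable bindings until an unbound variable or a non-variable is reached."""
    while isinstance(t, Var) and t in subst:
        t = subst[t]
    return t

def occurs(v, t, subst):
    """Return True if variable v occurs anywhere inside term t under subst."""
    t = walk(t, subst)
    if t == v:
        return True
    if isinstance(t, tuple):
        return any(occurs(v, arg, subst) for arg in t[1:])
    return False

def unify(a, b, subst=None, occurs_check=True):
    """Return a substitution (dict) that unifies a and b, or None on failure."""
    subst = {} if subst is None else subst
    a, b = walk(a, subst), walk(b, subst)
    if a == b:
        return subst
    if isinstance(a, Var):
        # The occurs check: refuse to bind V to a structure S that contains V.
        if occurs_check and occurs(a, b, subst):
            return None
        return {**subst, a: b}
    if isinstance(b, Var):
        return unify(b, a, subst, occurs_check)
    if (isinstance(a, tuple) and isinstance(b, tuple)
            and a[0] == b[0] and len(a) == len(b)):
        for x, y in zip(a[1:], b[1:]):
            subst = unify(x, y, subst, occurs_check)
            if subst is None:
                return None
        return subst
    return None

X, Y = Var("X"), Var("Y")
print(unify(X, ("f", X)))                      # None: X occurs in f(X)
print(unify(X, ("f", X), occurs_check=False))  # binds X to f(X), a cyclic binding
print(unify(("f", X, "a"), ("f", "b", Y)))     # binds X to "b" and Y to "a"
```

With `occurs_check=False` this sketch can recurse without bound once cyclic bindings exist, which is the looping behaviour the check is meant to rule out.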

Application in theorem proving

In theorem proving, unification without the occurs check can lead to unsound inference. For example, the Prolog goal $X = f(X)$ will succeed, binding X to a cyclic structure which has no counterpart in the Herbrand universe. As another example, without occurs-check, a resolution proof can be found for the non-theorem $(\forall x \exists y.\, p(x,y)) \rightarrow (\exists y \forall x.\, p(x,y))$: the negation of that formula has the conjunctive normal form $p(X,f(X)) \land \lnot p(g(Y),Y)$, with $f$ and $g$ denoting the Skolem function for the first and second existential quantifier, respectively. Without occurs check, the literals $p(X,f(X))$ and $p(g(Y),Y)$ are unifiable, producing the refuting empty clause.
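The second example can be traced step by step. The following is a sketch of the unification attempt on the two literals, writing $\doteq$ for an equation still to be solved and $\mapsto$ for a binding (both notations are assumptions of this sketch):

```latex
\begin{aligned}
& p(X, f(X)) \doteq p(g(Y), Y) \\
\Rightarrow\ & X \doteq g(Y),\quad f(X) \doteq Y && \text{(decompose } p\text{)} \\
\Rightarrow\ & X \mapsto g(Y),\quad f(g(Y)) \doteq Y && \text{(bind } X\text{)} \\
\Rightarrow\ & \text{fail, since } Y \text{ occurs in } f(g(Y)) && \text{(occurs check)}
\end{aligned}
```

Skipping the last step instead binds $Y \mapsto f(g(Y))$, which stands for the infinite term $f(g(f(g(\ldots))))$. That term has no counterpart in the Herbrand universe, so the refutation built on it is unsound.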

Figure: Cycle by omitted occurs check

Rational tree unification

Prolog implementations usually omit the occurs check for reasons of efficiency, which can lead to circular data structures and looping. By not performing the occurs check, the worst case complexity of unifying a term $t_1$ with term $t_2$ is reduced in many cases from $O(\text{size}(t_1)+\text{size}(t_2))$ to $O(\min(\text{size}(t_1),\text{size}(t_2)))$; in the particular, frequent case of variable-term unifications, runtime shrinks to $O(1)$.
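The variable-term case can be illustrated with a small Python sketch that counts how many nodes of the term are inspected when a variable is bound. The names `Var`, `bind` and `make_list` are hypothetical, terms are nested tuples `(functor, arg1, ...)`, and for brevity the check looks only at the term itself, not at earlier bindings.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class Var:
    name: str

def bind(var, term, bindings, occurs_check):
    """Bind var to term in place.

    Returns the number of term nodes inspected, or None if the occurs
    check rejects the binding.
    """
    inspected = 0
    if occurs_check:
        stack = [term]
        while stack:
            node = stack.pop()
            inspected += 1
            if node == var:
                return None  # var occurs inside term
            if isinstance(node, tuple):
                stack.extend(node[1:])
    bindings[var] = term  # a single dictionary store, independent of term size
    return inspected

def make_list(n):
    """Build the term cons(0, cons(1, ... cons(n-1, nil))) as nested tuples."""
    term = "nil"
    for i in reversed(range(n)):
        term = ("cons", i, term)
    return term

X = Var("X")
big = make_list(100)  # 100 cons cells + 100 integers + nil = 201 nodes
print(bind(X, big, {}, occurs_check=False))  # 0 nodes inspected
print(bind(X, big, {}, occurs_check=True))   # 201 nodes inspected
```

Without the check the binding costs the same regardless of the size of the term; with it, the whole term is traversed.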

Modern implementations, based on Colmerauer's Prolog II, use rational tree unification to avoid looping. However, it is difficult to keep the time complexity linear in the presence of cyclic terms. Examples where Colmerauer's algorithm becomes quadratic can be readily constructed, but refinement proposals exist.

See image for an example run of the unification algorithm given in "Unification (computer science)", section "A unification algorithm", trying to solve the goal $cons(x,y) \stackrel{?}{=} cons(1,cons(x,cons(2,y)))$, but without the occurs check rule (named "check" there); applying rule "eliminate" instead leads to a cyclic graph (i.e. an infinite term) in the last step.
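The infinite term behind that cyclic graph can be made visible with a short Python sketch. Decomposing the goal gives the bindings x ↦ 1 and y ↦ cons(x, cons(2, y)), which are written out by hand below; `Var` and `unfold` are hypothetical helper names, and terms are nested tuples `(functor, arg1, ...)`.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class Var:
    name: str

def unfold(t, bindings, depth):
    """Expand bindings inside t, cutting off with '...' after `depth` nested structures."""
    while isinstance(t, Var) and t in bindings:
        t = bindings[t]
    if isinstance(t, Var):
        return t.name
    if isinstance(t, tuple):
        if depth == 0:
            return "..."
        args = ", ".join(unfold(a, bindings, depth - 1) for a in t[1:])
        return f"{t[0]}({args})"
    return str(t)

X, Y = Var("x"), Var("y")

# Bindings obtained without the occurs check: y is bound to a term containing y.
bindings = {X: 1, Y: ("cons", X, ("cons", 2, Y))}

print(unfold(Y, bindings, 4))  # cons(1, cons(2, cons(1, cons(2, ...))))
```

Any depth limit yields only a finite prefix; without the limit, `unfold` would never terminate on `y`.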

Sound unification

ISO Prolog implementations have the built-in predicate unify_with_occurs_check/2 for sound unification but are free to use unsound or even looping algorithms when unification is invoked otherwise, provided the algorithm works correctly for all cases that are "not subject to occurs-check" (NSTO). The built-in acyclic_term/1 serves to check the finiteness of terms.
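The idea of a finiteness test can be sketched in Python. This is only a loose analogue of what acyclic_term/1 is for, not how any Prolog system implements it; `Var`, `is_acyclic` and the dictionary of bindings are assumptions made for this illustration.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class Var:
    name: str

def is_acyclic(t, bindings, in_progress=frozenset()):
    """Return True if t denotes a finite term under the given bindings.

    in_progress holds the variables whose bindings are currently being
    expanded; meeting one of them again means the term contains itself.
    """
    if isinstance(t, Var):
        if t not in bindings:
            return True
        if t in in_progress:
            return False
        return is_acyclic(bindings[t], bindings, in_progress | {t})
    if isinstance(t, tuple):
        return all(is_acyclic(arg, bindings, in_progress) for arg in t[1:])
    return True  # constants are always finite

X, Y = Var("X"), Var("Y")
print(is_acyclic(X, {X: ("f", X)}))                               # False
print(is_acyclic(("p", X, Y), {X: ("g", Y), Y: ("f", X)}))        # False
print(is_acyclic(("p", X, X), {X: ("f", Y), Y: "a"}))             # True
```

Sharing a subterm, as in the last call, is not a cycle, so only variables on the current expansion path are tracked.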

Implementations offering sound unification for all unifications are Qu-Prolog and Strawberry Prolog and, optionally via a runtime flag, XSB, SWI-Prolog, Tau Prolog, Trealla Prolog and Scryer Prolog. A variety of optimizations can render sound unification feasible for common cases.

See also

W.P. Weijland (1990). "Semantics for Logic Programs without Occur Check". Theoretical Computer Science. 71: 155–174. doi:10.1016/0304-3975(90)90194-m.

Notes

  1. Some Prolog manuals state that the complexity of unification without occurs check is $O(\min(\text{size}(t_1),\text{size}(t_2)))$ (in all cases). This is incorrect, as it would imply comparing arbitrary ground terms in constant time (by unifying $eq(t_1,t_2)$ with $eq(X,X)$).

References

  1. David A. Duffy (1991). Principles of Automated Theorem Proving. Wiley. p. 143.
  2. Informally, and taking $p(x,y)$ to mean e.g. "x loves y", the formula reads "If everybody loves somebody, then a single person must exist that is loved by everyone."
  3. F. Pereira; D. Warren; D. Bowen; L. Byrd; L. Pereira (1983). C-Prolog's User's Manual Version 1.2 (Technical report). SRI International. Retrieved 21 June 2013.
  4. A. Colmerauer (1982). K.L. Clark; S.-A. Tarnlund (eds.). Prolog and Infinite Trees. Academic Press.
  5. M.H. van Emden; J.W. Lloyd (1984). "A Logical Reconstruction of Prolog II". Journal of Logic Programming. 2: 143–149.
  6. Joxan Jaffar; Peter J. Stuckey (1986). "Semantics of Infinite Tree Logic Programming". Theoretical Computer Science. 46: 141–158. doi:10.1016/0304-3975(86)90027-7.
  7. B. Courcelle (1983). "Fundamental Properties of Infinite Trees". Theoretical Computer Science. 25 (2): 95–169. doi:10.1016/0304-3975(83)90059-2.
  8. Alberto Martelli; Gianfranco Rossi (1984). Efficient Unification with Infinite Terms in Logic Programming (PDF). The International Conference on Fifth Generation Computer Systems.
  9. 7.3.4 Normal unification in Prolog of ISO/IEC 13211-1:1995.
  10. Ritu Chadha; David A. Plaisted (1994). "Correctness of unification without occur check in prolog". The Journal of Logic Programming. 18 (2): 99–122. doi:10.1016/0743-1066(94)90048-5.
  11. Thomas Prokosch; François Bry (2020). Unification on the Run (PDF). The 34th International Workshop on Unification. pp. 13:1–13:5.