Misplaced Pages

Reaching definition

Article snapshot taken from Wikipedia with creative commons attribution-sharealike license. Give it a read and then ask your questions in the chat. We can research this topic together.

In compiler theory, a reaching definition for a given instruction is an earlier instruction whose target variable can reach (be assigned to) the given one without an intervening assignment. For example, in the following code:

d1 : y := 3
d2 : x := y

d1 is a reaching definition for d2. In the following, example, however:

d1 : y := 3
d2 : y := 4
d3 : x := y

d1 is no longer a reaching definition for d3, because d2 kills its reach: the value defined in d1 is no longer available and cannot reach d3.

As analysis

The similarly named reaching definitions is a data-flow analysis which statically determines which definitions may reach a given point in the code. Because of its simplicity, it is often used as the canonical example of a data-flow analysis in textbooks. The data-flow confluence operator used is set union, and the analysis is forward flow. Reaching definitions are used to compute use-def chains.

The data-flow equations used for a given basic block S {\displaystyle S} in reaching definitions are:

  • R E A C H i n [ S ] = p p r e d [ S ] R E A C H o u t [ p ] {\displaystyle {\rm {REACH}}_{\rm {in}}=\bigcup _{p\in pred}{\rm {REACH}}_{\rm {out}}}
  • R E A C H o u t [ S ] = G E N [ S ] ( R E A C H i n [ S ] K I L L [ S ] ) {\displaystyle {\rm {REACH}}_{\rm {out}}={\rm {GEN}}\cup ({\rm {REACH}}_{\rm {in}}-{\rm {KILL}})}

In other words, the set of reaching definitions going into S {\displaystyle S} are all of the reaching definitions from S {\displaystyle S} 's predecessors, p r e d [ S ] {\displaystyle pred} . p r e d [ S ] {\displaystyle pred} consists of all of the basic blocks that come before S {\displaystyle S} in the control-flow graph. The reaching definitions coming out of S {\displaystyle S} are all reaching definitions of its predecessors minus those reaching definitions whose variable is killed by S {\displaystyle S} plus any new definitions generated within S {\displaystyle S} .

For a generic instruction, we define the G E N {\displaystyle {\rm {GEN}}} and K I L L {\displaystyle {\rm {KILL}}} sets as follows:

  • G E N [ d : y f ( x 1 , , x n ) ] = { d } {\displaystyle {\rm {GEN}}=\{d\}} , a set of locally available definitions in a basic block
  • K I L L [ d : y f ( x 1 , , x n ) ] = D E F S [ y ] { d } {\displaystyle {\rm {KILL}}={\rm {DEFS}}-\{d\}} , a set of definitions (not locally available, but in the rest of the program) killed by definitions in the basic block.

where D E F S [ y ] {\displaystyle {\rm {DEFS}}} is the set of all definitions that assign to the variable y {\displaystyle y} . Here d {\displaystyle d} is a unique label attached to the assigning instruction; thus, the domain of values in reaching definitions are these instruction labels.

Worklist algorithm

Reaching definition is usually calculated using an iterative worklist algorithm.

Input: control-flow graph CFG = (Nodes, Edges, Entry, Exit)

// Initialize
for all CFG nodes n in N,
    OUT = emptyset; // can optimize by OUT = GEN;
// put all nodes into the changed set
// N is all nodes in graph,
Changed = N;
// Iterate 
while (Changed != emptyset)
{
    choose a node n in Changed;
    // remove it from the changed set
    Changed = Changed -{ n };
    // init IN to be empty
    IN = emptyset;
    // calculate IN from predecessors' OUT
    for all nodes p in predecessors(n)
         IN = IN Union OUT;
    oldout = OUT; // save old OUT
    // update OUT using transfer function f_n ()
    OUT = GEN Union (IN -KILL);
    // any change to OUT compared to previous value?
    if (OUT changed) // compare oldout vs. OUT
    {    
        // if yes, put all successors of n into the changed set
        for all nodes s in successors(n)
             Changed = Changed U { s };
    }
}

See also

Further reading

Compiler optimizations
Basic block
Data-flow
analysis
SSA-based
Code generation
Functional
Global
Other
Static analysis
Categories: