Notes on *"Drill & Join: A Method for Exact Inductive Program Synthesis"*

# Notes on *"Drill & Join: A Method for Exact Inductive Program Synthesis"* Some notes on the ***[Drill & Join: A Method for Exact Inductive Program Synthesis](https://link.springer.com/chapter/10.1007/978-3-319-17822-6_13)*** paper by *Remis Balaniuk*. ## 1. Algebra definitions - **Vector space**: an algebraic structure defined by: 1. a scalar field ***K*** 1. a set of vectors ***S*** 1. two binary operations that must oblige a set of properties (**vector sum** and **multiplication of a scalar for a vector**) - **Linear combination**: it's an expression containing *sums of vectors* and *multiplications of vectors by scalars*. Given a vector space ***V***, defined on a scalar field ***K*** and a set of ***N*** vectors ***S* = {*v0*, *v1*, ..., *vN*}**, a linear combination can be represented as **a0\*v0 + a1\*v1 + ... + aN\*vN**, where **{*a0*, *a1*, ..., *aN*}** are scalars of the field ***K***. - **Basis of a vector space**: a set of vectors with which it's possible to generate the entire vector space using linear combinations of the vectors. - **Span of a vector space**: the set of all the vectors that can be generated using linear combinations of the vectors part of the vector space. - **Dimension of a vector space**: it's the cardinality of one of the bases that can be used to generate the entire vector space. - **Algebraic ring**: it's a fundamental algebraic structure which consists of a set equipped with two binary operations that generalize the arithmetic operations of addition and multiplication. Through this generalization, theorems from arithmetic are extended to non-numerical objects such as polynomials, series, matrices and functions. - **Boolean algebra**: it's a set with binary operations *AND* and *OR* and the binary operator *NOT*, hence satisfying the Boolean laws. - **Boolean ring**: it's a ring for which **x2 = x** for all the elements **x** of the ring. It's a Boolean algebra, with ring multiplication corresponding to conjuction (*AND*) and ring addition to exclusive disjunction (*XOR*). In Logic the combination of the operators *XOR* and *AND* over the elements true/false, produce the Galois Field F2. ## 2. Galois Field 2 - GF(2) * Shortened as **GF(2)**, **F2** or **Z/2Z** * It's the smallest field made of two elements * The elements are called **true** (1) and **false** (0) * As every algebraic field, two operations are defined: addition and multiplication * The **addition** is carried on with the logical **XOR** operation * The **multiplication** is carried on with the logical **AND** operation * Addition has an identity element (**false**) and an inverse for each element * Multiplication has an identity element (**true**) and an inverse for every element (but **false**) ### Boolean polynomials A Boolean polynomial in ***B*** (Boolean algebra) is a string that results from a finite number of Boolean operations on a finite number of elements in ***B***. A multivariate polynomial over a ring has a unique representation as a xor-sum of monomials. This gives a normal form for Boolean polynomials: **XorSum(*J* ⊂ {1, 2,.., n}, *aJ* × MulSum(*j* ∈ *J*, *xj*))** where the *aJ* ∈ *B* coefficients are uniquely determined. This representation it's called ***[Algebraic Normal Form](#3.-Algebraic-Normal-Form-(ANF))***. A Boolean function of *n* variables **&fnof;: *Z2*n ⇒ *N2*** can be associated with a Boolean polynomial by deriving an algebraic normal form. ### GF(2) operations **Addition** | + | 0 | 1 | |:-----:|:-:|:-:| | **0** | 0 | 1 | | **1** | 1 | 0 | **Multiplication** | * | 0 | 1 | |:-----:|:-:|:-:| | **0** | 0 | 0 | | **1** | 0 | 1 | ## 3. Algebraic Normal Form (ANF) * Called also ***Zhegalkin Normal Form*** or ***Reed-Muller Expansion*** * The entire formula always evaluates to either **true** (1) or **false** (0) * Only **AND** (between single variables) and **XOR** operations are available ### Algebraic Normal Form expansion rules **XOR (logical exclusive disjunction)** > **(1 &oplus; A) &oplus; (1 &oplus; A &oplus; B)** > ⇒ (1 &oplus; A &oplus; 1 &oplus; A &oplus; B) > ⇒ (1 &oplus; 1 &oplus; A &oplus; A &oplus; B) > ⇒ B **NOT (logical negation)** > **¬(1 &oplus; A &oplus; B)** > ⇒ 1 &oplus; (1 &oplus; A &oplus; B) > ⇒ 1 &oplus; 1 &oplus; A &oplus; B > ⇒ A &oplus; B **AND (logical conjunction)** > **(1 &oplus; A) &and; (1 &oplus; A &oplus; B)** > ⇒ (1 &and; (1 &oplus; A &oplus; B)) &oplus; (A &and; (1 &oplus; A &oplus; B)) > ⇒ (1 &oplus; A &oplus; B) &oplus; (A &oplus; A &oplus; (A &and; B)) > ⇒ 1 &oplus; A &oplus; B &oplus; A &oplus; A &oplus; (A &and; B) > ⇒ 1 &oplus; A &oplus; A &oplus; A &oplus; B &oplus; (A &and; B) > ⇒ 1 &oplus; A &oplus; B &oplus; (A &and; B) **OR (logical disjunction)** Two rules are available: * **A &or; B = 1 &oplus; ((1 &oplus; A) &and; (1 &oplus; B))** * **A &or; B = A &oplus; B &oplus; (A &and; B)** > **(1 &oplus; A) &or; (1 &oplus; A &oplus; B)** > ⇒ (1 &oplus; A) &oplus; (1 &oplus; A &oplus; B) &oplus; ((1 &oplus; A) &and; (1 &oplus; A &oplus; B)) > ⇒ (1 &oplus; A) &oplus; (1 &oplus; A &oplus; B) &oplus; ((1 &and; (1 &oplus; A &oplus; B)) &oplus; (A &and; (1 &oplus; A &oplus; B))) > ⇒ (1 &oplus; A) &oplus; (1 &oplus; A &oplus; B) &oplus; ((1 &oplus; A &oplus; B) &oplus; (A &oplus; A &oplus; (A &and; B))) > ⇒ (1 &oplus; A) &oplus; (1 &oplus; A &oplus; B) &oplus; ((1 &oplus; A &oplus; B &oplus; A &oplus; A &oplus; (A &and; B))) > ⇒ (1 &oplus; A) &oplus; (1 &oplus; A &oplus; B) &oplus; ((1 &oplus; A &oplus; A &oplus; A &oplus; B &oplus; (A &and; B))) > ⇒ (1 &oplus; A) &oplus; (1 &oplus; A &oplus; B) &oplus; (1 &oplus; A &oplus; B &oplus; (A &and; B)) > ⇒ 1 &oplus; A &oplus; 1 &oplus; A &oplus; B &oplus; 1 &oplus; A &oplus; B &oplus; (A &and; B) > ⇒ 1 &oplus; 1 &oplus; 1 &oplus; A &oplus; A &oplus; A &oplus; B &oplus; B &oplus; (A &and; B) > ⇒ 1 &oplus; A &oplus; (A &and; B) ## 4. Abstract algebra and higher order functions Let *Vn* be the set of all binary words of length *n*, |*Vn*| = 2n. The Boolean algebra *B* on *Vn* is a vector space over *Z2*. This correspondence between an algebra and our program space defines some useful properties: * the operations in a family need not be all explicitly stated; * a basis is any set of operators (operations) from which the remaining operations can be obtained by composition. A Boolean algebra may be defined from any of several different bases; * to be a basis is to yield all other operations by composition, whence any two bases must be intertranslatable; * a basis is a linearly independent spanning set; * let *v1*, .., *vm* ∈ *B* be a basis of *B*. *Span*(*v1*, .., *vm*) = { *λ1* &and; *v1* &oplus; .. &oplus; *λm* &and; *vm* | *λ1*, .., *λm* ∈ *Z2* }; * the dimension *dim*(*B*) of the Boolean algebra is the minimum *m* such that *B* = *span*(*v1*, .., *vm*). ## 5. The *Drill* function As first thing we define a **XorSum** function in pseudocode as: ``` def XorSum(from, to, elements) { result = 0 for (i = from; i < to; i++) { result ^= elements[i] } return result } ``` We define the set ***Fm*** of functions of the form **&fnof;: Z2p × Z2q → Z2** and contains Boolean functions belonging to a Boolean algebra of dimension **m** and described in polynomial form as: **&fnof;(X, Y) = *XorSum*(1, m, gi(X) &and; hi(Y))** where **gi: Z2p ⇒ Z2** and **hi: Z2q ⇒ Z2** are also boolean functions. The polynomial of order **n** (with **n** variables) has been rewritten by two polynomials of order **p** and **q**, where **p + q = n**. Considering a function **&fnof; ∈ *Fm***, a chosen **X0 ∈ Z2p** and a chosen **Y0 ∈ Z2q** such that **&fnof;(X0, Y0) ≠ 0**, we define the ***Drill*** high-order function as: **FX0,Y0 = F(&fnof;(X, Y), X0, Y0) = &fnof;(X, Y) &oplus; (&fnof;(X0, Y) &and; &fnof;(X, Y0))** ### Theorem If &fnof; ∈ *Fm* and &fnof;(X0, Y0) ≠ 0, then *&fnof;1* = **F**(X0, Y0) is included in a space *Fr* with *r* &leq; m-1. We therefore generated a new function of reduced vector space dimension. ### Proof 1. Consider *W* = *span*(*h1*, .., *hm*), which means a basis has *m* generator vectors. 2. Consequently *dim*(*W*) &leq; *m*. 3. The linear operator *h* ∈ *W* → *h*(*Y0*) is not the zero map because the hypothesis (*&fnof;*(X0, Y0) ≠ 0) forbids *hi*(*Y0*) = 0 for all *i* = 1, .., *m*. > ***&fnof;*(X0, Y0) ≠ 0** > ⇔ *XorSum*(1, *m*, *gi*(X0) &and; *hi*(Y0)) ≠ 0 > ⇔ ∀*i*: *gi*(X0) &and; *hi*(Y0) ≠ 0 > ⇔ ∀*i*: *hi*(Y0) ≠ 0 4. Consequently, the vector subspace *W1* = { *h* ∈ *W* | *h*(*Y0*) = 0 } has *dim*(*W1*) &leq; (*m* - 1). Because we know for sure that less than *m* functions *hi*(*Y0*) can lead to 0 from the step 3. 5. Notice that for all *X* ∈ *Z2p* we have the *Drill* function *f1*(*X*, ·) ∈ *W1*. > ***f1*(X, Y0)** > ⇒ *f*(X, Y0) &oplus; (*f*(X0, Y0) &and; *f*(X, Y0)), as per *Drill* expansion > ⇒ *f*(X, Y0) &oplus; (1 &and; *f*(X, Y0)), as *f*(X0, Y0) ≠ 0 (hence 1 in F2) by hypothesis > ⇒ *f*(X, Y0) &oplus; *f*(X, Y0) > ⇒ 0 6. Let *r* = *dim*(*W1*) and *h1i*, with *i* = 1, .., *r* be a spanning set such that *W1* = *span*(*h1*, .., *hr*). For all *X* ∈ *Z2p*, *f1*(X, ·) can be represented as a linear combination of the *h1i*, the coefficients depending on *X*. In other words, there exist coefficients *g1i*(*X*) such that ***f1i*(*X*, ·) = *XorSum*(1, *r*, *g1i*(*X*) &and; *h1i*)**, or written differently: ***f1*(*X*, *Y*) = *XorSum*(1, *r*, *g1i*(*X*) &and; *h1i*(*Y*))**. ### *Drill* application example As an illustration of the application of the *Drill* function, consider the Boolean function *&fnof;*(*x*, *y*): *Z2* × *Z2* → *Z2* whose behaviour can be represented by the following truth table. | *x* | *y* | *&fnof;*(*x*, *y*) = *y* &or; (¬*x* &and; ¬*y*) | |:-:|:-:|:-:| | F | F | T | | F | T | T | | T | F | F | | T | T | T | **Note**: the method does not require high-level definitions, only I/O pairs. So the illustration with the known function expression is just to show how the method works. #### Rewriting the function in ANF To ease the reasoning we can rewrite the *&fnof;*(*x*, *y*) function in ANF as: > ***&fnof;*(*x*, *y*)** > ⇒ *y* &or; (¬*x* &and; ¬*y*) > ⇒ *y* &or; ((T &oplus; *x*) &and; (T &oplus; *y*)) > ⇒ *y* &or; ((T &and; (T &oplus; *y*)) &oplus; (*x* &and; (T &oplus; *y*))) > ⇒ *y* &or; ((T &oplus; *y*) &oplus; (*x* &oplus; (*x* &and; *y*))) > ⇒ *y* &or; (T &oplus; *y* &oplus; *x* &oplus; (*x* &and; *y*)) > ⇒ *y* &oplus; (T &oplus; *y* &oplus; *x* &oplus; (*x* &and; *y*)) &oplus; (*y* &and; (T &oplus; *y* &oplus; *x* &oplus; (*x* &and; *y*))) > ⇒ *y* &oplus; (T &oplus; *y* &oplus; *x* &oplus; (*x* &and; *y*)) &oplus; (*y* &oplus; *y* &oplus; (*x* &and; *y*) &oplus; (*x* &and; *y*)) > ⇒ *y* &oplus; T &oplus; *y* &oplus; *x* &oplus; (*x* &and; *y*) &oplus; *y* &oplus; *y* &oplus; (*x* &and; *y*) &oplus; (*x* &and; *y*) > ⇒ T &oplus; *x* &oplus; (*x* &and; *y*) The rewritten *&fnof;*(*x*, *y*) function has dimension 2 (as can be seen by the two variable components ***x*** and **(*x* &and; *y*)**) and no shorter representation is possible. Respecting the theorem hypothesis *&fnof;*(*x0*, *y0*) ≠ 0, we can query the oracle (the truth table in this case) and obtain: ***x0* = F** and ***y0* = F**. Consequently we can obtain the partial functions (we'll be able to obtain their representation only with new iterations of the *Drill* function or with the support of the *Join* function, but for illustration purposes we use the known Boolean expression): > ***&fnof;*(*x0*, *y*)** > ⇒ *&fnof;*(F, *y*) > ⇒ T &oplus; F &oplus; (F &and; *y*) > ⇒ T > ***&fnof;*(*x*, *y0*)** > ⇒ *&fnof;*(*x*, F) > ⇒ T &oplus; *x* &oplus; (*x* &and; F) > ⇒ T &oplus; *x* That can be used to build the full *Drill* function: > ***&fnof;1*(*x*, *y*)** > ⇒ F(*&fnof;*(*x*, *y*), *x0*, *y0*) > ⇒ *&fnof;*(*x*, *y*) &oplus; (*&fnof;*(*x0*, *y*) &and; *&fnof;*(*x*, *y0*)) > ⇒ (T &oplus; *x* &oplus; (*x* &and; *y*)) &oplus; (T &and; (T &oplus; *x*)) > ⇒ (T &oplus; *x* &oplus; (*x* &and; *y*)) &oplus; (T &oplus; *x*) > ⇒ T &oplus; *x* &oplus; (*x* &and; *y*) &oplus; T &oplus; *x* > ⇒ (*x* &and; *y*) We can see that *&fnof;1*(*x*, *y*) has dimension 1 (a single variable component **(*x* &and; *y*)**), confirming the *Drill* theorem. ## 6. The *Join* function Consider the set ***Fm*** of Boolean functions of the form **&fnof;: Z2n → Z2** and ***v1*, .., *vm* ∈ *Fm*** a basis. The functions in this set can be described in polynomial form as: ***&fnof;*(*X*) = *XorSum*(1, *m*, *λi* &and; *vi*(*X*))**, where *λi* ∈ *Z2* are the coefficients. That's basically a linear combination of the generator vectors with *Z2* coefficients. Considering a function *&fnof;* ∈ *Fm*, a chosen *Xj* ∈ *Z2n* (which is the input to a partial function) such that *&fnof;*(*Xj*) ≠ 0 (because we are interested in rebuilding the partial functions obtained in a previous step, that have non-zero result) and a chosen function *vj*(*Xj*) ≠ 0 (otherwise we wouldn't be able to apply the linear combination for the selected *Xj*), we define the *Join* higher-order function as: **H*Xjvj* = H(*&fnof;*(*X*), *Xj*, *vj*) = *&fnof;*(*X*) &oplus; *vj*(*X*)** ### Theorem If *&fnof;* ∈ *Fm*, *&fnof;*(*Xj*) ≠ 0 and *vj*(*X*j) ≠ 0, then *&fnof;2* = H*Xjvj* ∈ *Fr* and *r* &leq; *m* - 1. We therefore generated a new function of reduced vector space dimension (as in the *Drill* theorem). ### Proof 1. Consider *W* = *span*(*v1*, .., *vm*), which means a basis has *m* generator vectors. 2. Consequently *dim*(*W*) &leq; *m*. 3. The linear operator *v* ∈ *W* → *v*(*Xj*) is not the zero map, otherwise *vj*(*Xj*) = 0 would be against one the hyphotesis (*vj*(*X*j) ≠ 0). 4. Consequently, the vector subspace *W2* = { *&fnof;* ∈ *W* | *&fnof;*(*Xj*) = 0 } has *dim*(*W2*) &leq; *m* - 1. Because we know for sure that less than *m* functions *&fnof;*(*Xj*) can lead to 0, knowing that some *&fnof;*(*Xj*) ≠ 0 per hypothesis. 5. *&fnof;2* ∈ *W2*, in fact *&fnof;2* = *&fnof;*(*Xj*) &oplus; *vj*(*Xj*) = 0 and per hypotheses we have *&fnof;*(*Xj*) ≠ 0 and *v*(*Xj*) ≠ 0, which in F2 means that *&fnof;*(*Xj*) = 1 and *v*(*Xj*) = 1. 6. Let *r* = *dim*(*W2*) and *v2i*, i = 1, .., *r* be a spanning set such that *W2* = *span*(*v21*, .., *v2r*). The function *&fnof;2* can be represented as a linear combination of the *v2i* generator vectors. In other words, there exist coefficients *λ2i* such that: ***&fnof;2*(*X*) = *XorSum*(1, *r*, *λ2i*(*X*) &and; *v2i*)**. ### *Join* application example As an illustration of the application of the *Join* function, we can consider the same Boolean function represented in the [*Drill* application example](#Drill-application-example) section. *&fnof;* belongs to a Boolean algebra of dimension 22 (as seen in the [Rewriting the function in ANF](#Rewriting-the-function-in-ANF) section) which can be defined, for instance, by the following spanning set: * *v1*(*x*, *y*) = *T* * *v2*(*x*, *y*) = *x* * *v3*(*x*, *y*) = *y* * *v4*(*x*, *y*) = *x* &and; *y* Respecting the stated hypotheses we can can pick *Xj* = (T, T) and *vj* = *v2*(*x,* *y*) = *x*, such that we have: *&fnof;*(T, T) ≠ F and *v2*(T, T) ≠ F. Applying the *Join* function we obtain: > ***&fnof;2*(*x*, *y*)** > ⇒ **H**(*&fnof;*(*X*), *Xj*, *vj*) > ⇒ *&fnof;*(*X*) &oplus; *vj*(*X*) > ⇒ (*y* &or; (¬*x* &and; ¬*y*)) &oplus; *x* > ⇒ T &oplus; *x* &oplus; (*x* &and; *y*) &oplus; *x* > ⇒ T &oplus; (*x* &and; *y*) We can see that *&fnof;2*(*x*, *y*) has dimension 1 (a single variable component **(*x* &and; *y*)**), confirming the *Join* theorem. **@Rolf: The ANF expression obtained in the original paper is (*x* &and; *y*) which seem to be wrong (missing the "T &oplus;" term).** ## 7. The *Drill* & *Join* program synthesis method *Drill* and *Join* are used to define a program synthesis method. Considering an active learning (where the algorithm can interactively query the user or some other information source, usually called oracle) framework, the input function *&fnof;*(*X*, *Y*) on **F** (*Drill* function) and the input function *&fnof;*(*X*) on **H** (*Join* function) represent an external **unknown concept** from which it is possible to obtain data by means of queries (I/O pairs). This **unknown concept** could be, for instance, a computer program that one would like to emulate (e.g. a blackbox function to be inferred) or optimize (e.g. a shellcode to be deobfuscated). ### Properies of the *Drill* and *Join* higher-order functions * **F** and **H** can be applied recursively: if *&fnof;*(*X,* *Y*) ∈ *Fm* then *&fnof;11*(*X*, *Y*) = **F**(*&fnof;*(*X*, *Y*), *X0*, *Y0*) ∈ *Fm-1* and *&fnof;12*(*X*, *Y*) = **F**(*&fnof;1*(*X*, *Y*), *X1*, *Y1*) ∈ *Fm-2*. Similarly, if *&fnof;*(*X*) ∈ *Fm* then *&fnof;21*(*X*) = **H**(*&fnof;*(*X*), *X0*, *v0*) ∈ *Fm-1* and *&fnof;22*(*X*) = **H**(*&fnof;*(*X*), *X1*, *v1*) ∈ *Fm-2*. Each recursion generates a new function belonging to an algebra of a lower dimension. * The recursion ends when the higher-order functions become the zero map: **F**(*&fnof;*(*X*, *Y*), *Xi*, *Yi*) = 0 &iff; *&fnof;*(*X*, *Y*) = (*&fnof;*(*Xi*, *Y*) &and; *&fnof;*(*X*, *Yi*)) and similarly **H**(*&fnof;*(*X*), *Xi*, *vi*) = 0 &iff; *&fnof;*(*X*) = *vi*(*X*). ### Obtaining the output program Two functions derived by the unrolled recursion of the *Drill* and *Join* functions can be used to rebuild the original target function. #### *Drill* rebuilding formula The original target function *&fnof;*(*X*, *Y*) can be recreated using the partial function *&fnof;1* obtained using the *Drill* function: > ***&fnof;*(*X*, *Y*) = *XorSum*(1, *m*, *&fnof;1i*(*Xi*, *Y*) &and; *&fnof;1i*(*X*, *Yi*))** #### *Join* rebuilding formula The original target function *&fnof;*(*X*) can be recreated using the partial function *&fnof;2* obtained using the *Join* function: > ***&fnof;*(*X*) = *XorSum*(1, *m*, *vi*(*X*))** **Note**: - if the *Drill* initial condition cannot be established (no (*X0*, *Y0*) such that *&fnof;*(*X0*, *Y0*) ≠ 0 can be found), then the target function is necessarily *&fnof;*(*X*, *Y*) = F; - if the *Join* initial condition cannot be established (no *X0* such that *&fnof;*(*X0*) ≠ 0 can be found), then the target function is the zero map. Although if given a valid *Xj* no *vj* function such that *vj*(*Xj*) ≠ 0 can be found, then the basis has been improperly selected; - the application of the **F** higher-order function alone requires the solution of an exponential number of subspace synthesis problems. And the **H** higher-order function may require a high cardinality basis. The two functions must be applied together to lead to proper results. ## 8. Complete example of application of the *Drill* & *Join* functions * Input function to be synthesized: **&fnof;(x, y) = (y &or; (¬x &and; ¬y))** * The oracle (truth table) for the input function is available in the [*Drill* application example](#Drill-application-example) chapter. * As previously mentioned, the function is already in the minimal form, but it'll end up as an ANF expression. * The steps will involve a mix of *Drill* and *Join* phases, depending on the knowledge of the vector space bases (necessary for the *Join* phases). * **Note**: the example in the original paper is wrong because it violates the hypotheses of the *Join* theorem, selecting a basis of *vj* functions and *Zj* values that lead to *vj*(*Zj*) = 0. The same example can be found in its corrected form in the ***[Effectiveness of Synthesis in Concolic Deobfuscation](https://www.sciencedirect.com/science/article/pii/S0167404817301475)*** paper. **@Rolf: Just in case it may be worth to verify if the example in the original paper it's wrong, to me it looks like it is.** ### Step by step example 1. We start applying the *Drill* function *&fnof;11*(*x*, *y*) = **F**(*&fnof;*(*x*, *y*), *x0*, *y0*) such that *&fnof;*(*x0*, *y0*) ≠ F. We can query the oracle (using random sampling or relying on the knowledge of the truth table) and choose ***x0 = F*** and ***y0 = F***. Note that the pairs (*x0* = F, *y0* = T) and (*x0* = T, *y0* = T) would have been valid choices too, because they would have validated the hypothesis *&fnof;*(*x0*, *y0*) ≠ F. Now that valid inputs are known we can apply the *Drill* function: > ***&fnof;11*(*x*, *y*)** > ⇒ **F**(*&fnof;*(*x*, *y*), x0, y0) > ⇒ *&fnof;*(*x*, *y*) &oplus; (*&fnof;*(*x0*, *y*) &and; *&fnof;*(*x*, *y0*)) Resulting in two partial functions ***&fnof;*(*x0*, *y*)** and ***&fnof;*(*x*, *y0*)** with truth table: | *x* | *y* | *&fnof;*(*x0* = F, *y*) | *&fnof;*(*x*, *y0* = F)| |:-:|:-:|:-:|:-:| | F | F | T | T | | F | T | T | T | | T | F | T | F | | T | T | T | F | We should then check if ***&fnof;11*(*x*, *y*) = F** for each value of *x* and *y*, to determine if *&fnof;11* is the zero map and if we should therefore stop the algorithm. | *x* | *y* | *&fnof;*(*x0* = F, *y*) &and; *&fnof;*(*x*, *y0* = F) | *&fnof;11*(*x*, *y*) | |:-:|:-:|:-:|:-:| | F | F | T | F | | F | T | T | F | | T | F | F | F | | T | T | F | T | We can see that *&fnof;11*(*x*, *y*) ≠ F for (*x* = T, *y* = T), so we didn't find the zero map. 2) We can now apply the *Join* function *&fnof;21*(*Z*) = **H**(*&fnof;*(*Z*), *Zj*, *vj*) such that *vj*(*Zj*) ≠ F and *&fnof;*(*Zj*) ≠ F. We need to select a basis for the vector space containing the ***&fnof;*(*x0*, *y*)** and ***&fnof;*(*x*, *y0*)** partial functions, for example selecting the two generator vectors ***v0*(*z*) = *T*** and ***v1*(*z*) = z** (where *z* can be either *x* or *y*). The truth table for the *vj* functions follows: | *x* | *v0*(*x*) | *v1*(*x*) | |:-:|:-:|:-:| | F | T | F | | T | T | T | Now that a basis is known we can apply the *Join* function: > ***&fnof;21*(*X*)** > ⇒ **H**(*&fnof;*(*Z*), *Zj*, *vj*) > ⇒ *&fnof;*(*Z*) &oplus; *vj*(*Zj*) Applying a first iteration of the *Join* function to the ***&fnof;*(*x0*, *y*)** function, selecting ***y0* = T** and ***v0*(*y*) = *T*** to oblige to the hypotheses (note that selecting the pair (*y0* = F, *v0*(*y*) = T) would have been a valid choice too), we obtain: | *y* | *&fnof;21,1* = **H**(*&fnof;*(*x0*, *y*), *y0*, *v0*) | |:-:|:-:| | F | F | | T | F | We can see that *&fnof;21,1* = F for all the values of *x* and *y*, so the *&fnof;21,1*(*Z*) function is the zero map and we can therefore stop the recursion and obtain the full rebuilt function (using the [*Join* rebuilding formula](#Join-rebuilding-formula)): ***&fnof;*(*x0*, *y*) = *v0*(*z*) = T**. Applying a first iteration of the *Join* function to the ***&fnof;*(*x*, *y0*)** function, selecting ***x0* = F** and ***v0*(*x*) = *T*** to oblige to the hypotheses (note that in this case we couldn't select other values of *x* given the input function and the selected basis), we obtain: | *x* | *&fnof;21,2* = **H**(*&fnof;*(*x*, *y0*), *x0*, *v0*) | |:-:|:-:| | F | F | | T | T | We can see that *&fnof;21,2* ≠ F for some values of *x* and *y*, so the *&fnof;21,2*(*Z*) function is not the zero map. Therefore we'll need to apply a new round of recursion. 3) As we saw at the step 1, the *&fnof;11*(*x*, *y*) function didn't generate the zero map, so a second iteration of the *Drill* function is necessary. We can therefore calculate *&fnof;12*(*x*, *y*) = **F**(*&fnof;11*(*x*, *y*), *x1*, *y1*) such that *&fnof;11*(*x1*, *y1*) ≠ F. Querying the oracle (basically looking at the truth table of the *&fnof;11*(*x*, *y*) function) we obtain that (*&fnof;11*(*x*, *y*) ≠ F) &iff; (*x* = T, *y* = T). Hence we select ***x1* = T** and ***y1* = T** and obtain: > ***&fnof;12*(*x*, *y*)** > ⇒ **F**(*&fnof;11*(*x*, *y*), *x1*, *y1*) > ⇒ *&fnof;11*(*x*, *y*) &oplus; (*&fnof;11*(*x1*, *y*) &and; *&fnof;11*(*x*, *y1*)) Resulting in two partial functions ***&fnof;11*(*x1*, *y*)** and ***&fnof;11*(*x*, *y1*)** with truth table: | *x* | *y* | *&fnof;11*(*x1* = T, *y*) | *&fnof;11*(*x*, *y1* = T) | |:-:|:-:|:-:|:-:| | F | F | F | F | | F | T | F | F | | T | F | F | F | | T | T | T | T | We should then check if ***&fnof;12*(*x*, *y*) = F**, to determine if *&fnof;12* is the zero map and if we should therefore stop the algorithm. | *x* | *y* | *&fnof;11*(*x1* = T, *y*) &and; *&fnof;11*(*x*, *y1* = T) | *&fnof;12*(*x*, *y*) | |:-:|:-:|:-:|:-:| | F | F | F | F | | F | T | F | F | | T | F | F | F | | T | T | T | F | We can see that *&fnof;12*(*x*, *y*) = F for all the values of *x* and *y*, so we found the zero map and we can stop the recursion. 4. We can now apply a new round of the *Join* function on the not yet synthesized partial functions: ***&fnof;21,2***, ***&fnof;11*(*x1*, *y*)**, ***&fnof;11*(*x*, *y1*)**. Applying a second iteration of the *Join* function to the ***&fnof;21,2*** function, selecting ***x1* = T** and ***v1*(*x*) = *x*** to oblige to the hypotheses (note that in this case we couldn't select other values of *x* given the input function and the selected basis), we obtain: | *x* | *&fnof;22,2* = H(*&fnof;21,2*, *x1*, *v1*) | |:-:|:-:| | F | F | | T | F | We can see that *&fnof;22,2* = F for all the values of *x* and *y*, so the *&fnof;22,2*(*Z*) function is the zero map and we can therefore stop the recursion and obtain the full rebuilt function (using the [*Join* rebuilding formula](#Join-rebuilding-formula)): ***&fnof;*(*x*, *y0*) = *v0*(*z*) &oplus; *v1*(*z*) = T &oplus; x**. To handle the partial functions ***&fnof;11*(*x1*, *y*)** and ***&fnof;11*(*x*, *y1*)** we can use the same basis chosen to handle the ***&fnof;*(*x0*, *y*)** and ***&fnof;*(*x*, *y0*)** functions. Applying a first iteration of the *Join* function to the ***&fnof;11*(*x1*, *y*)** function, selecting ***y0* = T** and ***v0*(*y*) = *y*** to oblige to the hypotheses (note that in this case we couldn't select other values of *y* given the input function and the selected basis), we obtain: | *y* | *&fnof;21,3* = H(*&fnof;11*(*x1*, *y*), *y0*, *v0*) | |:-:|:-:| | F | F | | T | F | We can see that *&fnof;21,3* = F for all the values of *y*, so the *&fnof;21,3*(*Z*) function is the zero map and we can therefore stop the recursion and obtain the full rebuilt function (using the [*Join* rebuilding formula](#Join-rebuilding-formula)): ***&fnof;11*(*x1*, *y*) = *v0*(*z*) = y**. Applying a first iteration of the *Join* function to the ***&fnof;11*(*x*, *y1*)** function, selecting ***x0* = T** and ***v0*(*x*) = *y*** to oblige to the hypotheses (note that in this case we couldn't select other values of *x* given the input function and the selected basis), we obtain: | *x* | *&fnof;21,4* = H(*&fnof;11*(*x*, *y1*), *x0*, *v0*) | |:-:|:-:| | F | F | | T | F | We can see that *&fnof;21,4* = F for all the values of *x*, so the *&fnof;21,4*(*Z*) function is the zero map and we can therefore stop the recursion and obtain the full rebuilt function (using the [*Join* rebuilding formula](#Join-rebuilding-formula)): ***&fnof;11*(*x*, *y1*) = *v0*(*z*) = x**. **Note**: In the first *Join* iteration step applied to the *&fnof;11*(*x1*, *y*) function we may have chosen *y0* = T and *v0* = T to oblige with the hypotheses, but at the second step we would have realised that it would have been impossible to choose a *y1* value to oblige to the hypothesis, meaning that the first selection of the generator vector (*v0*) was wrong. **@Rolf: It is not clear to me if there's an optimal way to select the generator vector from the basis.** 5. Now that we have the full representation of all the partial problems we can build the original target function (using the [*Drill* rebuilding formula](#Drill-rebuilding-formula)): > ***&fnof;*(*x*, *y*)** > ⇒ (*&fnof;*(*x0*, *y*) &and; *&fnof;*(*x*, *y0*)) &oplus; (*&fnof;1*(*x1*, *y*) &and; *&fnof;1*(*x*, *y1*)) > ⇒ (*T* &and; (*T* &oplus; *x*)) &oplus; (*y* &and; *x*) > ⇒ (*T* &oplus; *x*) &oplus; (*y* &and; *x*) > ⇒ *T* &oplus; *x* &oplus; (*x* &and; *y*) We can verify that the obtained ANF representation matches with the manual expansion executed in the [Rewriting the function in ANF](#Rewriting-the-function-in-ANF) section, but this time we obtained it with the recursive application of the *Drill* and *Join* functions querying the oracle in a black-box fashion.