PLONK Arithmetization

PLONK v.s. Halo

Similarities: both are proof systems, both use polynomial commitment schemes, and both share the same constraint system (constraint polynomial, BG12 permutations, lookup tables, custom gates)

Differences: PLONK requires a trusted-setup, PLONK uses pairings, Halo has proof recursion (2-cycle of elliptic curve groups), PLONK uses a Kate-like PCS, Halo uses a Hyrax-like PCS

Timeline:

[Standard] PLONK (Aug-2019) - proof system via polynomial commitments using a single updateable trusted-setup. Uses pairings, a non-R1CS constraint-system, BG12 permutation proofs, and Kate-like polynomial commitments.

Halo (Sep-2019) - proof recursion via elliptic curve cycles, no pairings, no trusted setup, Hyrax-like PCS.

TurboPLONK (late-2019/early-2020) - PLONK with custom gates/constraints.

plookup (Nov-2020) - extends PLONK with lookup tables (replaces expensive computations with key-value maps that look up a computation's output for a given input; currently lookup tables focus on replacing bitwise operations such as AND and XOR).

UltraPLONK (late-2020) - PLONK with custom gates and lookup tables, i.e. combines TurboPLONK and plookup (makes many SNARK-unfriendly algorithms friendlier). Side note: using lookup tables and custom gates allows for efficient modular arithmetic in a field smaller than the PLONK field, which could allow for recursive proofs without cycles of elliptic curves (i.e. multiple base and scalar fields). Side note #2: Pedersen hashing within UltraPLONK is roughly as efficient as Poseidon.

Halo2 (late-2020) - combines UltraPLONK, a PCS that does not require a trusted-setup, and cycles of elliptic curves for recursion.

Question: Verification time is constant in PLONK, but is not in Halo/Halo2 because Halo's proof size varies with computation size? Zcash will rely on batch transaction verification to achieve something close to succinctness.

Standard PLONK Arithmetization

Arithmetization - breaks a general computation into a sequence of steps.


Example:

a * b + 23 == 100

Arithmetization converts a general computation into a system of polynomials (a set of interdependent polynomials) where each is constrained to a value. A general computation's set of polynomials is its constraint system. Every polynomial is of the same form.

Standard PLONK uses the constraint polynomial:

$$s_{l} x_{l} + s_{r} x_{r} + s_{m} x_{l} x_{r} = s_{o} x_{o} + c$$

$s_{l}, s_{r}, s_{o}, s_{m} \in \{0, 1\}$ are boolean selectors
$x_{l}, x_{r}, x_{o} \in F$ are values used in arithmetic
$c \in F$ is an optional constant used to assign constraint system values to a constant (public input)

which can encode addition, multiplication, and constant assignment.

Operation	Example	Constraint
Addition	$l + r = o$	$1 x_{l} + 1 x_{r} + 0 x_{l} x_{r} = 1 x_{o} + 0$
Multiplication	$l * r = o$	$0 x_{l} + 0 x_{r} + 1 x_{l} x_{r} = 1 x_{o} + 0$
Assign Constant (Public Input)	$l = 5$	$1 x_{l} + 0 x_{r} + 0 x_{l} x_{r} = 0 x_{o} + 5$
Assign Constant (Public Input)	$r = 5$	$0 x_{l} + 1 x_{r} + 0 x_{l} x_{r} = 0 x_{o} + 5$
Assign Constant (Public Input)	$l + r = 5$	$1 x_{l} + 1 x_{r} + 0 x_{l} x_{r} = 0 x_{o} + 5$
Assign Constant (Public Input)	$l * r = 5$	$0 x_{l} + 0 x_{r} + 1 x_{l} x_{r} = 0 x_{o} + 5$
Exponentiation	$(l * r) * r = o$	$\begin{aligned} (0) \quad & 0 x_{l} + 0 x_{r} + 1 x_{l} x_{r} = 1 x_{o} \\ (1) \quad & 0 x_{l} + 0 x_{r} + 1 x_{o}^{(0)} x_{r} = 1 x_{o} \end{aligned}$


Note: a constraint may reference a value from another constraint, e.g. exponentiation, where the output of gate (0) is the left input of gate (1). This is enforced via a permutation argument, not by the gate constraint itself.
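As a minimal sketch of how a single gate constraint is evaluated, the snippet below checks the constraint equation over a small prime field. The modulus `P = 101` and the helper name `gate_holds` are illustrative assumptions, not part of PLONK.

```python
P = 101  # small prime modulus, chosen only for illustration

def gate_holds(s_l, s_r, s_m, s_o, c, x_l, x_r, x_o, p=P):
    """Return True if s_l*x_l + s_r*x_r + s_m*x_l*x_r == s_o*x_o + c (mod p)."""
    lhs = (s_l * x_l + s_r * x_r + s_m * x_l * x_r) % p
    rhs = (s_o * x_o + c) % p
    return lhs == rhs

# Addition gate: 3 + 4 = 7
add_ok = gate_holds(1, 1, 0, 1, 0, x_l=3, x_r=4, x_o=7)
# Multiplication gate: 3 * 4 = 12
mul_ok = gate_holds(0, 0, 1, 1, 0, x_l=3, x_r=4, x_o=12)
# Constant gate: l = 5 (x_r and x_o are unused by this gate)
const_ok = gate_holds(1, 0, 0, 0, 5, x_l=5, x_r=0, x_o=0)
# A wrong output value fails the multiplication gate
mul_bad = gate_holds(0, 0, 1, 1, 0, x_l=3, x_r=4, x_o=13)
```

The selector values passed in each call are the same ones listed per operation in the table.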

TODO: The $s_{l}, s_{r}, s_{o}, s_{m}, c$ values define a computation. The $x_{l}, x_{r}, x_{o}$ values are filled in by someone who knows how the computation proceeds (the inputs and outputs at each gate).

We can think of a computation's constraint system as a matrix:

$x_{l}$	$x_{r}$	$x_{o}$	$s_{l}$	$s_{r}$	$s_{m}$	$s_{o}$	$c$
$x_{l}^{(1)}$	$x_{r}^{(1)}$	$x_{o}^{(1)}$	$s_{l}^{(1)}$	$s_{r}^{(1)}$	$s_{m}^{(1)}$	$s_{o}^{(1)}$	$c^{(1)}$
$⋮$	$⋮$	$⋮$	$⋮$	$⋮$	$⋮$	$⋮$	$⋮$
$x_{l}^{(n)}$	$x_{r}^{(n)}$	$x_{o}^{(n)}$	$s_{l}^{(n)}$	$s_{r}^{(n)}$	$s_{m}^{(n)}$	$s_{o}^{(n)}$	$c^{(n)}$

where the prover fills in the columns $x_{l}$, $x_{r}$, and $x_{o}$ with values (the witnesses) such that every constraint is satisfied.

Example:

a * b + 23 == 100

(cont.)

The $s$'s and equality constraints define a computation, the $c$'s give an instance of the computation; the prover fills in $x$ values to show that they know how a computation proceeds.

	$x_{l}$	$x_{r}$	$x_{o}$	$s_{l}$	$s_{r}$	$s_{m}$	$s_{o}$	$c$
$a * b = c$	$x_{l}^{(1)}$	$x_{r}^{(1)}$	$x_{o}^{(1)}$	$0$	$0$	$1$	$1$	$0$
$d = 23$	$x_{l}^{(2)}$	$x_{r}^{(2)}$	$x_{o}^{(2)}$	$1$	$0$	$0$	$0$	$23$
$c + d = e$	$x_{l}^{(3)}$	$x_{r}^{(3)}$	$x_{o}^{(3)}$	$1$	$1$	$0$	$1$	$0$
$f = 100$	$x_{l}^{(4)}$	$x_{r}^{(4)}$	$x_{o}^{(4)}$	$1$	$0$	$0$	$0$	$100$

Equality Constraints: $x_{o}^{(1)} = x_{l}^{(3)}$, $x_{l}^{(2)} = x_{r}^{(3)}$, $x_{o}^{(3)} = x_{l}^{(4)}$
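The following sketch checks a concrete witness for the example `a * b + 23 == 100`, using `a = 7` and `b = 11`. It assumes the wiring implied by the row labels of the table (the output of row 1 feeds the left input of row 3, the constant of row 2 feeds the right input of row 3, and the output of row 3 equals the constant of row 4). The modulus `P = 257` and all names are illustrative assumptions.

```python
P = 257  # illustrative prime, large enough to hold the value 100

# One (s_l, s_r, s_m, s_o, c) tuple per row of the example table.
GATES = [
    (0, 0, 1, 1, 0),    # a * b = c
    (1, 0, 0, 0, 23),   # d = 23
    (1, 1, 0, 1, 0),    # c + d = e
    (1, 0, 0, 0, 100),  # f = 100
]

# Copy constraints as pairs of (column, row) cells, rows 1-indexed.
COPIES = [(("o", 1), ("l", 3)), (("l", 2), ("r", 3)), (("o", 3), ("l", 4))]

def satisfied(witness):
    """witness is a list of (x_l, x_r, x_o) tuples, one per row."""
    for (s_l, s_r, s_m, s_o, c), (x_l, x_r, x_o) in zip(GATES, witness):
        if (s_l * x_l + s_r * x_r + s_m * x_l * x_r - s_o * x_o - c) % P != 0:
            return False  # a gate constraint fails
    col = {"l": 0, "r": 1, "o": 2}
    for (c1, r1), (c2, r2) in COPIES:
        if witness[r1 - 1][col[c1]] != witness[r2 - 1][col[c2]]:
            return False  # a copy constraint fails
    return True

good = [(7, 11, 77), (23, 0, 0), (77, 23, 100), (100, 0, 0)]
# Every gate holds in isolation here, but row 3 no longer reuses row 1's output.
bad_wiring = [(7, 11, 77), (23, 0, 0), (76, 24, 100), (100, 0, 0)]
```

The second witness shows why gate checks alone are not enough: each row is individually valid, yet the computation is not the intended one.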

Compressing Constraint Checks

We would like to replace $n$ constraint checks (i.e. $\forall i \in [n]$: is the $i^{th}$ constraint satisfied?) with a single check of some form. We do this by transforming the constraint system into a polynomial expression that is true if and only if the constraint system is satisfied.

Lagrange Interpolation

Lagrange interpolation converts a function's evaluation form $\{(x_{1}, y_{1}), \dots, (x_{n}, y_{n})\}$ into a polynomial passing through each point.

Lagrange interpolation is used to represent each column as a polynomial: $x_{l} (X), x_{r} (X), x_{o} (X), s_{l} (X), s_{r} (X), s_{m} (X), s_{o} (X), c (X)$, where each column's polynomial on the $i^{th}$ input outputs the column's value in row $i$.

We associate each matrix row $i \in [n]$ with a unique value $ω^{i}$, where the set of values associated with all rows is $H = \{ω^{1}, \dots, ω^{n}\}$. Zipping this set of inputs with a column, $zip (H, column) = \{(ω^{1}, a_{1}), \dots, (ω^{n}, a_{n})\}$, gives the evaluation form for a polynomial that outputs the $i^{th}$ column value on row $i$'s input $ω^{i}$. Performing Lagrange interpolation on each column's evaluation form produces the polynomial that maps each row's input $ω^{i}$ to the value in the column at that row, $a_{i}$. For example, to interpolate column $x_{l}$ into $x_{l} (X)$ we use the evaluation form $\{(ω^{1}, x_{l}^{(1)}), \dots, (ω^{n}, x_{l}^{(n)})\}$.

$H$	$x_{l}$	$x_{r}$	$x_{o}$	$s_{l}$	$s_{r}$	$s_{m}$	$s_{o}$	$c$
$ω^{1}$	$x_{l}^{(1)}$	$x_{r}^{(1)}$	$x_{o}^{(1)}$	$s_{l}^{(1)}$	$s_{r}^{(1)}$	$s_{m}^{(1)}$	$s_{o}^{(1)}$	$c^{(1)}$
$⋮$	$⋮$	$⋮$	$⋮$	$⋮$	$⋮$	$⋮$	$⋮$	$⋮$
$ω^{n}$	$x_{l}^{(n)}$	$x_{r}^{(n)}$	$x_{o}^{(n)}$	$s_{l}^{(n)}$	$s_{r}^{(n)}$	$s_{m}^{(n)}$	$s_{o}^{(n)}$	$c^{(n)}$
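A small sketch of interpolating one column over the row labels $ω^{1}, \dots, ω^{n}$. The field $F_{17}$, the generator `OMEGA = 4` (which has order 4 modulo 17), and the column values are illustrative assumptions; a real system would use a much larger field and FFT-based interpolation.

```python
P = 17      # illustrative prime field F_17
OMEGA = 4   # element of order 4 modulo 17
N = 4       # number of rows

# Row labels omega^1 .. omega^N
H = [pow(OMEGA, i, P) for i in range(1, N + 1)]

def lagrange_eval(xs, ys, x, p=P):
    """Evaluate at x the unique polynomial of degree < len(xs) through (xs[j], ys[j])."""
    total = 0
    for j, (xj, yj) in enumerate(zip(xs, ys)):
        num, den = 1, 1
        for m, xm in enumerate(xs):
            if m != j:
                num = num * (x - xm) % p
                den = den * (xj - xm) % p
        # pow(den, -1, p) is the modular inverse (Python 3.8+)
        total = (total + yj * num * pow(den, -1, p)) % p
    return total

column = [3, 9, 12, 5]  # hypothetical column values for rows 1..4
values_on_H = [lagrange_eval(H, column, w) for w in H]
```

Evaluating the interpolated polynomial at row `i`'s label returns the column's value in row `i`, which is the property the constraint polynomial relies on.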

Rewriting Constraint Checks as a Polynomial Expression

Given the set of interpolating polynomials, we use the constraint equation to write an expression that holds for every input $ω^{i} \in H$:

$$s_{l} (ω^{i}) x_{l} (ω^{i}) + s_{r} (ω^{i}) x_{r} (ω^{i}) + s_{m} (ω^{i}) x_{l} (ω^{i}) x_{r} (ω^{i}) = s_{o} (ω^{i}) x_{o} (ω^{i}) + c (ω^{i}) .$$

Moving the right hand side to the left, we rewrite the above as:

$$s_{l} (ω^{i}) x_{l} (ω^{i}) + s_{r} (ω^{i}) x_{r} (ω^{i}) + s_{m} (ω^{i}) x_{l} (ω^{i}) x_{r} (ω^{i}) - s_{o} (ω^{i}) x_{o} (ω^{i}) - c (ω^{i}) = 0 .$$

We can write the left hand side of the above equation as a polynomial:

$$s_{l} (X) x_{l} (X) + s_{r} (X) x_{r} (X) + s_{m} (X) x_{l} (X) x_{r} (X) - s_{o} (X) x_{o} (X) - c (X)$$

which outputs $0$ on every input $ω^{i}$, i.e. has roots $ω^{1}, \dots, ω^{n}$. We know some of the polynomial's roots, therefore we know that its linear factorization contains the degree-1 terms $(X - ω^{1}) \dots (X - ω^{n})$:

$$s_{l} (X) x_{l} (X) + s_{r} (X) x_{r} (X) + s_{m} (X) x_{l} (X) x_{r} (X) - s_{o} (X) x_{o} (X) - c (X) = (X - ω^{1}) \dots (X - ω^{n}) h (X) .$$

We call $(X - ω^{1}) \dots (X - ω^{n})$ the vanishing polynomial $V (X)$, which outputs $0$ on every input $ω^{i}$:

$$V (X) = \prod_{i \in [n]} (X - ω^{i})$$

which, when $H$ is a cyclic multiplicative group of order $n$ generated by $ω$, simplifies to:

$$\begin{aligned} V (X) & = X^{n} - 1 \\ V (ω^{i}) & = (ω^{i})^{n} - 1 \\ & = ω^{i n \bmod n} - 1 \\ & = ω^{0} - 1 \\ & = 1 - 1 \\ & = 0 \end{aligned}$$

and we call $h (X)$ its cofactor polynomial. Thus, we can write the constraint system polynomial as an equality:

$$s_{l} (X) x_{l} (X) + s_{r} (X) x_{r} (X) + s_{m} (X) x_{l} (X) x_{r} (X) - s_{o} (X) x_{o} (X) - c (X) = V (X) h (X) .$$

The constraint system is satisfied if its polynomial representation $s_{l} (X) x_{l} (X) + \dots - c (X)$ is divisible by $V (X)$ without remainder:

$$V (X) \mid s_{l} (X) x_{l} (X) + s_{r} (X) x_{r} (X) + s_{m} (X) x_{l} (X) x_{r} (X) - s_{o} (X) x_{o} (X) - c (X)$$

Thus we can perform $n$ constraint checks using a single polynomial division check.
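The divisibility check can be sketched end to end for the example `a * b + 23 == 100` with `a = 7`, `b = 11`. Everything concrete here is an assumption made for illustration: the field $F_{257}$, the generator `OMEGA = 16` of a subgroup of order 4, the coefficient-list polynomial representation, and the helper names. The remainder modulo $X^{n} - 1$ is computed by folding coefficients, since $X^{n} \equiv 1$.

```python
P = 257      # illustrative prime with 4 | P - 1
OMEGA = 16   # element of order 4 in F_257 (16^2 = -1 mod 257)
N = 4
H = [pow(OMEGA, i, P) for i in range(1, N + 1)]

# Polynomials are coefficient lists, lowest degree first.
def add(a, b):
    n = max(len(a), len(b))
    a, b = a + [0] * (n - len(a)), b + [0] * (n - len(b))
    return [(x + y) % P for x, y in zip(a, b)]

def neg(a):
    return [(-x) % P for x in a]

def mul(a, b):
    out = [0] * (len(a) + len(b) - 1)
    for i, x in enumerate(a):
        for j, y in enumerate(b):
            out[i + j] = (out[i + j] + x * y) % P
    return out

def interpolate(ys):
    """Lagrange-interpolate the column ys over the row labels H."""
    result = [0]
    for j, yj in enumerate(ys):
        basis, den = [1], 1
        for m, xm in enumerate(H):
            if m != j:
                basis = mul(basis, [(-xm) % P, 1])  # multiply by (X - xm)
                den = den * (H[j] - xm) % P
        scale = yj * pow(den, -1, P) % P
        result = add(result, [coef * scale % P for coef in basis])
    return result

def remainder_mod_vanishing(f):
    """Remainder of f modulo X^N - 1, using X^N = 1 to fold coefficients."""
    r = [0] * N
    for i, coef in enumerate(f):
        r[i % N] = (r[i % N] + coef) % P
    return r

def constraint_poly(rows):
    """Rows are (x_l, x_r, x_o, s_l, s_r, s_m, s_o, c) tuples."""
    x_l, x_r, x_o, s_l, s_r, s_m, s_o, c = [interpolate(list(col)) for col in zip(*rows)]
    lhs = add(add(mul(s_l, x_l), mul(s_r, x_r)), mul(s_m, mul(x_l, x_r)))
    return add(lhs, neg(add(mul(s_o, x_o), c)))

rows = [
    (7, 11, 77, 0, 0, 1, 1, 0),      # a * b = c
    (23, 0, 0, 1, 0, 0, 0, 23),      # d = 23
    (77, 23, 100, 1, 1, 0, 1, 0),    # c + d = e
    (100, 0, 0, 1, 0, 0, 0, 100),    # f = 100
]
bad_rows = [(7, 11, 78, 0, 0, 1, 1, 0)] + rows[1:]  # wrong product in row 1

good_remainder = remainder_mod_vanishing(constraint_poly(rows))
bad_remainder = remainder_mod_vanishing(constraint_poly(bad_rows))
```

A zero remainder means the constraint polynomial is a multiple of $V (X)$; changing a single witness value leaves a non-zero remainder. This sketch covers only the gate constraints, not the copy constraints.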

Permutation Notation

A permutation $σ$ that shuffles an array of $n$ elements, $σ ((a_{1}, \dots, a_{n})) = (b_{1}, \dots, b_{n})$, can be written:

$$σ = (σ_{1}, \dots, σ_{n})$$

where $σ_{1} = 2$ means that $a_{2} \mapsto b_{1}$.

Equality Constraints

The polynomial division check ensures that the prover knows satisfying inputs and outputs for each gate, but does not ensure a correct wiring, e.g. the output of one gate is the input to another.

Example: I know $(a, b)$ such that $a + b = a * b$.

Applying the division check on the two gates proves that I know satisfying inputs and outputs for each gate in isolation, i.e. I know $(a, b, c, d, e, f)$ such that $a + b = e$ and $c * d = f$. Adding copy constraints proves that the left inputs are equal, the right inputs are equal and the outputs are equal, i.e. I know $(a, b)$ such that $a + b = a * b$.

To prove equality of wires we apply a permutation check across the constraint system values $x_{l}, x_{r}, x_{o}$. We join each of the three columns into a single array of length $3 n$:

$$u = x_{l} ∥ x_{r} ∥ x_{o} = (x_{l}^{(1)}, \dots, x_{l}^{(n)}, x_{r}^{(1)}, \dots, x_{r}^{(n)}, x_{o}^{(1)}, \dots, x_{o}^{(n)})$$

and create a permutation $σ \in S_{3 n}$ that acts on $u$ to produce an array $v = σ (u) = (u_{σ_{1}}, \dots, u_{σ_{3 n}})$. We choose $σ$ such that each subset of copied values forms a cycle, i.e. copied values are permuted with only their copies.

Side Note: permutation cycles

Given a permutation $σ \in S_{6}$ where:

$$σ = \left[ \begin{matrix} 1 & 2 & 3 & 4 & 5 & 6 \\ 2 & 1 & 4 & 6 & 5 & 3 \end{matrix} \right]$$

we can write $σ$ in cycle notation:

$$σ = (1 2) (3 4 6) (5)$$

or simply $(1 2) (3 4 6)$. An $m$-cycle is a subset of $m$ elements that return to their starting positions after $m$ applications of $σ$, e.g. the $2$-cycle $(1 2)$ maps $1 \to 2$ then $2 \to 1$.
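As a small illustration, the sketch below converts the two-row form of the permutation in this side note into its cycles. The function name `cycles` and the list encoding (entry `i - 1` holds the image of `i`) are assumptions made for the example.

```python
def cycles(one_line):
    """Decompose a permutation into cycles.

    one_line[i - 1] is the image of i, with elements numbered from 1.
    """
    seen, out = set(), []
    for start in range(1, len(one_line) + 1):
        if start in seen:
            continue
        cycle, k = [], start
        while k not in seen:
            seen.add(k)
            cycle.append(k)
            k = one_line[k - 1]  # follow the permutation
        out.append(tuple(cycle))
    return out

sigma = [2, 1, 4, 6, 5, 3]       # bottom row of the two-row notation
sigma_cycles = cycles(sigma)     # fixed points appear as 1-cycles
```

Dropping the 1-cycles from `sigma_cycles` gives the shortened form of the cycle notation.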

Example: permutation of constraint system values

Given a constraint system of $n = 3$ constraints:

$$\begin{aligned} x_{l} & = (x_{l}^{(1)}, x_{l}^{(2)}, x_{l}^{(3)}) \\ x_{r} & = (x_{r}^{(1)}, x_{r}^{(2)}, x_{r}^{(3)}) \\ x_{o} & = (x_{o}^{(1)}, x_{o}^{(2)}, x_{o}^{(3)}) \end{aligned}$$

$$\begin{array}{rccccccccc} u = x_{l} ∥ x_{r} ∥ x_{o} = ( & x_{l}^{(1)}, & x_{l}^{(2)}, & x_{l}^{(3)}, & x_{r}^{(1)}, & x_{r}^{(2)}, & x_{r}^{(3)}, & x_{o}^{(1)}, & x_{o}^{(2)}, & x_{o}^{(3)}) \\ \text{index} & 1 & 2 & 3 & 4 & 5 & 6 & 7 & 8 & 9 \end{array}$$

and wiring:

$$\begin{aligned} x_{l}^{(1)} & = x_{l}^{(2)} \\ x_{o}^{(1)} & = x_{r}^{(2)} = x_{l}^{(3)} \end{aligned}$$

we create a permutation $σ$, which operates on a set containing $| u | = 3 n$ elements, such that each set of copied constraint system values forms a cycle:

$$σ = (x_{l}^{(1)} x_{l}^{(2)}) (x_{l}^{(3)} x_{r}^{(2)} x_{o}^{(1)})$$

written in a less cumbersome way using indices in $u$ as:

$$σ = (1 2) (3 5 7) .$$

The permuted vector $v = σ (u) = (u_{σ_{1}}, \dots, u_{σ_{3 n}})$ is:

$$\begin{array}{rccccccccc} v = ( & x_{l}^{(2)}, & x_{l}^{(1)}, & x_{o}^{(1)}, & x_{r}^{(1)}, & x_{l}^{(3)}, & x_{r}^{(3)}, & x_{r}^{(2)}, & x_{o}^{(2)}, & x_{o}^{(3)}) \\ \text{index in } u & 2 & 1 & 7 & 4 & 3 & 6 & 5 & 8 & 9 \end{array}$$

The unpermuted array $u$ is encoded into the polynomial:

$$u (X) = \prod_{i \in [3 n]} (i X + u_{i})$$

and the permuted array $v$ is encoded into the polynomial:

$$v (X) = \prod_{i \in [3 n]} (σ_{i} X + v_{i})$$

where $σ_{i}$ is the index in $u$ that permutes into $v_{i}$, i.e. $v_{i} = u_{σ_{i}}$ and $v = σ (u) = (u_{σ_{1}}, \dots, u_{σ_{3 n}})$. Also let $u_{i} (X)$ and $v_{i} (X)$ denote the $i^{th}$ terms of $u (X)$ and $v (X)$ respectively:

$$\begin{aligned} u_{i} (X) & = (i X + u_{i}) \\ v_{i} (X) & = (σ_{i} X + v_{i}) . \end{aligned}$$

Notice that for each term $v_{i} (X)$ in $v (X)$ there is an identical term $u_{σ_{i}} (X)$ in $u (X)$, thus the polynomials $u (X)$ and $v (X)$ are the same, despite their $i^{th}$ terms possibly being different:

$$\begin{aligned} u_{i} (X) & \neq v_{i} (X) \text{ if } i \neq σ_{i} \\ u_{σ_{i}} (X) & = v_{i} (X) \\ \Rightarrow \prod_{i \in [3 n]} u_{i} (X) & = \prod_{i \in [3 n]} v_{i} (X) \end{aligned}$$

thus the following relation holds:

$$\frac{u (X)}{v (X)} = \prod_{i \in [3 n]} \frac{(i X + u_{i})}{(σ_{i} X + v_{i})} = \prod_{i \in [3 n]} \frac{u_{i} (X)}{v_{i} (X)} = 1 .$$

This relation between a polynomial representing an array and a polynomial representing a permutation of the array can be tested using Schwartz-Zippel: given a random input $β$, if $u (X) \neq v (X)$ then the probability that $u (β) = v (β)$, or equivalently $\frac{u (β)}{v (β)} = 1$, is negligible.

Side Note: PLONK uses a slightly different relation.

PLONK actually defines $u (X) = \prod_{i \in [3 n]} (i X + u_{i} + γ)$ and $v (X) = \prod_{i \in [3 n]} (σ_{i} X + v_{i} + γ)$, where the verifier chooses a random $γ$ and sends it to the prover (which are equivalent via the right-shift Schwartz-Zippel lemma), thus in reality the above expression is:

$$\frac{u (X)}{v (X)} = \prod_{i \in [3 n]} \frac{(i X + u_{i} + γ)}{(σ_{i} X + v_{i} + γ)} = 1 .$$

Side Note: the trace of a running product.

The trace of a running product is an array $s$ whose first element is $1$ and which contains the value of the product after each multiplicand, e.g. for the product $\prod_{i \in [3]} x^{i}$:

$$s = (1, 1 * x^{1}, 1 * x^{1} * x^{2}, 1 * x^{1} * x^{2} * x^{3})$$
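A minimal sketch of a running-product trace over a prime field. The function name `product_trace`, the modulus `101`, and the sample factors are illustrative assumptions.

```python
def product_trace(factors, p):
    """Return [1, f1, f1*f2, ...] with every entry reduced modulo p."""
    trace = [1]
    for f in factors:
        trace.append(trace[-1] * f % p)
    return trace

trace = product_trace([2, 3, 4], 101)
# Each entry is the previous entry times the next factor,
# and the last entry is the full product.
step_ok = all(trace[i + 1] == trace[i] * f % 101 for i, f in enumerate([2, 3, 4]))
```

The per-step relation checked by `step_ok` is the same recurrence that the grand product array must satisfy between neighboring elements.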

The verifier chooses a random $β$ for the prover to evaluate $\frac{u (β)}{v (β)}$, and the prover constructs a grand product array $s = (s_{1}, \dots, s_{3 n})$ that represents $\frac{u (β)}{v (β)}$, i.e. $s$ is the trace of the product $\frac{u (β)}{v (β)}$:

$$\begin{aligned} s_{1} & = 1 \\ s_{2} & = s_{1} \frac{u_{1} (β)}{v_{1} (β)} = (1) \frac{u_{1} (β)}{v_{1} (β)} \\ s_{3} & = s_{2} \frac{u_{2} (β)}{v_{2} (β)} = \left( 1 \frac{u_{1} (β)}{v_{1} (β)} \right) \frac{u_{2} (β)}{v_{2} (β)} \\ & ⋮ \\ s_{3 n} & = s_{3 n - 1} \frac{u_{3 n - 1} (β)}{v_{3 n - 1} (β)} = \left( 1 \frac{u_{1} (β)}{v_{1} (β)} \dots \frac{u_{3 n - 2} (β)}{v_{3 n - 2} (β)} \right) \frac{u_{3 n - 1} (β)}{v_{3 n - 1} (β)} \end{aligned}$$

$s$ is recursive in that its $i^{th}$ element is equal to the previous element times $\frac{u_{i - 1} (β)}{v_{i - 1} (β)}$:

$$s_{i} = s_{i - 1} \frac{u_{i - 1} (β)}{v_{i - 1} (β)} \quad \text{where: } i \in [2, 3 n] .$$

Notice that $\frac{u (β)}{v (β)}$ can be computed from $s$ by multiplying its last element by the last product term in $\frac{u (β)}{v (β)}$:

$$\frac{u (β)}{v (β)} = \prod_{i \in [3 n]} \frac{u_{i} (β)}{v_{i} (β)} = s_{3 n} \frac{u_{3 n} (β)}{v_{3 n} (β)} = s_{3 n + 1}$$

The prover creates a polynomial $s (X)$ using Lagrange interpolation on a publicly known set of $3 n$ inputs $H = ⟨ ω ⟩ = \{ω^{1}, \dots, ω^{3 n}\} \subset F$ and image $s$ such that:

$$\forall i \in [3 n] : s (ω^{i}) = s_{i} .$$

Notice that the following relation holds for a pair of neighboring inputs $ω^{i}$ and $ω^{i + 1}$:

$$\begin{aligned} s_{i + 1} & = s_{i} \frac{u_{i} (β)}{v_{i} (β)} \\ \Rightarrow s (ω^{i + 1}) & = s (ω^{i}) \frac{u_{i} (β)}{v_{i} (β)} . \end{aligned}$$

Given $s (X)$, the verifier wants to check that $v$ is a permutation of $u$ according to $σ$. The verifier checks that the permutation identity holds at the last element in $s$:

$$\begin{aligned} \frac{u (β)}{v (β)} & \overset{?}{=} 1 \\ \Rightarrow s (ω^{3 n}) \frac{u_{3 n} (β)}{v_{3 n} (β)} & \overset{?}{=} 1 \end{aligned}$$

and that $s$ is a correctly constructed running product, $s_{i} = s_{i - 1} \frac{u_{i - 1} (β)}{v_{i - 1} (β)}$:

$$\begin{aligned} s (ω^{3 n}) & \overset{?}{=} s (ω^{3 n - 1}) \frac{u_{3 n - 1} (β)}{v_{3 n - 1} (β)} \\ & ⋮ \\ s (ω^{2}) & \overset{?}{=} s (ω^{1}) \frac{u_{1} (β)}{v_{1} (β)} \\ s (ω^{1}) & \overset{?}{=} 1 . \end{aligned}$$

If $\frac{u (β)}{v (β)} = 1$ at the random challenge $β$, then with high probability $u (X) = v (X)$, which implies $v = σ (u)$.

Proving Subarrays via Randomized Set Differences (Not Done)

Note: "ordered multiset" == "array"
Note: "subarray" == "order preserving subset of an array"

PLONK defines the set difference $a^{'}$ of an ordered multiset $a = (a_{1}, \dots, a_{n})$ as the differences between adjacent elements of $a$:

$$\begin{aligned} a^{'} & = (a_{i + 1} - a_{i})_{i \in [n - 1]} \\ \text{e.g. } a & = (2, 6, 4) \\ a^{'} & = (6 - 2, 4 - 6) = (4, - 2) . \end{aligned}$$

Given arrays $a$ and $b$, if $b$ is a subarray of $a$, then the set difference of $sorted (a ∥ b)$ contains $| b |$ zeros because each $b_{i}$ is inserted next to an $a_{j}$ having the same value:

$$\begin{aligned} \text{e.g. } a & = (1, 3, 4, 7) \\ b & = (1, 7) \\ c & = sorted (a ∥ b) = (1, 1, 3, 4, 7, 7) \\ c^{'} & = (0, 2, 1, 3, 0) \\ c^{'} & \text{ contains } | b | = 2 \text{ zeros} . \end{aligned}$$

However, this alone does not prove that an array is a subarray of $a$, because we can find an array $f ⊄ a$ where the set difference of $sorted (a ∥ f)$ contains $| f |$ zeros:

$$\begin{aligned} \text{e.g. } a & = (1, 1) \\ f & = (3, 3) \\ g & = sorted (a ∥ f) = (1, 1, 3, 3) \\ g^{'} & = (0, 2, 0) \\ g^{'} & \text{ contains } | f | = 2 \text{ zeros} . \end{aligned}$$
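The two examples can be reproduced with a few lines of code. The helper names `adjacent_differences` and `zeros_in_sorted_concat` are assumptions for this sketch, which works over plain integers rather than field elements.

```python
def adjacent_differences(arr):
    """Return (arr[i+1] - arr[i]) for each pair of neighbors."""
    return [b - a for a, b in zip(arr, arr[1:])]

def zeros_in_sorted_concat(a, b):
    """Count zeros in the adjacent differences of sorted(a || b)."""
    merged = sorted(a + b)
    return adjacent_differences(merged).count(0)

diff_example = adjacent_differences([2, 6, 4])

# b is a subarray of a: one zero per element of b.
c_prime = adjacent_differences(sorted([1, 3, 4, 7] + [1, 7]))
zeros_subarray = zeros_in_sorted_concat([1, 3, 4, 7], [1, 7])

# f shares no element with a, yet the zero count still equals len(f).
g_prime = adjacent_differences(sorted([1, 1] + [3, 3]))
zeros_counterexample = zeros_in_sorted_concat([1, 1], [3, 3])
```

The second case yields the same zero count as a genuine subarray, which is why counting zeros alone is not a sound test.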

However, given a random value $γ$, we can test (w.h.p.) that an array $b$ is a subarray of an array $a$ by shifting all elements by $γ$:

$$\begin{aligned} \text{e.g. } a + γ & = (a_{1} + γ, \dots, a_{n} + γ) \\ b + γ & = (b_{1} + γ, \dots, b_{m} + γ) \\ c & = sorted (a ∥ b) = (1, 1, 3, 4, 7, 7) \\ c^{'} & = (0, 2, 1, 3, 0) \\ c^{'} & \text{ contains } | b | = 2 \text{ zeros} . \end{aligned}$$

Multiset Checks (Right-Shift Schwartz-Zippel)

Given two arrays $a$ and $b$ of equal length $n$, if both $a$ and $b$ contain the same elements, regardless of their ordering, then the products of the arrays' elements will be equal:

$$\begin{aligned} a & = (x, y, z) \\ b & = (x, z, y) = σ (a) \\ a_{1} a_{2} a_{3} & = b_{1} b_{2} b_{3} . \end{aligned}$$

However, the products of two equal-length arrays' elements being equal does not guarantee that $a$ and $b$ contain the same elements, e.g.

$$\begin{aligned} a & = (x, y, z) \\ b & = (\frac{x}{2}, \frac{z}{2}, 4 y) \neq σ (a) \\ \prod_{i \in [n]} a_{i} & = \prod_{i \in [n]} b_{i} . \end{aligned}$$

We can use a product equality check on the elements of two arrays to guarantee (with high probability) that the arrays contain the same elements by right-shifting every element of each array by a random value $γ$:

$$\begin{aligned} a & = (x, y, z) \\ b & = (x, z, y) \\ (a_{1} + γ) (a_{2} + γ) (a_{3} + γ) & = (b_{1} + γ) (b_{2} + γ) (b_{3} + γ) ⟹ b = σ (a) . \end{aligned}$$
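A sketch of the shifted-product multiset check, assuming the Mersenne prime $2^{61} - 1$ as the field modulus and concrete values `x = 6`, `y = 5`, `z = 8`. The function names are illustrative. In a real protocol $γ$ is sampled at random by the verifier; a fixed value is used in the comparisons here only so that the example is reproducible.

```python
import secrets

P = 2**61 - 1  # a Mersenne prime, used here as an illustrative field modulus

def shifted_product(arr, gamma, p=P):
    """Compute prod (x + gamma) over the array, modulo p."""
    out = 1
    for x in arr:
        out = out * (x + gamma) % p
    return out

def same_multiset_check(a, b, gamma, p=P):
    """Probabilistic test that a and b hold the same elements in some order."""
    return len(a) == len(b) and shifted_product(a, gamma, p) == shifted_product(b, gamma, p)

a = [6, 5, 8]          # (x, y, z)
b_perm = [6, 8, 5]     # (x, z, y), a true permutation of a
b_fake = [3, 4, 20]    # (x/2, z/2, 4y): same plain product, different elements

# How a verifier would sample the challenge in practice.
random_gamma = secrets.randbelow(P)

plain_products_equal = shifted_product(a, 0) == shifted_product(b_fake, 0)
perm_passes = same_multiset_check(a, b_perm, gamma=1)
fake_passes = same_multiset_check(a, b_fake, gamma=1)
```

With `gamma = 0` the check degenerates to the plain product comparison, which the non-permutation passes; with a non-zero shift it is rejected.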

Background

Linear Factors Contain Roots

A univariate polynomial $f (X)$ of degree $n$ having $n$ roots $\{a_{1}, \dots, a_{n}\}$ can be written uniquely as a product of $n$ degree-$1$ polynomials:

$$f (X) = c (X - a_{1}) \dots (X - a_{n}) \qquad roots (f) = \{a_{1}, \dots, a_{n}\}$$

where $c$ is a constant. If we don't care about the sign of the roots we can write:

$$f (X) = c (X + a_{1}) \dots (X + a_{n}) \qquad roots (f) = \{- a_{1}, \dots, - a_{n}\} .$$

A polynomial that has a linear factor $(i X - a)$ has a root at $\frac{a}{i}$:

$$\begin{aligned} f (X) & = \prod_{i \in [n]} (i X - a_{i}) \\ & = c (X - \frac{a_{1}}{1}) \dots (X - \frac{a_{n}}{n}) \quad \text{where: } c = 1 * \dots * n \\ roots (f) & = \{\frac{a_{1}}{1}, \dots, \frac{a_{n}}{n}\} . \end{aligned}$$

Encoding Sets and Arrays into Polynomials

We can encode a set of values $\{a_{1}, \dots, a_{n}\}$ into a unique polynomial:

$$f (X) = \prod_{i \in [n]} (X - a_{i}) \qquad roots (f) = \{a_{1}, \dots, a_{n}\} .$$

We can encode an array $(a_{1}, \dots, a_{n})$ into a unique polynomial where each root encodes an array element and its position:

$$f (X) = \prod_{i \in [n]} (i X - a_{i}) \qquad roots (f) = \{\frac{a_{1}}{1}, \dots, \frac{a_{n}}{n}\} .$$

Side Note: if an array is a permutation of another, e.g. $a = (a_{1}, \dots, a_{n})$ and $b = (b_{1}, \dots, b_{n}) = σ (a)$ where $σ = (σ_{1}, \dots, σ_{n})$ is a permutation, then the following equality holds:

$$\prod_{i \in [n]} (i X - a_{i}) = \prod_{i \in [n]} (σ_{i} X - b_{i})$$

e.g. $a = (5, 6, 7)$ and $b = (6, 7, 5) = σ (a)$:

$$\begin{aligned} a_{1} & \to b_{3} & (σ_{3} = 1) \\ a_{2} & \to b_{1} & (σ_{1} = 2) \\ a_{3} & \to b_{2} & (σ_{2} = 3) \end{aligned}$$

$$\begin{aligned} \prod_{i \in [n]} (i X - a_{i}) & = \prod_{i \in [n]} (σ_{i} X - b_{i}) \\ (1 X - a_{1}) (2 X - a_{2}) (3 X - a_{3}) & = (σ_{1} X - b_{1}) (σ_{2} X - b_{2}) (σ_{3} X - b_{3}) \\ (1 X - 5) (2 X - 6) (3 X - 7) & = (2 X - 6) (3 X - 7) (1 X - 5) \end{aligned}$$
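The identity for $a = (5, 6, 7)$ and $b = (6, 7, 5)$ can be confirmed by expanding both products. This sketch uses integer coefficient lists (lowest degree first); the helper names `poly_mul` and `encode` are assumptions.

```python
def poly_mul(a, b):
    """Multiply two polynomials given as coefficient lists, lowest degree first."""
    out = [0] * (len(a) + len(b) - 1)
    for i, x in enumerate(a):
        for j, y in enumerate(b):
            out[i + j] += x * y
    return out

def encode(indices, values):
    """Expand prod (k*X - v) over paired indices k and values v."""
    f = [1]
    for k, v in zip(indices, values):
        f = poly_mul(f, [-v, k])
    return f

a = [5, 6, 7]
sigma = [2, 3, 1]                      # sigma_i is the index in a that lands at position i
b = [a[s - 1] for s in sigma]          # b = sigma(a)

lhs = encode([1, 2, 3], a)             # prod (i X - a_i)
rhs = encode(sigma, b)                 # prod (sigma_i X - b_i)
mismatched = encode([1, 2, 3], b)      # pairs b with the wrong positions
```

Pairing each value of `b` with its original position reproduces the same factors in a different order, while pairing `b` with positions 1, 2, 3 gives a different polynomial.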

Schwartz-Zippel

Given two univariate polynomials:

$$f (X) = (X + a_{1}) \dots (X + a_{n}) \qquad g (X) = (X + b_{1}) \dots (X + b_{n})$$

Schwartz-Zippel says that for a random input $γ \leftarrow F$ the probability that different polynomials $f (X) \neq g (X)$ evaluate to the same value $f (γ) = g (γ)$ is very low:

$$Pr [f (γ) = g (γ) ∣ f (X) \neq g (X)] \leq \frac{max (d_{f}, d_{g})}{| F |}$$

where $d_{f}$ and $d_{g}$ are the degrees of $f$ and $g$. This allows us to test polynomial equivalency, i.e. is $f (X)$ the same polynomial as $g (X)$?
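To make the bound concrete, the sketch below counts, over every point of a small field, how often two different degree-2 polynomials agree. The field $F_{101}$ and the particular factors are illustrative assumptions; real deployments use fields so large that the agreement probability is negligible.

```python
P = 101  # small illustrative prime field

def evaluate(shifts, x, p=P):
    """Evaluate prod (x + a) over the given shifts a, modulo p."""
    out = 1
    for a in shifts:
        out = out * (x + a) % p
    return out

f_shifts = [1, 2]   # f(X) = (X + 1)(X + 2)
g_shifts = [1, 3]   # g(X) = (X + 1)(X + 3), a different polynomial

# Exhaustively find the points where f and g agree.
agreements = [x for x in range(P) if evaluate(f_shifts, x) == evaluate(g_shifts, x)]
max_degree = max(len(f_shifts), len(g_shifts))
```

Here $f - g = -(X + 1)$, so the two polynomials agree only at $X = -1 = 100$, which is within the bound of at most `max_degree` agreement points out of 101.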

Schwartz-Zippel holds if every root of $f (X)$ and $g (X)$ is shifted by a randomly chosen constant $δ \leftarrow F$:

$$\begin{aligned} f (X) & = (X + a_{1} + δ) \dots (X + a_{n} + δ) \\ g (X) & = (X + b_{1} + δ) \dots (X + b_{n} + δ) \\ Pr [f (γ) = g (γ) ∣ f (X) \neq g (X)] & \leq \frac{max (d_{f}, d_{g})}{| F |} \end{aligned}$$

because shifting two polynomials by the same value along the x-axis does not affect their equality.

We can encode the elements of a set $\{a_{1}, \dots, a_{n}\}$ into a polynomial $f (X) = (X - a_{1}) \dots (X - a_{n})$ and the set $\{b_{1}, \dots, b_{n}\}$ into a polynomial $g (X) = (X - b_{1}) \dots (X - b_{n})$ and use Schwartz-Zippel to test set equality (i.e. do sets $\{a_{1}, \dots, a_{n}\}$ and $\{b_{1}, \dots, b_{n}\}$ contain the same elements).

We can encode an array $(a_{1}, \dots, a_{n})$ as a polynomial whose roots contain the array elements and their positions:

$$f (X) = (1 X - a_{1}) \dots (n X - a_{n}) \qquad roots (f) = \{\frac{a_{1}}{1}, \dots, \frac{a_{n}}{n}\}$$

thus, we can use Schwartz-Zippel to test array equality (i.e. $\forall i \in [n] : a_{i} = b_{i}$).

PLONK uses Schwartz-Zippel (on random input $λ \leftarrow F$) with a random right-shift $δ \leftarrow F$ to check array equality:

$$\begin{aligned} f (X) & = (1 X - a_{1} + δ) \dots (n X - a_{n} + δ) \\ g (X) & = (1 X - b_{1} + δ) \dots (n X - b_{n} + δ) \\ Pr [f (λ) = g (λ) ∣ f (X) \neq g (X)] & \leq \frac{max (d_{f}, d_{g})}{| F |} . \end{aligned}$$

Multivariate Variant

The Schwartz-Zippel probability is the same for a non-zero $n$-variate polynomial $f (X_{1}, \dots, X_{n})$ of total degree $d$ and a randomly selected evaluation point $(x_{1}, \dots, x_{n}) \leftarrow F^{n}$:

$$Pr [f (x_{1}, \dots, x_{n}) = 0] \leq \frac{d}{| F |} .$$

Zero-Polynomial Variant

Given a random evaluation point such that $f (x_{1}, \dots, x_{n}) = 0$, the probability that $f (X_{1}, \dots, X_{n})$ is the zero-polynomial (always a zero-function) is the complement of the Schwartz-Zippel probability. A non-zero polynomial of degree $d$ evaluates to $0$ at a random point with probability at most $\frac{d}{| F |}$, so:

$$Pr [f (X_{1}, \dots, X_{n}) \text{ is the zero-polynomial}] \geq 1 - \frac{d}{| F |} .$$

Low-Degree Extension

Given an array $a = (a_{1}, \dots, a_{n}) \in F^{n}$ and a subset $S = \{s_{1}, \dots, s_{n}\} \subset F$, the low-degree extension of $a$ is the interpolation polynomial $a (X)$ of degree $d = n - 1$ that passes through the points $\{(s_{1}, a_{1}), \dots, (s_{n}, a_{n})\}$:

$$\forall i \in [n] : a (s_{i}) = a_{i} .$$

Cosets (todo)

We associate each value in the constraint system with a unique value in $F$:

$$\begin{aligned} x_{l} & \to H_{1} & \forall i \in [n] : x_{l}^{(i)} \to ω^{i} \\ x_{r} & \to H_{2} & \forall i \in [n] : x_{r}^{(i)} \to 2 ω^{i} \\ x_{o} & \to H_{3} & \forall i \in [n] : x_{o}^{(i)} \to 3 ω^{i} \end{aligned}$$

where $H_{1} = H$, $H_{2} = 2 H$, and $H_{3} = 3 H$ are pairwise disjoint cosets of the multiplicative subgroup $H = \{ω^{1}, \dots, ω^{n}\}$ of $F$ (which holds when $2$, $3$, and $3 \cdot 2^{-1}$ are not in $H$) and $ω^{i} \in H$. We then join the labels into an array of length $3 n$:

$$u = (ω^{1}, \dots, ω^{n}, 2 ω^{1}, \dots, 2 ω^{n}, 3 ω^{1}, \dots, 3 ω^{n})$$
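A quick sketch that builds the three label sets in a toy field and confirms that all $3 n$ labels are distinct. The field $F_{17}$ and the generator `OMEGA = 4` (order 4) are illustrative assumptions; whether the multipliers 2 and 3 give disjoint sets depends on the chosen field and subgroup.

```python
P, OMEGA, N = 17, 4, 4  # toy field F_17 and a subgroup of order 4

# H = {omega^1, ..., omega^N}
H = {pow(OMEGA, i, P) for i in range(1, N + 1)}

def coset(k):
    """Return the set k * H in F_P."""
    return {k * h % P for h in H}

H1, H2, H3 = coset(1), coset(2), coset(3)

# Labels for x_l, x_r, x_o joined into one array of length 3N.
labels = [k * pow(OMEGA, i, P) % P for k in (1, 2, 3) for i in range(1, N + 1)]
all_distinct = len(set(labels)) == 3 * N
```

Distinct labels are what allow a single permutation over the joined array to refer unambiguously to any cell of the three witness columns.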

