Sum-Check protocol notes

Table of contents

Sum-Check protocol notes

The Sum-Check protocol

What does Verifier gain with that?

Protocol description

Example Round 1

Example Round 2..v-1

Final round

A quick recap on what just happened

Soundness & Correctness

Completeness

Soundness

Sum-Check + Low-Degree Multilinear Extensions.

Multilinar Extensions

Lagrange interpolation for Multilinear polynomials

Extra topics

The Zero-Check protocol

Protocol overview

A real world example for an easy multiplication gate

TODO: Introduce eq poly

Get the multilinear extensions (MLEs) of our polynomials.

Run the sum-check against our multivariate polynomial

First round

Second round

Last round

The Prod-Check protocol

Credits

Questions left

Fix the offsets at which the randomness start.
Check Chiesa's talk to widen the explanations: https://www.youtube.com/watch?v=N1-67VPrsbA

The Sum-Check protocol

First formalized by Lund, Fortnow, Karloff and Nisan [LFKN92].

Suppose we're given a multi-variate polynomial

g

defined over the Finite Field

F

.
The main purpose of the Sum-Check is for the Prover to provide the Verifier with the following sum:

H := \sum_{b_{1} \in {0, 1}} \sum_{b_{2} \in {0, 1}} . . . \sum_{b_{v} \in {0, 1}} g (b_{1}, . .)

What does Verifier gain with that?

The verifier can compute

g

on it's own by evaluating

g

2^{v}

places (all the inputs in

{0, 1}^{v}

).
But this is of course non-acceptable performance-wise. Hence, the sum-check protocol allows the Verifier to actually reduce this problem to evaluate

g

at a particular random point

r_{x} \in F

Protocol description

Prover sends a value
$C_{1}$ claimed to be equal to the reslt of the sum defined by
$H$ .

Now a multi-round process stats where we will be repeating until we have gone through all the variables of

g

.
The multi-round process consists on the following:

Prover sends to Verifier the univariate polynomial
$g_{1} (X_{1})$ claimed to equal:
$\sum_{(x_{2}, . ., x_{v}) \in {0, 1}^{v - 1}}$ .
Then, the Verifier checks that actually,
$C_{1} = g 1 (0) + g 1 (1)$ and also ensures that
$g 1$ is a univariate polynomial with at most
$d e g (g (X_{j})) = d e g_{j} (g)$
This actually translates to:

Example Round 1

Let

g = (x, y, z) = 2 x^{3} + x z + y z

. Let

H

be equal to the sum of all

g

's evaluations over the Boolean Hypercube (

H = 12

).
This is referencing to the same eq we saw previously:

H := \sum_{b_{1} \in {0, 1}} \sum_{b_{2} \in {0, 1}} . . . \sum_{b_{v} \in {0, 1}} g (b_{1}, . .)

When the sum-check protocol is applied to

g

, Prover generates a univariate polynomial

s

which is based on

g

and which has only one variable (

x

in this case).
This polynomial contains the evaluations for all the possible

{0, 1}

values of the

n - 1

variables-left of the polynomial (except for (

x

) which is left as the only variable, that's why it is univariate).

This translates to the following:

g (x, 0, 0) + g (x, 0, 1) + g (x, 1, 0) + g (x, 1, 1) = (2 x^{3}) + (2 x^{3} + x) + (2 x^{3}) + (2 x^{3} + x + 1) = (8 x^{3}) + 2 x + 1

As it can be seen in the example, Prover left

x

untouched and actually computed the sum for all the evaluations left of

g

, creating the new univariate polynomial

s_{0} (x) = (8 x^{3}) + 2 x + 1

The Prover sends

s_{0}

to the Verifier who actually has 2 tasks:

Ensure that actually
$s_{0} (0) + s_{0} (1) = 12$ . This proves still the correct sum of
$H = 12$ .
Check that indeed the degree of the polynomial
$s_{0}$ is at most the degree of the monomial at the same position as te variable left. ie.
$d e g (s_{0} (x)) = d e g (g [0] (x, y, z))$ ->
$d e g (8 x^{3} + 2 x + 1) = d e g (2 x^{3})$

Wrong probably
3. Now, the Verifier samples a random element

r_{1}

F

based on Prover's data sent previously. And sends

r_{1}

to the Prover.
4. The prover now, sends the Verifier again a univariate polynomial

s_{2}

claimed to equal:

\sum_{(x_{j + 1}, . . ., x_{v}) \in {0, 1}^{v - j}} g (r_{1}, . . ., r_{j - 1}, X_{j}, x_{j + 1}, . . ., x_{v})

which should make the following statements hold:

g_{j - 1} (r_{j - 1}) = g_{j} (0) + g_{j} (1) d e g (g_{j}) \leq d e g_{j} (g) & & i s_u n i v a r i a t e (g_{j})

Or said with the same names as in the previous example:

s_{1} (0) + s_{1} (1) = s_{0} (r_{1}) d e g (s_{1} (y)) = d e g (g [1] (x, y, z))

Example Round 2..v-1

Remember that we ended the first round with:

g_{j = 0} = s_{0} (x) = (8 x^{3}) + 2 x + 1

and knowing that

s_{0} (0) + s_{0} (1) = 12

Now, let's make the numbers with the next round where

j = 1

.
Remember that Verified just sampled

r_{1}

(which we will suppose that equals 2).

Now prover constructs a univariate polynomial

s_{2}

which now leaves only the second variable of

g

and leaves the first one with the value

r_{1}

s_{1} (y) = g (2, y, 0) + g (2, y, 1) = 16 + (16 + 2 + y) = 34 + y

So as you can see, now we have half of the evaluations to compute and sum up together.

The Prover now sends

s_{1}

to the Verifier who actually has 2 tasks:

Ensure that actually
$s_{1} (0) + s_{1} (1) = s_{0} (r_{1})$ .
Check that indeed the degree of the polynomial
$s_{1}$ is at most the degree of the monomial at the same position as te variable left. ie.
$d e g (s_{1} (x)) = d e g (g [1] (x, y, z))$ ->
$d e g (34 + y) = d e g (x z)$

Final round

The process is repeated until we arrive to the latest variable of the polynomial
$g$ for which the maths would look the same:

Supose that the Verifier just sent

r_{3}

so that we're at the very last round.
Now the prover will build

g (r_{0}, r_{1}, r_{2})

and this polynomial will need to satisfy the same properties for the degrees & being univariate as well as

g (r_{0}, r_{1}, r_{2}) = s_{2} (r_{3})

Note that no evaluations are performed in the Boolean Hypercube in the last round. Rather, Verifier uses the oracle-query capabilities to actually query

g

at (

r_{0}, r_{1}, r_{2}

) getting

r_{3}

in response which is what was sent to the prover.

If the Verifier checked the consistency of all of the

$v$ rounds and accepted the responses from the Prover, arrived here, the Verifier is convinced of the correctness of the Sum-Check.

We will see why in the next section.

A quick recap on what just happened

In the first round, the prover sent

g_{0} (x)

is claimed to be equal to the polynomial

s_{0} (x)

The idea of the sum-check is that the Verifier will probabilistically check that this equality between polynomials holds by picking a random

F

, (

r_{0}

) confirming:

g_{0} (r_{0}) = s_{0} (r_{0})

Be aware that here, if

g_{0} \neq s_{0}

, then with probability at least

1 - d e g_{0} (g) / F

over the Verifier's choice of

r_{0}

, the previous equation fails to hold.

Note that here,

$d e g_{0} (g)$ denotes the degree of the 0th monomial of
$g$

**The check of the degree of

s

is done because two different degree

d

univariate polynomials agree on at most

d

inputs.

So if

| F | ≫ d e g_{i} (g)

then we can be sure that the probability of a false positive via

1 - d e g_{0} (g) / F

is very small**

Now it's important to realize that it would be very costly for the Verifier to actually evaluate

$s_{0} (r_{0})$ as it still has to evaluate the sum of the evaluations of
$g (x_{1}, . ., x_{v})$ . (It still needs to perform

2^{v - 1}

evaluations.

But as we saw, the sum-check protocol is really good at doing specifically this!! If you look at the problem now, we have

s_{0}

which is a sum of evaluations of a

(v - 1)

-variate polynomial.
So the goal for the Verifier now, is to recursively perform the same process as in Round 1 so that the number of the evaluations that the Verifier needs to perform is indeed lower.

Soundness & Correctness

Completeness

Completeness is obvious. As long as the Prover sends on each round

i

\forall i \in {0, v} g_{i} (X_{i})

then the Verifier will accept with probability 1 the satisfaction of the statement.

Soundness

A good proof of Soundness by induction can be seen in Thaler's book in pag.37.

This is the capture:

Image Not Showing Possible Reasons

The image was uploaded to a note which you don't have access to
The note which the image was originally uploaded to has been deleted

Learn More →

Sum-Check + Low-Degree Multilinear Extensions.

Multilinar Extensions

Definition:

Let

F

be any finite field and let

f : {0, 1} \to F

be any function mapping the v-dimensional Boolean HyperCube to

F

.
A

v

-variate polynomial

g

is said to be an estension of

f

g

agrees with

f

at all Boolean-valued inputs. Ie.

\forall i \in {0, 1}^{v} g (x) = f (x)

Any function

f

mapping

{0, 1} \to F

has an extension polynomial that is multilinear (a polynomial that has degree at most 1 in each variable).

This actually implies that the total degree of the polynomial is at most

v

which is logarithmic in respect to the domain (

2^{v}

).
In contrast, univariate low-degree extensions over a domain of size

n

have degree

n - 1

As with univariate low-degree extensions, one can think of a (low degree) extension

g

of a function

f : {0, 1} \to F

as a distance-amplifying encoding of

f

This means, that if two functions

f

f^{'} : {0, 1}

disagree at least by one input, then extensions

g

g^{'}

of total degree at most

d

will differ almost everywhere.

Lagrange interpolation for Multilinear polynomials

Lagrange interpolation is a mathematical technique that allows us to reconstruct a polynomial of degree

(n - 1)

given

n

data points. By using Lagrange interpolation, the verifier can interpolate the original multilinear polynomial P from a subset of the points claimed by the prover, reducing the number of required evaluations.

The process involves constructing

n

Lagrange basis polynomials, denoted as

L_{1}, L_{2}, \dots, L_{n}

, which have the property that

L_{i} (x_{i}) = 1

and

L_{i} (x_{j}) = 0

for

i \neq j

(

i, j = 1

n

These basis polynomials can be defined as:

L_{i} (x) = \prod_{j = 1, j \neq i}^{n} \frac{x_{i} - x_{j}}{x - x_{j}}

Using these basis polynomials, the verifier can then interpolate the original polynomial P as follows:

P (x) = \sum_{i = 1}^{n} y i \cdot L i (x)

Now, by evaluating the reconstructed polynomial P at a subset of points (different from the original claimed points), the verifier can efficiently check whether the prover's claim is valid. The number of evaluations required is reduced because the verifier only needs to evaluate P at a smaller set of points, determined by the Lagrange basis polynomials.

In summary, Lagrange interpolation reduces the number of evaluations needed by the verifier during the sum-check protocol by allowing the reconstruction of the original multilinear polynomial using a subset of claimed points. This technique improves the efficiency of the protocol without compromising its correctness.

Let's dive deeper into the reasoning behind the reduced number of evaluations required by the verifier when using Lagrange interpolation:

When using Lagrange interpolation in the sum-check protocol, the verifier constructs the Lagrange basis polynomials

${L_{1}, . . ., L_{n}}$ based on a subset of points claimed by the prover. These basis polynomials have the property that they evaluate to 1 at one particular point

x_{i}

and 0 at all other points

x_{j}

for

j \neq i

Now, instead of evaluating the original polynomial P at all the claimed points

{x_{1}, . ., x_{n}}

, the verifier can evaluate the reconstructed polynomial P at a different set of points. This set of points is determined by the Lagrange basis polynomials and is typically smaller than the total number of claimed points.

The reason for this reduction in the number of evaluations can be understood as follows:

Reconstruction of P: By using Lagrange interpolation, the verifier reconstructs the original polynomial P using a subset of the claimed points. The Lagrange basis polynomials form a system that spans the space of polynomials of degree

n - 1

, allowing the verifier to uniquely determine

P

Efficient interpolation: The reconstructed polynomial

P

, obtained through Lagrange interpolation, encapsulates the information about the original polynomial evaluated at the claimed points. This means that by evaluating

P

at a different set of points, the verifier can indirectly verify the correctness of the prover's claim.

Smaller evaluation set: The Lagrange basis polynomials determine the set of evaluation points for

P

. Since these basis polynomials have the property of evaluating to 1 at one particular point and 0 at all other points, the verifier needs to evaluate P only at those specific points to efficiently check the claim.

By evaluating the reconstructed polynomial P at this smaller set of points (we do have as many points as Lagrange basis polynomials), different from the original claimed points, the verifier can efficiently check the validity of the prover's claim. This reduction in the number of evaluations is possible because the verifier leverages the information encoded in the Lagrange basis polynomials to indirectly verify the claimed evaluations.

In summary, by using Lagrange interpolation, the verifier reconstructs the polynomial P using a subset of the claimed points and evaluates it at a different set of points determined by the Lagrange basis polynomials. This approach reduces the number of required evaluations, as the verifier only needs to evaluate P at this smaller set of points to efficiently check the prover's claim.

Extra topics

The amount of evaluations the Verifier needs to compute, decreases geometrically for each recursive round of sum-check applied. Is estimated that also, the Prover can compute all of the necessary points for the sum-check in
$O (2^{v})$ .
The Sum-Check protocol only requires the Verifier to actually know the degree of each of the monomials' variables of
$g$ and the ability to evaluate
$g$ at a random point
$r \in F$ (granted by Schwartz-Zippel). This means that the Verifier is able to carry on the Sum-Check on
$g$ without actually knowing it.

The Zero-Check protocol

The zero-check protocol is a technique used for example in HyperPlonk explained within the same paper in sec. 3.2.

The zero-check protocol is useful to prove in a multilinear PIOP built on top of sum-check which allows to prove polinomial identities that are 0.

This is in particular really useful to actually proof/check that a gate identity within a circuit sums up to zero.

Protocol overview

Verifier sends a random vector to Prover with at least as many elements as variables are in the multivariate polynomial
$f (\vec{X})$ .
After that, we multiply our multivariate polynomial by the eq poly. (We'll see later what this is and what it means). Ending up with the claim:

$\sum_{\vec{x} \in H^{2}} f (\vec{x}) \cdot eq (\vec{x}, \vec{r})$
Run the sum-check protocol over the multivariate polynomial that results from the previous product. This, should convince the Verifier that the poly sums to 0.

A real world example for an easy multiplication gate

Let's assume we have the following multiplication gate identities:

a	b	c
2	3	6
0	1	0
2	1	2
0	0	0

This is of course checking that

a \cdot b = c

which is transformed to be an identity that evaluates to 0 when holds that:

(a \cdot b) - c = 0

TODO: Introduce eq poly

e q (x, y) = \prod_{i = 1}^{μ} (x_{i} y_{i} + (1 - x_{i}) (1 - y_{i}))

Link it to the Lagrange interpolation section.

Get the multilinear extensions (MLEs) of our polynomials.

The first thing that we need to do if we want to be able to sum-check our identity is to find the multilinear extensions (MLEs) of our polynomials.
To do so, we proceed as follows:

Notice that

a (x)

has 4 terms. To encode it as a multilinear polinomial we will make use of 2 variables. As 2 bits in the Boolean Hypercube allow us to at least encode 4 different elements (Notice I say encode, not represent).

So, let's do it!
To start, we will call the two variables that form

\vec{x} \to {x_{1}, x_{2}}

Hence, the operation to uplift our vector as a multilinear polynomial works as follows:

\tilde{a} (\vec{x}) = \sum_{\vec{x} \in B^{μ}} a (\vec{x}) \cdot e q (x, \vec{X}) where \vec{X} = [x_{1}, x_{2}], a (\vec{x}) = a [x_{1}^{2} + x_{2}], e q (x, y) = \prod_{i = 1}^{μ} (x_{i} y_{i} + (1 - x_{i}) (1 - y_{i}))

We can already see the shape that

\tilde{a} (\vec{x})

will take:

\tilde{a} (\vec{x}) = 2 (1 - x_{1}) (1 - x_{2}) + 0 + 2 (1 - x_{1}) x_{2} + 0 = 2 - 2 x_{1}

Notice that what we are doing here is simply encoding into coefficient form a vector in a multilinear polynomial style.
As we said, to encode 4 elements we need 2 bits. Therefore, our

x

now becomes a vector for which each variable is a bit (remember we're in the boolean hypercube and so variables are ranged between

[0, 1]

To formulate it better, we're moving the elements of the vector to being encoded as the vertex of the 2-dimensional boolean hypercube

$H^{2}$ .
This means Vertex(0,0) = 2, Vertex(1,0) = 0, Vertex(0,1) = 0 and Vertex(1,1) = 2.
These are the four vertices of the 2-dimensional HyperCube.

For instance, you can actually see that

\tilde{a} (\vec{x})

has two coefficients at 2 and two more at 0. Notice how this maps exactly the actual vector

a (x)

of our multiplication gate table.

What we are doing here is set the polynomial in coefficient form similarly to how we do it in the univariate setting using Lagrange basis interpolation with our polynomials.

Now, we can do the same for the vectors, ending up with:

\tilde{b} (\vec{x}) = 3 (1 - x_{1}) (1 - x_{2}) + x_{1} (1 - x_{2}) + x_{2} (1 - x_{1}) + 0 = x_{2} x_{1} - 2 x_{1} - 2 x_{2} + 3 \tilde{c} (\vec{x}) = 6 (1 - x_{1}) (1 - x_{2}) + 0 + + x_{2} (1 - x_{1}) + 0 = - 5 x_{2} + x_{1} (5 x_{2} - 6) + 6

Now, we get from the

V

erifier our challenge

\vec{γ} = (γ_{1}, γ_{2}) = (- 1, 1)

Notice that the challenge gamma is composed by as many elements as variables we have in our polynomials.

Also, notice that gamma is a random challenge so it can have any value (not restricted to be in

F^{2}

The last MLE to compute is our

\tilde{e q} (\vec{x}, \vec{γ}) = (- x_{1} + 2 (1 - x_{2})) x_{2}

Run the sum-check against our multivariate polynomial

Now that we have computed all of our MLEs, it's time to run the sum-check.
First, notice that we need to have our full polynomial computed.

Since we already have our MLEs, we can simply operate and expand the computation:

\tilde{f} (\vec{x}) = ((\tilde{a} (\vec{x}) \cdot \tilde{b} (\vec{x})) - \tilde{c} (\vec{x})) \cdot \tilde{e q} (\vec{x}, \vec{γ})) = - 2 x_{2} x_{1}^{2} + 4 x_{1}^{2} + x_{2} x_{1} - 4 x_{1} + x_{2}

First round

Now, we have computed our

f (x)

polynomial. We should now run the sum of the non-fixed valiables over the Boolean Hypercube

H^{2}

.
This means, we have

x_{1}

fixed and we compute the sum over the Hypercube over

x_{2} \in {0, 1}

S_{0} (x) := \sum_{x_{2} \in {0, 1}} \tilde{f} (x_{1}, x_{2}) = S_{0} (x_{1}, x_{2} = 0) + S_{0} (x_{1}, x_{2} = 1) = where S_{0} (x_{1}, 0) = 4 x_{1}^{2} - 4 x_{1}, S_{0} (x_{1}, 1) = 2 x_{1}^{2} - 3 x_{1} + 1 = 6 x_{1}^{2} - 7 x_{1} + 1

The next thing to do now, is for the Verifier to check

The

P

rover sends

S_{0} (x)

to the

V

erifier who actually has 2 tasks:

Ensure that actually
$s_{0} (0) + s_{0} (1) = 0$ . This proves still the correct sum of
$H = 0$ . You can actually easily check that this holds
Check that indeed the degree of the polynomial
$s_{0}$ is at most the degree of the monomial at the same position as te variable left. ie.
$d e g (S_{0} (x)) = d e g (f [0] (x, y))$ ->
$d e g (- 2 x_{2} x_{1}^{2}) = d e g (6 x_{1}^{2})$

Second round

After the

V

erifier checked the correctness of both arguments, it samples yet another random challenge

r_{0} = 2

Knowing that, Prover computes:

S_{1} (x_{2}) \overset{}{=} \sum_{x_{2} \in {0, 1}} \tilde{f} (2, x_{2}) \overset{}{=} 8 - 5 x_{2} where : S_{1} (0) = 8, S_{1} (1) = 3

Now Prover sends this polynomial to the Verifier who needs to ensure:

$S_{1} (0) + S_{1} (1) = S_{0} (r_{0})$ .

$\sum_{x_{2} \in {0, 1}} 8 - 5 x_{2} = 11 = S_{0} (2) = 11$
Check that indeed the degree of the polynomial
$s_{1}$ is at most the degree of the monomial at the same position as te variable left. ie.
$d e g (s_{1} (x)) = d e g (f [1] (x, y))$

Last round

Now, we ran out of variables.

What do we do then? We simply ask the Verifier (let's say

r_{1} = 5

) and allow him to check that

f (r_{0}, r_{1}) = S_{1} (r_{1})

.
Which if we try to, results in:

f (r_{0}, r_{1}) = - 17 = S_{1} (r_{1}) = - 17

The Prod-Check protocol

Credits

Most of this writeup has been taken from J. Thaler's book and re-written with special enphasis on the areas I found more complex to follow.
I also took help from ChatGPT to find a good example for Multilinear extension inequality based on function inequality at a particular point.

Questions left

In the end of the Example round 2, when we prove
$s_{1} (0) + s_{1} (1) = r_{1}$ what are we specifically proving? We're simply adding rounds to the protocol so that the probability of finding polynomials that satisfy the relations enforced by previous evaluations which weren't correct is almost impossible right?
Would be nice to add a section on sumcheck combination (Multiple Sum-checks performed within one using whatever technique to mix them.)
How to non-interactive sum-check.
How does the prover know that the polynomial is multilinear if we are in ZK/non-interactive setting (polynomial is not sent, only commitments are).

We actually will ned x random values to be able to bind all initial claims that the prover does about the polynomial. One for each degree of it. This means that if we have 3 claims we can only sumcheck a multivariate polynomial of at most degree 3. Otherwise, prover will be missing values to complete the sum-check protocol
CPerezz

asn-d6

2023/05/25 10:25:22

actually

= s_0(r_1) ? or just r_1? (Edited)

Carlos Pérez

2023/10/16 10:19:02

That's correct. Thanks for the check!

Adria Massanet

2024/01/15 18:40:08

estension

extension

2024/01/16 12:57:22

polynomials.

vectors?

antoineF4C5

2025/01/29 15:53:57

Product( (x - x_j) / (x_i - x_j) )

2025/01/29 15:57:24

( -x_1 + 2*(1 - x_1) ) * x_2