The Sum-Check Protocol

In this note, we will explore two versions of the Sum-Check protocol: the vanilla protocol, as well as a useful generalization that we'll refer to as "Specialized Sum-Check".

The Sum-Check problem statement

Sum-check is an algorithm where the following statement is proven:

\sum_{x_{1} \in {0, 1}} \dots \sum_{x_{m} \in {0, 1}} g (x_{1}, \dots, x_{m}) = C_{1},

where

g : F^{m} \to F

is any m-variate polynomial, and

C_{1} \in F

is the result of computing the sum. The polynomial

g

is public (i.e. known to both the prover and verifier). Hence, the verifier could compute that sum itself. However, we often don't want the verifier to have to do all this work. The main benefit of sum-check is that the verifier can be convinced that the statement is true with very little work, at the cost of more work for the prover.

Throughout this note, we will use the following as a short-hand for the above statement:

\sum_{(x_{1}, \dots, x_{m}) \in {0, 1}^{m}} g (x_{1}, \dots, x_{m}) = C_{1} .

The Sum-Check Protocol

In this section, we present the algorithm. We will show why the algorithm properly proves the sum-check statement in the next section.

Sum-check is an interactive algorithm which can be made non-interactive using the Fiat-Shamir transform. The algorithm consists of

m

rounds, where in round

i

, the prover sends a univariate polynomial

s_{i} : F \to F

to the verifier, and the verifier responds with a field element

r_{i} \in_{R} F

sampled uniformly at random. After the last round, the verifier evaluates

g

at a single point, runs a final check, after which it is convinced that the statement is true.

Let's take a closer look at what happens in a round.

round 1

Prover

The prover constructs the univariate polynomial

s_{1} (x_{1}) = \sum_{(x_{2}, \dots, x_{m}) \in {0, 1}^{m - 1}} g (x_{1}, x_{2}, \dots, x_{m}) .

In words,

s_{1} : F \to F

is defined as the sum of

g

over all variables except

x_{1}

The prover sends

s_{1}

to the verifier.

Verifier

The verifier checks

s_{1} (0) + s_{1} (1) = C_{1} .

Then, the verifier samples

r_{1} \in_{R} F

uniformly at random and sends it to the prover.

round 2

Prover

The prover constructs the univariate polynomial

s_{2} (x_{2}) = \sum_{(x_{3}, \dots, x_{m}) \in {0, 1}^{m - 2}} g (r_{1}, x_{2}, x_{3}, \dots, x_{m}) .

The prover sends

s_{2}

to the verifier.

Verifier

The verifier checks

s_{2} (0) + s_{2} (1) = s_{1} (r_{1}) .

Then, the verifier samples

r_{2} \in_{R} F

uniformly at random and sends it to the prover.

round
$i$ ,
$3 \leq i \leq m$

Prover

The prover constructs the univariate polynomial

s_{i} (x_{i}) = \sum_{(x_{i + 1}, \dots, x_{m}) \in {0, 1}^{m - i}} g (r_{1}, \dots, r_{i - 1}, x_{i}, x_{i + 1}, \dots, x_{m}) .

The prover sends

s_{i}

to the verifier.

Verifier

The verifier checks

s_{i} (0) + s_{i} (1) = s_{i - 1} (r_{i - 1}) .

Then, the verifier samples

r_{i} \in_{R} F

uniformly at random and sends it to the prover.

Verifier last check

In round

m

, the verifier received

s_{m} (x_{m})

from the prover, and sampled

r_{m} \in_{R} F

. Now, the verifier checks

g (r_{0}, \dots, r_{m}) = s_{m} (r_{m})

Note that this is the only evaluation of

g

by the verifier in the entire protocol.

This completes the protocol. If the check passes, then the verifier is convinced that the original statement is true.

Sum-Check: Analysis

In this section, we will show why the protocol as described in the previous section is sound.

The first realization to make is that the protocol is recursive: each round reduces its statement to a new statement that can itself be proved using sum-check. That is, rounds are linked together by the following relationship: if the statement at round

i + 1

is correct, then the statement at round

i

is correct.

Before we dive in further, let's introduce some useful notation. Given a function

f (x_{1}, x_{2}) : F^{2} \to F

, then the function

f_{r_{1}} (x_{2})

is a function such that

f_{r_{1}} (x_{2}) = f (r_{1}, x_{2}) .

That is,

f_{r_{1}} : F \to F

is a function of one less variable than

f

. We say that the variable

x_{1}

is bound to

f

(with value

r_{1}

) to yield the new function

f_{r_{1}}

. Intuitively, all this does is fix the evaluation of the variable

x_{1}

to always be

r_{1}

. Naturally, this is not limited to functions of 2 variables.

We are now ready to show why the sum-check protocol properly convinces the verifier that the sum-check problem statement being proved is true. Although not a formal proof, this will be in a format similar to a proof by induction: we will prove the recursive (or "inductive") case, and the base case. We move away from the formal proof format to help convey the intuition behind the proof rather than being formally correct. The recursive case covers how each round reduces its statement to a "smaller" statement, that is also provable using the sum-check protocol. The base case covers the part of the protocol after the last round is over.

Recursive case

To be more concrete, we will look at how the statement in round 1 is reduced to the statement in round 2. However, the idea is the same for every round.

Recall that the statement to prove in round 1 is:

\sum_{(x_{1}, \dots, x_{m}) \in {0, 1}^{m}} g (x_{1}, \dots, x_{m}) = C_{1},

while the statement to prove in round 2 is:

\sum_{(x_{2}, \dots, x_{m}) \in {0, 1}^{m - 1}} g_{r_{1}} (x_{2}, \dots, x_{m}) = s_{1} (r_{1}) .

There are three key things to notice:

the sum is over one less variable in round 2 than it was in round 1,
$g$ has been replaced with
$g_{r_{1}}$ ,
the right-hand side is a scalar value (i.e.
$s_{1} (r_{1}) \in F$ ), which we can name
$C_{2}$ .

The statement in round 2 is a valid sum-check problem, to which we can apply the sum-check protocol!

Now, crucially, let's see why the statement in round 1 reduces to the statement in round 2. That is, if the statement in round 2 is true, then the statement in round 1 is true.

Assume that the second statement is true. That is,

\sum_{(x_{2}, \dots, x_{m}) \in {0, 1}^{m - 1}} g_{r_{1}} (x_{2}, \dots, x_{m}) = s_{1} (r_{1}) .

Then, since

r_{1}

was sampled uniformly at random, by the Schwartz-Zippel lemma,

\sum_{(x_{2}, \dots, x_{m}) \in {0, 1}^{m - 1}} g_{x_{1}} (x_{2}, \dots, x_{m}) = s_{1} (x_{1}) \forall x_{1} \in F .

Or equivalently,

\sum_{(x_{2}, \dots, x_{m}) \in {0, 1}^{m - 1}} g (x_{1}, x_{2}, \dots, x_{m}) = s_{1} (x_{1}) \forall x_{1} \in F .

That is, we can now treat the entire polynomials on the left-hand side and the right-hand side as equal. Then,

\begin{aligned} s_{1} (0) + s_{1} (1) & = (\sum_{(x_{2}, \dots, x_{m}) \in {0, 1}^{m - 1}} g (0, x_{2}, \dots, x_{m})) + (\sum_{(x_{2}, \dots, x_{m}) \in {0, 1}^{m - 1}} g (1, x_{2}, \dots, x_{m})) \\ = \sum_{(x_{2}, \dots, x_{m}) \in {0, 1}^{m - 1}} g (0, x_{2}, \dots, x_{m}) + g (1, x_{2}, \dots, x_{m}) \\ = \sum_{(x_{1}, \dots, x_{m}) \in {0, 1}^{m}} g (x_{1}, \dots, x_{m}), \end{aligned}

which is the left-hand side of the statement in round 1! Hence, the statement in round 1 can be written as

s_{1} (0) + s_{1} (1) = C_{1}

. Unsurprisingly, this is exactly the check that the verifier does in the protocol!

To recap, when we assume that the statement in round 2 is true, then the statement in round 1 can be rewritten as

s_{1} (0) + s_{1} (1) = C_{1}

. So after the verifier runs this check, all that is left to show is that the statement in round 2 is actually true (which will be carried out in the next round of the protocol). This is another way of saying that the statement in round 1 does indeed reduce to the statement in round 2, which was our initial goal.

Next, we will look at the base case, which occurs after round

m

Base case

The statement left to prove after round

m

is:

g_{r_{1}, \dots, r_{m}} = s_{m} (r_{m})

That is, there is no more sum left, and all the variables are bound. Recall that in round

m

, the verifier received the polynomial

s_{m}

, and randomly sampled

r_{m} \in_{R} F

Now, this check is trivially done by the verifier. The verifier evaluates

g (r_{1}, \dots, r_{m})

to yield the left-hand side of the statement, and checks if it is equal to

s_{m} (r_{m})

Since this was the last statement to prove in the protocol, then the verifier is convinced that the original statement in round 1 is correct!

The Specialized Sum-Check protocol

We will now look at a specialization of the problem statement that turns out to be quite useful in practice. We call it a "specialization" because we make more assumptions about the structure of the inner sum than in the traditional sum-check protocol.

Problem statement

The sum-check problem statement can be specialized to:

\sum_{(x_{1}, \dots, x_{m}) \in {0, 1}^{m}} g (f_{1} (x_{1}, \dots, x_{m}), \dots, f_{n} (x_{1}, \dots, x_{m})) = C_{1},

where

f_{1}, \dots, f_{n}

are multilinear polynomials. Notice that when

f_{i} (x_{1}, \dots, x_{m}) = x_{i}

, then we recover the vanilla problem statement explored in the previous sections.

The protocol

The protocol works exactly the same as the vanilla sum-check protocol. The

s_{i}

polynomials are defined similarly:

s_{1} (x_{1}) = \sum_{(x_{2}, \dots, x_{m}) \in {0, 1}^{m - 1}} g (f_{1} (x_{1}, x_{2}, \dots, x_{m}), \dots, f_{n} (x_{1}, x_{2}, \dots, x_{m})),

s_{2} (x_{2}) = \sum_{(x_{3}, \dots, x_{m}) \in {0, 1}^{m - 2}} g ({f_{1}}_{r_{1}} (x_{2}, x_{3}, \dots, x_{m}), \dots, {f_{n}}_{r_{1}} (x_{2}, x_{3}, \dots, x_{m})),

\dots

Analysis

The argument for why the specialized sum-check protocol is sound is actually the same as for the previous one. We will briefly look at the recursive case, again by focusing on how the problem statement in round 1 reduces to the problem statement in round 2.

Assume that the problem statement in round 2 is true. That is,

\sum_{(x_{2}, \dots, x_{m}) \in {0, 1}^{m - 1}} g ({f_{1}}_{r_{1}} (x_{2}, \dots, x_{m}), \dots, {f_{n}}_{r_{1}} (x_{2}, \dots, x_{m})) = s_{1} (r_{1}),

This is actually the same state as in the vanilla problem. To make this a bit more clear, let's rewrite the summand slightly. We define

h_{r_{1}} (x_{2}, \dots, x_{m})

h_{r_{1}} (x_{2}, \dots, x_{m}) = g ({f_{1}}_{r_{1}} (x_{2}, \dots, x_{m}), \dots, {f_{n}}_{r_{1}} (x_{2}, \dots, x_{m}))

Notice that

h_{r_{1}}

is nonlinear in all its variables. Then, we can rewrite the problem statement in round 2 as

\sum_{(x_{2}, \dots, x_{m}) \in {0, 1}^{m - 1}} h_{r_{1}} (x_{2}, \dots, x_{m}) = s_{1} (r_{1}) .

Hopefully, it is now clear that since this is the same form as in the vanilla sum-check problem, then the exact same argument for the recursive case applies.

As for the base case, the verifier needs to check

g (f_{1} (r_{1}, \dots, r_{m}), \dots, f_{n} (r_{1}, \dots, r_{m})) = s_{m} (r_{m}) .

Efficient evaluation of round polynomials
$s_{i}$

This specialization is useful in practice, in part because there exists an efficient algorithm for the prover to construct the polynomials

s_{i}

at each round. We go over the algorithm in great detail in this note.

lucasxia01

2024/08/12 18:39:44

evaluates

usually, the prover evaluates this and proves via poly commitment that this evaluation is correct. I don't think this step is succinct otherwise

plafer

2024/08/16 11:44:17

This is a good point. I believe the sum-check literature leaves it open whether or not the protocol is succinct - maybe linear verifier time is acceptable for your application. The verifier cost is `O(m) + a single oracle query to g` - you get to decide whether you let the verifier evaluate `g` directly, or use any other protocol to help the verifier do that more efficiently. Note that in GKR, for all inner layers, this "other protocol" is another round of sum-check.

2024/08/12 18:46:23

check

I feel like this is the big difference between the specialization and the vanilla case because the verifier needs to get an evaluation from each individual f_i and compute g on these evaluations to check these instead of checking the evaluation on h.

2024/08/16 11:57:25

Actually from the specialization doesn't make any difference from the verifier's perspective. I see it as the verifier evaluating `h`, where it happens that to evaluate `h`, you first evaluate the `f_i`s, and then `g`. The additional structure that we assume in this specialization ends up being useful to the prover though - as discussed in the next section, the prover is able to evaluate the round polynomials more efficiently by taking advantage of that structure. But this is only because it needs to evaluate `h` at many points, and is able to reuse work between these evaluations. For the verifier, I don't see any benefit however, since it only needs to evaluate `h` at 1 point.

The Sum-Check problem statement

The Sum-Check Protocol

round 1

Prover

Verifier

round 2

Prover

Verifier

round i, 3≤i≤m

Prover

Verifier

Verifier last check

Sum-Check: Analysis

Recursive case

Base case

The Specialized Sum-Check protocol

Problem statement

The protocol

Analysis

Efficient evaluation of round polynomials si

Read more

Basefold

LogUp-GKR

LogUp

LogUp-GKR: The Air constraints

round
$i$ ,
$3 \leq i \leq m$

Efficient evaluation of round polynomials
$s_{i}$