Batched Sumcheck explanation

Lagrange Polynomial basics

For

0 \leq i < 2^{n}

, define its bit representation as

(i_{0}, \dots, i_{n - 1})

such that

i = \sum_{k = 0}^{n - 1} i_{k} \cdot 2^{n - k - 1}

Note that for

n^{'} < n

, if

i < 2^{n^{'}}

then its bit representation will look like

(0, \dots, 0, i_{n - n^{'}}, \dots, i_{n - 1})

, with

n - n^{'}

zeros in front.
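
As a quick check of this indexing convention, here is a minimal Python sketch of the big-endian bit representation. The helper name `bits` is our own choice for illustration and does not come from any library.

```python
def bits(i: int, n: int) -> list[int]:
    """Return (i_0, ..., i_{n-1}) with i = sum_k i_k * 2^(n - k - 1)."""
    return [(i >> (n - k - 1)) & 1 for k in range(n)]

print(bits(5, 3))  # [1, 0, 1]
print(bits(5, 6))  # [0, 0, 0, 1, 0, 1]: three leading zeros, since 5 < 2^3
```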

The Lagrange polynomials over

\{0, 1\}^{n}

are indexed by

0 \leq i < 2^{n}

, where

\begin{aligned} L_{i} (X_{0}, \dots, X_{n - 1}) & = eq (i_{0}, \dots, i_{n - 1}, X_{0}, \dots, X_{n - 1}) \\ & = eq (i_{0}, \dots, i_{n - n^{'} - 1}, X_{0}, \dots, X_{n - n^{'} - 1}) \cdot eq (i_{n - n^{'}}, \dots, i_{n - 1}, X_{n - n^{'}}, \dots, X_{n - 1}) \end{aligned}

The Lagrange polynomials where

0 \leq i < 2^{n^{'}}

can be factored as

\begin{aligned} L_{i} (X_{0}, \dots, X_{n - 1}) & = eq (i_{0}, \dots, i_{n - n^{'} - 1}, X_{0}, \dots, X_{n - n^{'} - 1}) \cdot eq (i_{n - n^{'}}, \dots, i_{n - 1}, X_{n - n^{'}}, \dots, X_{n - 1}) \\ & = eq (0, \dots, 0, X_{0}, \dots, X_{n - n^{'} - 1}) \cdot eq (i_{n - n^{'}}, \dots, i_{n - 1}, X_{n - n^{'}}, \dots, X_{n - 1}) \\ & = (\prod_{k = 0}^{n - n^{'} - 1} (1 - X_{k})) \cdot (\prod_{k = n - n^{'}}^{n - 1} ((1 - i_{k}) (1 - X_{k}) + i_{k} \cdot X_{k})) \\ & = L_{0} (X_{0}, \dots, X_{n - n^{'} - 1}) \cdot L_{i} (X_{n - n^{'}}, \dots, X_{n - 1}) \end{aligned}

Here, we use the convention that a Lagrange polynomial in

m

variables is indexed over all

0 \leq i < 2^{m}
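
The factorization can be checked numerically. The sketch below is only illustrative: the prime modulus `P`, the sample point `x`, and the helper names `bits`, `eq` and `lagrange` are assumptions made for this example.

```python
P = 2**61 - 1  # illustrative prime modulus (an assumption, not fixed by the text)

def bits(i, n):
    """Big-endian bits (i_0, ..., i_{n-1}) of i."""
    return [(i >> (n - k - 1)) & 1 for k in range(n)]

def eq(a, x):
    """eq(a, x) = prod_k ((1 - a_k)(1 - x_k) + a_k * x_k) mod P."""
    acc = 1
    for ak, xk in zip(a, x):
        acc = acc * ((1 - ak) * (1 - xk) + ak * xk) % P
    return acc

def lagrange(i, x):
    """L_i in len(x) variables, evaluated at x."""
    return eq(bits(i, len(x)), x)

n, n_small = 5, 2
x = [3, 1, 4, 1, 5]  # arbitrary evaluation point
split = n - n_small
for i in range(2**n_small):
    # L_i(X_0..X_{n-1}) = L_0(X_0..X_{n-n'-1}) * L_i(X_{n-n'}..X_{n-1})
    assert lagrange(i, x) == lagrange(0, x[:split]) * lagrange(i, x[split:]) % P
```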

The multi-linear extension of a polynomial over the

n

-th boolean hypercube is

P (X_{0}, \dots, X_{n - 1}) = \sum_{i = 0}^{2^{n} - 1} P_{i} \cdot L_{i} (X_{0}, \dots, X_{n - 1})

If the polynomial only has non-zero values at the first

2^{n^{'}}

evaluation points, then the factorization of the Lagrange polynomials still applies.

\begin{aligned} P (X_{0}, \dots, X_{n - 1}) & = \sum_{i = 0}^{2^{n} - 1} P_{i} \cdot L_{i} (X_{0}, \dots, X_{n - 1}) \\ & = \sum_{i = 0}^{2^{n^{'}} - 1} P_{i} \cdot L_{i} (X_{0}, \dots, X_{n - 1}) \\ & = \sum_{i = 0}^{2^{n^{'}} - 1} P_{i} \cdot L_{0} (X_{0}, \dots, X_{n - n^{'} - 1}) \cdot L_{i} (X_{n - n^{'}}, \dots, X_{n - 1}) \\ & = L_{0} (X_{0}, \dots, X_{n - n^{'} - 1}) \cdot \sum_{i = 0}^{2^{n^{'}} - 1} P_{i} \cdot L_{i} (X_{n - n^{'}}, \dots, X_{n - 1}) \end{aligned}
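
To make this concrete, the following sketch evaluates a zero-padded evaluation list directly and compares it with the factored form. The modulus `P`, the sample values, and the helper `mle_eval` are assumptions chosen for illustration.

```python
P = 2**61 - 1  # illustrative prime modulus (an assumption)

def mle_eval(evals, x):
    """Evaluate the multilinear extension of `evals` at x, binding X_0 (the top bit) first."""
    table = list(evals)
    for xk in x:
        half = len(table) // 2
        table = [(table[j] + xk * (table[j + half] - table[j])) % P for j in range(half)]
    return table[0]

n, n_small = 4, 2
small = [7, 2, 9, 4]                          # the 2^{n'} non-zero evaluations
padded = small + [0] * (2**n - 2**n_small)    # explicit zero padding
x = [3, 8, 5, 6]

l0 = 1                                        # L_0(x_0, ..., x_{n-n'-1})
for xk in x[: n - n_small]:
    l0 = l0 * (1 - xk) % P

assert mle_eval(padded, x) == l0 * mle_eval(small, x[n - n_small:]) % P
```

The right-hand side only touches the short list, which is the reason the zeros never need to be allocated.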

An important fact about Lagrange polynomials is that they sum to 1:

1 = \sum_{i = 0}^{2^{m} - 1} L_{i} (X_{0}, \dots, X_{m - 1})
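
A small numerical check of this identity, assuming an illustrative prime modulus `P` and a hypothetical helper `lagrange`:

```python
P = 2**61 - 1  # illustrative prime modulus (an assumption)

def lagrange(i, x):
    """L_i(x) in m = len(x) variables, using big-endian bits of i."""
    m = len(x)
    acc = 1
    for k, xk in enumerate(x):
        bit = (i >> (m - k - 1)) & 1
        acc = acc * (xk if bit else 1 - xk) % P
    return acc

x = [3, 1, 4]  # arbitrary point, not restricted to the hypercube
total = sum(lagrange(i, x) for i in range(2 ** len(x))) % P
assert total == 1
```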

If we have a polynomial over

n^{'} < n

variables

P^{'} (X_{0}, \dots, X_{n^{'} - 1}) = \sum_{i = 0}^{2^{n^{'}} - 1} P_{i}^{'} \cdot L_{i} (X_{0}, \dots, X_{n^{'} - 1})

The canonical way to "lift" this polynomial to

n

variables, is to consider

\begin{aligned} P (X_{0}, \dots, X_{n - 1}) & = P^{'} (X_{n - n^{'}}, \dots, X_{n - 1}) \\ & = \sum_{i = 0}^{2^{n^{'}} - 1} P_{i}^{'} \cdot L_{i} (X_{n - n^{'}}, \dots, X_{n - 1}) \\ & = \sum_{i = 0}^{2^{n} - 1} P_{i} \cdot L_{i} (X_{0}, \dots, X_{n - 1}) \end{aligned}

Let's see how the list of evaluations

\{P_{i}\}_{i = 0}^{2^{n} - 1}

relates to the initial ones

\{P_{i}^{'}\}_{i = 0}^{2^{n^{'}} - 1}

For any

0 \leq i^{'} < 2^{n^{'}}

and

0 \leq j < 2^{n - n^{'}}

, we can write an index

0 \leq i < 2^{n}

as

i = i^{'} + j \cdot 2^{n^{'}}

, and its bit decomposition is

(j_{0}, \dots, j_{n - n^{'} - 1}, i_{0}^{'}, \dots, i_{n^{'} - 1}^{'})

.
The

i

-th evaluation of

P

is equal to the

i^{'}

-th evaluation of

P^{'}

\begin{aligned} P_{i} & = P_{i^{'} + j \cdot 2^{n^{'}}} \\ & = P (j_{0}, \dots, j_{n - n^{'} - 1}, i_{0}^{'}, \dots, i_{n^{'} - 1}^{'}) \\ & = P^{'} (i_{0}^{'}, \dots, i_{n^{'} - 1}^{'}) \\ & = P_{i^{'}}^{'} \end{aligned}

In other words, the list of evaluations of

P

is equal to

2^{n - n^{'}}

repetitions of the list of evaluations of

P^{'}

P_{i} = P_{i \bmod 2^{n^{'}}}^{'}
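
The repetition structure can be sketched as follows. The modulus `P`, the sample values, and the helper `mle_eval` are assumptions for illustration only.

```python
P = 2**61 - 1  # illustrative prime modulus (an assumption)

def mle_eval(evals, x):
    """Evaluate the multilinear extension of `evals` at x, binding X_0 (the top bit) first."""
    table = list(evals)
    for xk in x:
        half = len(table) // 2
        table = [(table[j] + xk * (table[j + half] - table[j])) % P for j in range(half)]
    return table[0]

n, n_small = 4, 2
small = [7, 2, 9, 4]                      # evaluations of P'
lifted = small * 2 ** (n - n_small)       # evaluations of P: 2^{n-n'} repetitions

# P_i = P'_{i mod 2^{n'}}
assert all(lifted[i] == small[i % 2**n_small] for i in range(2**n))

# The lifted polynomial ignores the first n - n' variables.
x = [3, 8, 5, 6]
assert mle_eval(lifted, x) == mle_eval(small, x[n - n_small:])
```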

Commitment

We have a combination function

F : \mathbb{F} \times \mathbb{F} \to \mathbb{F}

and two pairs of MLE polynomials

(P_{1}, Q_{1})

(P_{2}, Q_{2})

with

2^{n_{1}}

and

2^{n_{2}}

evaluations respectively. Let

n = \max \{n_{1}, n_{2}\}

Given

P_{b}

(respectively

Q_{b}

) with

2^{n_{b}}

evaluations (

b = 1, 2

), we interpret it as a polynomial with

n

variables, such that when we list out its evaluations in the "canonical" order, the

2^{n_{b}}

non-zero evaluations are first. We can define it as:

P_{b}^{'} (X_{0}, \dots, X_{n - 1}) = L_{0} (X_{0}, \dots, X_{n - n_{b} - 1}) P_{b} (X_{n - n_{b}}, \dots, X_{n - 1})

In order to open both pairs of polynomials using the same PCS opening argument, we need to commit to

P_{b}^{'}

, obtained by computing the MSM with the first

2^{n_{b}}

group elements in the commitment key.

The interpretation of

P_{b}^{'}

allows us to avoid allocating zeros at the end of the list of evaluations of

P_{b}
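
The point about the MSM can be illustrated with a toy model. In the sketch below, integers modulo a prime stand in for group elements and `key` is a made-up commitment key; this is not a secure commitment scheme and only shows that trailing zeros contribute nothing to the sum.

```python
Q = 2**61 - 1  # toy modulus standing in for the group order (an assumption)

def msm(scalars, key):
    """Toy multi-scalar multiplication: sum_i scalars[i] * key[i] mod Q."""
    return sum(s * g for s, g in zip(scalars, key)) % Q

n, n_b = 4, 2
key = [pow(3, k + 1, Q) for k in range(2**n)]   # hypothetical commitment key
small = [7, 2, 9, 4]                            # the 2^{n_b} evaluations of P_b
padded = small + [0] * (2**n - 2**n_b)          # evaluations of P_b' with explicit zeros

# Committing to P_b' only needs the first 2^{n_b} key elements.
assert msm(small, key[: 2**n_b]) == msm(padded, key)
```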

Sumcheck

In the context of Sumcheck, we use a different interpretation of

P_{b}

to avoid padding.

We want to batch the claims

σ_{b} = \sum_{x \in \{0, 1\}^{n_{b}}} F (P_{b} (x), Q_{b} (x))

using a random linear combination challenge

γ

In this context, we run Sumcheck on the polynomials

P_{b}^{″} (X_{0}, \dots, X_{n - 1}) = P_{b} (X_{n - n_{b}}, \dots, X_{n - 1})

This representation is equivalent to taking the list of evaluations of

P_{b}

, and repeating them

2^{n - n_{b}}

times such that

P_{b}^{″}

is of length

2^{n}

. Again, we want to avoid having to allocate that much space, especially for repeated values, so we need to "imagine" these repetitions instead.

Sumcheck works by partially evaluating the claims at the challenge variables

r = (r_{0}, \dots, r_{n - 1})

If we instead apply the same Sumcheck combination on the repeated polynomials, we notice that the sum we get is the original one, scaled by

2^{n - n_{b}}

\begin{aligned} \sum_{x \in \{0, 1\}^{n}} F (P_{b}^{″} (x), Q_{b}^{″} (x)) & = \sum_{y \in \{0, 1\}^{n - n_{b}}} \sum_{x \in \{0, 1\}^{n_{b}}} F (P_{b} (x), Q_{b} (x)) \\ & = \sum_{y \in \{0, 1\}^{n - n_{b}}} σ_{b} \\ & = 2^{n - n_{b}} \cdot σ_{b} \end{aligned}
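
A small numerical check of this scaling, assuming for illustration that the combination function is the product `F(p, q) = p * q` over a prime field with modulus `P` (both are assumptions, as are the sample values):

```python
P = 2**61 - 1  # illustrative prime modulus (an assumption)

def F(p, q):
    """Example combination function (an assumption): the product."""
    return p * q % P

n, n_b = 4, 2
p_evals = [7, 2, 9, 4]
q_evals = [1, 5, 3, 8]

sigma = sum(F(p, q) for p, q in zip(p_evals, q_evals)) % P

reps = 2 ** (n - n_b)
lifted_sum = sum(F(p, q) for p, q in zip(p_evals * reps, q_evals * reps)) % P

assert lifted_sum == reps * sigma % P
print(sigma, lifted_sum)  # 76 304
```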

In rounds

0 \leq i < n - n_{b}

, the prover computes the univariate polynomial for this claim, given by:

\begin{aligned} S_{b}^{(i)} (X_{i}) & = \sum_{x \in \{0, 1\}^{n - i - 1}} F (P_{b}^{″} (r_{0}, \dots, r_{i - 1}, X_{i}, x), Q_{b}^{″} (r_{0}, \dots, r_{i - 1}, X_{i}, x)) \\ & = \sum_{y \in \{0, 1\}^{n - i - 1 - n_{b}}} \sum_{x \in \{0, 1\}^{n_{b}}} F (P_{b} (x), Q_{b} (x)) \\ & = 2^{n - i - 1 - n_{b}} \cdot σ_{b} \end{aligned}

This polynomial is constant, and is equal to the initial claim scaled by

2^{n - i - 1 - n_{b}}

When we bind the polynomial to

r_{i}

, it does not affect our list of evaluations, since

P_{b}^{″} (r_{0}, \dots, r_{i - 1}, r_{i}, X_{i + 1}, \dots, X_{n - 1}) = P_{b} (X_{n - n_{b}}, \dots, X_{n - 1})
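
The behaviour in these early rounds can be checked by brute force on the explicitly repeated tables. This is a sketch under assumptions: `F` is taken to be the product, `P` is an illustrative modulus, the challenges are fixed constants, and `bind` and `round_poly_at` are hypothetical helper names.

```python
P = 2**61 - 1  # illustrative prime modulus (an assumption)

def F(p, q):
    """Example combination function (an assumption): the product."""
    return p * q % P

def bind(table, r):
    """Fix the first remaining variable (the top bit) to r."""
    half = len(table) // 2
    return [(table[j] + r * (table[j + half] - table[j])) % P for j in range(half)]

def round_poly_at(p_tab, q_tab, t):
    """S(t): sum of F over the remaining hypercube with the current variable set to t."""
    return sum(F(p, q) for p, q in zip(bind(p_tab, t), bind(q_tab, t))) % P

n, n_b = 4, 2
p_small, q_small = [7, 2, 9, 4], [1, 5, 3, 8]
sigma = sum(F(p, q) for p, q in zip(p_small, q_small)) % P

p_tab = p_small * 2 ** (n - n_b)   # explicit repetitions, for checking only
q_tab = q_small * 2 ** (n - n_b)
challenges = [11, 13]              # stand-ins for r_0, ..., r_{n - n_b - 1}

for i, r in enumerate(challenges):
    expected = 2 ** (n - i - 1 - n_b) * sigma % P
    # The round polynomial is constant: same value at 0, 1 and 2.
    assert all(round_poly_at(p_tab, q_tab, t) == expected for t in (0, 1, 2))
    p_tab, q_tab = bind(p_tab, r), bind(q_tab, r)

# After the padding rounds, only the original evaluations remain.
assert p_tab == p_small and q_tab == q_small
```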

For rounds

n - n_{b} \leq i < n

, the protocol will start evaluating the "active" variables of

P_{b}

, and the Sumcheck prover simply continues as previously, without having to do any scaling.

During the protocol, the prover will have sent the univariate polynomials

S^{(i)} (X_{i}) = S_{1}^{(i)} (X_{i}) + γ \cdot S_{2}^{(i)} (X_{i})

and at the end, sends the evaluations

p_{b}^{″} = P_{b}^{″} (r_{0}, \dots, r_{n - 1}) = P_{b} (r_{n - n_{b}}, \dots, r_{n - 1}), q_{b}^{″} = Q_{b}^{″} (r_{0}, \dots, r_{n - 1}) = Q_{b} (r_{n - n_{b}}, \dots, r_{n - 1}) .

The verifier, having processed this data through the transcript, does the following:

- Set $e^{(0)} = 2^{n - n_{1}} \cdot σ_{1} + γ \cdot 2^{n - n_{2}} \cdot σ_{2}$
- For each $i = 0, 1, \dots, n - 1$:
  - Check $e^{(i)} = S^{(i)} (0) + S^{(i)} (1)$
  - Set $e^{(i + 1)} = S^{(i)} (r_{i})$
- Check $e^{(n)} = F (p_{1}^{″}, q_{1}^{″}) + γ \cdot F (p_{2}^{″}, q_{2}^{″})$
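
The steps above, together with the constant-polynomial shortcut for the padding rounds, can be sketched end to end. This is a simplified illustration, not a definitive implementation: prover and verifier are merged into one function, `F` is assumed to be the product (so round polynomials have degree at most 2), and `γ` and the challenges `r_i` are passed in as fixed values instead of being derived from a transcript. All names are hypothetical.

```python
P = 2**61 - 1  # illustrative prime modulus (an assumption)

def F(p, q):
    return p * q % P  # example combination function (an assumption)

def bind(table, r):
    """Fix the first remaining variable (the top bit) to r."""
    half = len(table) // 2
    return [(table[j] + r * (table[j + half] - table[j])) % P for j in range(half)]

def interpolate_deg2(ys, r):
    """Evaluate at r the polynomial of degree <= 2 taking values ys at 0, 1, 2."""
    y0, y1, y2 = ys
    inv2 = pow(2, -1, P)
    l0 = (r - 1) * (r - 2) % P * inv2 % P
    l1 = r * (2 - r) % P
    l2 = r * (r - 1) % P * inv2 % P
    return (y0 * l0 + y1 * l1 + y2 * l2) % P

def round_evals(p_tab, q_tab, sigma, n, n_b, i):
    """Values of S_b^{(i)} at 0, 1, 2, without materialising the repetitions."""
    if i < n - n_b:  # padding round: constant polynomial 2^{n-i-1-n_b} * sigma_b
        c = pow(2, n - i - 1 - n_b, P) * sigma % P
        return [c, c, c]
    return [sum(F(p, q) for p, q in zip(bind(p_tab, t), bind(q_tab, t))) % P
            for t in (0, 1, 2)]

def batched_sumcheck(claims, gamma, challenges):
    """claims: list of (p_evals, q_evals). Returns True if the verifier accepts."""
    sizes = [len(p).bit_length() - 1 for p, _ in claims]   # n_b per claim
    n = max(sizes)
    assert len(challenges) == n
    tabs = [(list(p), list(q)) for p, q in claims]
    sigmas = [sum(F(a, b) for a, b in zip(p, q)) % P for p, q in claims]
    weights = [pow(gamma, b, P) for b in range(len(claims))]  # 1, gamma, ...
    # e^{(0)} = sum_b gamma^b * 2^{n - n_b} * sigma_b
    e = sum(w * pow(2, n - nb, P) * s for w, nb, s in zip(weights, sizes, sigmas)) % P
    for i, r in enumerate(challenges):
        S = [0, 0, 0]
        for w, (p, q), nb, s in zip(weights, tabs, sizes, sigmas):
            ev = round_evals(p, q, s, n, nb, i)
            S = [(a + w * b) % P for a, b in zip(S, ev)]
        if e != (S[0] + S[1]) % P:       # check e^{(i)} = S(0) + S(1)
            return False
        e = interpolate_deg2(S, r)       # e^{(i+1)} = S(r_i)
        # Only bind tables whose variables are already "active".
        tabs = [(bind(p, r), bind(q, r)) if i >= n - nb else (p, q)
                for (p, q), nb in zip(tabs, sizes)]
    final = sum(w * F(p[0], q[0]) for w, (p, q) in zip(weights, tabs)) % P
    return e == final

claims = [([7, 2, 9, 4], [1, 5, 3, 8]), (list(range(1, 17)), [2] * 16)]
print(batched_sumcheck(claims, gamma=5, challenges=[11, 13, 17, 19]))  # True
```

Note that the smaller claim is never expanded to length 2^n: its table is left untouched during the padding rounds and only bound once its own variables become active.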

The parties now run a PCS argument to check the evaluations

p_{b}^{″}, q_{b}^{″}

PCS

The evaluations returned by the prover are for the polynomials

P_{b}^{″}, Q_{b}^{″}

. Note however that we have committed to the polynomials

P_{b}^{'}, Q_{b}^{'}

as described in the Commitment section.

We can transform the evaluation

u_{b}^{″} = p_{b}^{″}

of

P_{b}^{″}

at

(r_{0}, \dots, r_{n - 1})

into an evaluation

u_{b}^{'}

of

P_{b}^{'}

at the same point, by setting

\begin{aligned} u_{b}^{'} & = P_{b}^{'} (r_{0}, \dots, r_{n - 1}) \\ & = L_{0} (r_{0}, \dots, r_{n - n_{b} - 1}) \cdot P_{b} (r_{n - n_{b}}, \dots, r_{n - 1}) \\ & = L_{0} (r_{0}, \dots, r_{n - n_{b} - 1}) \cdot u_{b}^{″} \\ & = u_{b}^{″} \cdot \prod_{i = 0}^{n - n_{b} - 1} (1 - r_{i}) . \end{aligned}

This simply involves rescaling the original Sumcheck evaluation by the first Lagrange polynomial, evaluated at the challenges for the "missing" first variables of

P_{b}^{″}
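
The rescaling can be verified against a direct evaluation of the zero-padded polynomial. The modulus `P`, the sample values, the fixed challenge point `r`, and the helper `mle_eval` are assumptions for this sketch.

```python
P = 2**61 - 1  # illustrative prime modulus (an assumption)

def mle_eval(evals, x):
    """Evaluate the multilinear extension of `evals` at x, binding X_0 (the top bit) first."""
    table = list(evals)
    for xk in x:
        half = len(table) // 2
        table = [(table[j] + xk * (table[j + half] - table[j])) % P for j in range(half)]
    return table[0]

n, n_b = 4, 2
small = [7, 2, 9, 4]              # evaluations of P_b
r = [11, 13, 17, 19]              # stand-in for the Sumcheck challenge point

# Evaluation of the repeated polynomial, as returned by Sumcheck.
u_repeated = mle_eval(small, r[n - n_b:])

# Rescale by L_0(r_0, ..., r_{n - n_b - 1}) = prod (1 - r_i).
scale = 1
for ri in r[: n - n_b]:
    scale = scale * (1 - ri) % P
u_committed = u_repeated * scale % P

# This matches the evaluation of the committed, zero-padded polynomial.
padded = small + [0] * (2**n - 2**n_b)
assert u_committed == mle_eval(padded, r)
```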

We can then batch all polynomial evaluation instances into a single instance of size

n
