An Overview about FIPS 203: Module-Lattice-based Key-Encapsulation-Mechanism

Introduction

In August 13th, 2024, NIST announced three post-quantum cryptography standards, where two of them are from lattice-based cryptography and the other one is from hash-based cryptography. In this post, we will have an overview about FIPS 203, which specify algorithms derived from CRYSTALS-Kyber, is a key encapsulation mechanism based on module lattice.

Notation

Pre-requisited

The LWE Problem

Definition

The Learning with Errors Problem (LWE) has an important role in many cryptographic schemes that are related to lattice cryptography, so let's review this problem again.

Definition 1. For positive integer

m, n, q

and

β < q

, the

L W E_{n, m, q, β}

problem asks to distinguish between the following two distributions:

$(A, A s + e)$ where
$A \leftarrow Z_{q}^{n \times m}$ ,
$s \leftarrow [β]^{m}$ ,
$e \leftarrow [β]^{n}$ .
$(A, u)$ where
$A \leftarrow Z_{q}^{n \times m}$ ,
$u \leftarrow Z_{q}^{n}$ .

The crucial part that makes this problem hard is the exist of error vector

e

, which makes Gaussian elimination can't be applied here. The hardness of LWE relys on the parameters

n, m, q, β

, and it becomes harder when

m

and

\frac{β}{q}

grow. The parameter

n

is not known to have large impact on the hardness of the problem.

We also notice that there is nothing too special about using the uniform distribution for the secret and error terms, it just makes the presentation simpler. When rounded Gaussian distribution is chosen in the original definition of LWE, some practical implementations like Kyber uses the binomial distribution to generate the error because of it speed. To account for different distributions that one could use, we can define the LWE problem with the distribution like this:

Definition 2. For positive integer

m, n, q

and a distribution

ψ

, the

L W E_{n, m, q, β}

problem asks to distinguish between the following two distributions:

$(A, A s + e)$ where
$A \leftarrow Z_{q}^{n \times m}$ ,
$s \leftarrow ψ^{m}$ ,
$e \leftarrow ψ^{n}$ .
$(A, u)$ where
$A \leftarrow Z_{q}^{n \times m}$ ,
$u \leftarrow Z_{q}^{n}$ .

An LWE-based Encryption Scheme

Let's talk about an encryption scheme based on LWE, which relys on the hardness of

L W E_{m, q, β}

. This scheme is adapted from the original and have been improved by many cryptographer.

Key generation:
$\begin{aligned} s k & : s \leftarrow [β]^{m} \\ p k & : (A \in Z_{q}^{m \times m}, t = A s + e_{1}) \end{aligned}$
where
$e_{1} \leftarrow [β]^{m}$
Encryption: To encrypt a message
$m \in {0, 1}$ , the encryptor chooses
$r, e_{2} \leftarrow [β]^{m}$ and
$e_{3} \leftarrow [β]$ and output:
$(u^{T} = r^{T} A + e_{2}^{T}, v = r^{T} t + e_{3} + [\frac{q}{2}] m)$
Decryption: To decrypt, one computes
$v - u^{T} s$ . Instead of receiving message
$m$ directly, we will get:
$\begin{aligned} v - u^{T} s & = r^{T} (A s + e_{1}) + e_{3} + \frac{q}{2} m - (r^{T} A + e_{2}^{T}) s \\ = r^{T} e_{1} + e_{3} + \frac{q}{2} m - e_{2}^{T} s \end{aligned}$

We can rewrite the final equation as

e + \frac{q}{2} m

, and if the parameters are set such that

e < \frac{q}{4}

, the decryptor can determine

m

by checking whether the value

v - u^{T} s

is closer to

0

or to

\frac{q}{2}

Lattices

The core of lattice-based cryptography, include LWE, are objects known as lattices. An

m

-dimensional integer lattice

Λ

is simply a subgroup of the group

(Z^{m}, +)

. Such a group can be described via a generating set called a basis. In particular, a lattice

Λ

defined by a (full-rank) basis

B \in Z^{m \times m}

Λ = L (B) = {v \in Z^{m} : \exists A v = 0 \mod q}

In this post, we will just learn about

q

-ary integer lattices, as there are the ones that are used in cryptographic constructions. They also have the nice theoretical property that solving some problem over random instances of these lattices is as hard as solving some problem for any lattice.

For a matrix

A \in Z_{q}^{n \times m}

, the q-ary lattice

Λ

defined by

A

Λ = L_{q}^{⊥} (A) = {v \in Z^{m} : A v}

The most well known computational problems on lattices are the following:

Shortest Vector Problem (SVP): Given a lattice basis
$B$ , find the shortest nonzero vector in
$L (B)$
Closest Vector Problem (CVP): Given a lattice basis
$B$ and a target vector
$t$ (not necessarily in the lattice), find the lattice point
$v \in L (B)$ closest to
$t$
Shortest Independent Vectors Problem (SIVP): Given a lattice basis
$B \in R^{n \times n}$ , find
$n$ linearity independent lattice vectors
$S = [s_{1}, s_{2}, . . ., s_{n}]$ (where
$s_{i} \in L (B)$ for all
$i$ ) so that
$m a x | | v_{i} | | \leq m a x | | b_{i} | |$ , where
$| | x | | = \sqrt{x_{1}^{2} + x_{2}^{2} + . . . + x_{n}^{2}}$

We can represent the LWE problem in the language of lattices. Notice that these problems have the relationship: if someone find a way to solve one of these problems efficiently, then he/she can also solve remain problems.

Encryption over Polynomial Rings

The main inefficiency with the LWE-based encryption scheme above was that it required a large ciphertext for encrypting one bit. To deal with this, we can consider the LWE problem over high-degree polynomial rings, rather than just over

Z_{q}

Polynomial Rings

The polynomial ring

(Z [X], +, \times)

, with the indeterminate

X

, consists of elements of the form

a (X) = \sum_{i = 0}^{\infty} a_{i} X^{i}

for

a_{i} \in Z

, with the usual polynomial addition and multiplication operations. For convenience, we will often omit the indeterminate

X

and simply write

a

instead of

a (X)

. The degree of

a

, denoted

d e g (a)

is the largest

i

for which

a_{i} \neq 0

. A polynomial is called monic if

a_{d e g (a)} = 1

and is irreducible if it can be represented as product of two polynomials in the same ring.

We will work with the ring

(R_{q, f}, +, \times)

, where

f \in Z_{q} [X]

is a monic polynomial of degree

d

. The elements of

R_{q, f}

are the polynomials

a = \sum_{i = 0}^{d - 1} a_{i} X^{i}

, where

a_{i} \in Z_{q}

. The sum of two elements in

R_{q, f}

simply involves summing the corresponding coefficients in

Z_{q}

, that is:

a + b = \sum_{i = 0}^{d - 1} (a_{i} + b_{i}) X^{i}

So the addition of polynomial in

R_{q, f}

can be seen as addition of vectors over

Z_{q}^{d}

. Multiplication of a polynomial by an element in

Z_{q}

therefore also has the same interpretation as multiplying a vector by constant.

Multiplication of two polynomials in

R_{q, f}

involves performing a normal polynomial multiplication followed by a reduction modulo

f

, which means that the remainder after a division by

f

is performed.

Generalized-LWE Problems and Encryption

With the polynomial ring

R_{q, f}

, we can define the new version of

L W E_{n, m, β}

Definition 3: For positive integer

m, n, q, β < q

and ring

R_{q, f}

, the

R_{q, f} - L W E_{n, m, β}

problem asks to distinguish between the following two distributions:

$(A, A s + e)$ where
$A \leftarrow R_{q, f}^{n \times m}$ ,
$s \leftarrow [β]^{m}$ ,
$e \leftarrow [β]^{n}$ .
$(A, u)$ where
$A \leftarrow R_{q, f}^{n \times m}$ ,
$u \leftarrow R_{q, f}^{n}$ .

We can also improve the LWE-based encryption scheme which are described in previous section:

Key generation:
$\begin{aligned} s k & : s \leftarrow [β]^{m} \\ p k & : (A \in R_{q, f}^{m \times m}, t = A s + e_{1}) \end{aligned}$
where
$e_{1} \leftarrow [β]^{m}$
Encryption: To encrypt a message
$m \in R_{f}$ , where the coefficients are in
${0, 1}$ , the encryptor chooses
$r, e_{2} \leftarrow [β]^{m}$ and
$e_{3} \leftarrow [β]$ and output:
$(u^{T} = r^{T} A + e_{2}^{T}, v = r^{T} t + e_{3} + [\frac{q}{2}] m)$
Decryption: To decrypt, one computes:
$\begin{aligned} v - u^{T} s & = r^{T} (A s + e_{1}) + e_{3} + \frac{q}{2} m - (r^{T} A + e_{2}^{T}) s \\ = r^{T} e_{1} + e_{3} + \frac{q}{2} m - e_{2}^{T} s \end{aligned}$

And we can extract the message

m

by checking each coefficient is closer to 0 or to

q

Optimizations

Number Theoretic Transform

The main problem of cryptographic scheme using polynomial is the complexity of multiplying two polynomials of degree

d

, which is

O (d^{2})

. When applying a cryptographic scheme in real world application, it shouldn't be too slow as it will effect all system. Therefore, to perform polynomial multiplication in

R_{q, f}

efficiently, Number Theoretic Transfrom (NTT) is the best method, which has complexity

O (d \log d)

. NTT is a special case of the FFT over the finite field

G F (q)

rather than over the complex numbers.

Compression/Decompression function

A compression function is an operation that takes an element from one set into smaller target set. When the target set is bigger than start set, it will be called a decompress function.

Definition 4: For an element

x \in Z_{q}

and some positive integer

p

, we define a mapping from

Z_{q}

Z_{p}

as:

[x]_{q \to p} = [\frac{x p}{q}] \in Z_{p}

We can use the function above to compress/decompress data: when we compress an element in

Z_{q}

to one in

Z_{p}

(p < q) and decompress back to

Z_{q}

, the result will not be too far away from the original element.

Lemma 1. For integers

p < q

and

x \in Z_{q}

, it holds that

[[x]_{q \to p}]_{p \to q} = x + η \in Z_{q}

for some

η \in Z

satisfying

| η | \leq \frac{q}{2 p} + \frac{1}{2}

There are two reasons for using these functions:

Recovering the message
$m$ from the noisy decryption output can be done using the compression function. In particular, for an element
$x \in Z_{q}, [x]_{q \to 2}$ will map to
$0$ if
$x$ is closer to
$0$ than to
$\frac{1}{2}$ , and to
$1$ otherwise.
Using these function will bring bandwidth efficiency while maintaining security properties.

Key Encapsulation Mechanism

In cryptography, a key encapsulation mechanism, or KEM, is a public-key cryptosystem that allows a sender to generate a short secret key and transmit it to a receiver securely, in spite of eavesdropping and intercepting adversaries.

There are three algorithms in KEM - KEM-KeyGen, KEM-Encaps and KEM-Decaps.

The key generation algorithms outputs a secret key and a public key.
The encapsulation algorithm takes the public key as input and outputs a share key and a ciphertext.
The decapsulation algorithm takes the ciphertext and the secret key as input and produces the same shared key as output

A CPA-secure KEM is one in which an adversary cannot distinguish the shared key from uniform when given the public key and ciphertext. Such a KEM can be constructed from any CPA-secure public key encryption scheme by simply encrypting a random message and setting it as the shared key.

Detail

Algorithms

CRYSTALS-Kyber CPA-secure Encryption Scheme

Let's talk about CRYSTALS-Kyber CPA-secure Encryption Scheme, which is based on the hardness of the generalized LWE problem. The scheme works over the ring

R_{3329, X^{256} + 1}

, and the distribution of the secrets, denoted as

ψ_{η}

for positive integer

η

, is drawn from the binomial distribution because it's easier to sample.

Public parameters:
$k, η_{1}, η_{2}, d_{u}, d_{v} \in Z^{+}$
CPA-KeyGen:
$\begin{aligned} A & \leftarrow R_{3329, X^{256} + 1}^{k \times k} \\ (s, e) & \leftarrow ψ_{η_{1}}^{k} \times ψ_{η_{1}}^{k} \\ t & = A s + e \\ p k & = (A, t), s k = s \end{aligned}$
CPA-Encrypt(pk, m):
$\begin{aligned} (r, e_{1}, e_{2}) & \leftarrow ψ_{η_{1}}^{k} \times ψ_{η_{2}}^{k} \times ψ_{η_{2}} \\ u^{T} & = [r^{T} A + e_{1}^{T}]_{q \to 2^{d_{u}}} \\ v & = [r^{T} t + e_{2} + \frac{q - 1}{2} m]_{q \to 2^{d_{v}}} \\ c & = (u, v) \end{aligned}$
CPA-Decrypt(sk, c):
$\begin{aligned} u^{'} & = [u]_{2^{d_{u}} \to q} \\ v^{'} & = [v]_{2^{d_{v}} \to q} \\ m^{'} & = [v - u^{T} s]_{q \to 2} \end{aligned}$

ML-KEM

Wrapped everything above, we can talk about ML-KEM right now!

Public parameters: Same as CPA-Encryption scheme.
KEM-KeyGen:
$\begin{aligned} (p k, s k) & \leftarrow CPA-KeyGen \\ p k & := (A, t), s k := s \end{aligned}$
KEM-Encaps(pk):
$\begin{aligned} m & \leftarrow {0, 1}^{256} \in R_{X^{256} + 1} \\ (K, ρ) & := H (m, p k) \in {0, 1}^{512} \\ c & := CPA-Encrypt (p k, m, ρ) \\ s k & := K, c t x t := c \end{aligned}$
KEM-Decaps(sk, c, h, z):
$\begin{aligned} m^{'} & := CPA-Decrypt (s k, c) \\ (K^{'}, ρ^{'}) & := H (m^{'}, p k) \\ c^{'} & := CPA-Encrypt (p k, m^{'}, ρ^{'}) \\ c \neq c^{'} & ⟹ K^{'} :=⊥ \\ Shared Key & := K^{'} \end{aligned}$

There are some notices about ML-KEM:

The polynomials comprising the matrix
$A$ are sampled at random. And in order to efficiently do the multiplication
$A s$ , we need to convert all polynomials in
$A$ to their NTT representation. The best strategy here is sample
$A$ randomly in its NTT representation. Furthermore, the public key
$t$ should be stored in its NTT representation for many benefits it brings.
We can't sample
$s, e, r, e_{1}, e_{2}$ directly in their NTT representation because their distribution is not uniformly random.
We cannot perform the compression operations
$[.]_{q \to p}$ when the element is in its NTT representation.

The table below is parameters for the three instantiations of Kyber. The security of the three schemes are approximately equivalent to that of AES-128, AES-192, and AES-256, respectively.

	k	$η_{1}$	$η_{2}$	$d_{u}$	$d_{v}$	decapsulation key size	encapsulation key size	ciphertext size
Kyber-512	2	3	2	10	4	1632 B	800 B	768 B
Kyber-768	3	2	2	10	4	2400 B	1184 B	1088 B
Kyber-1024	4	2	2	11	5	3168 B	1568 B	1568 B

Implementation

The implementation is described clearly at here. You can find example implementation of ML-KEM at https://github.com/Giapppp/ml-kem.

Benchmark

This part uses the implementation above to run three instantiations of Kyber. The environment uses here is MacOS Solama, 1,4 GHz Quad-Core Intel Core i5, 16 GB 2133 MHz LPDDR3.

	Times
Kyber-512	0.26s
Kyber-768	0.40s
Kyber-1024	0.81s

Conclusion

FIPS 203 and the ML-KEM standard represent significant advancements in cryptographic technology, particularly in preparing for potential future threats posed by quantum computing. By understanding the parameter sets, differences from previous schemes, and practical considerations, organizations can effectively implement ML-KEM to enhance their data protection strategies.

Resources

FIPS 203: Module-Lattice-based Key-Encapsulation-Mechanism

Vadim Lyubashevsky, Basic Lattice Cryptography: The concepts behind Kyber (ML-KEM) and Dilithium (ML-DSA)

An Overview about FIPS 203: Module-Lattice-based Key-Encapsulation-Mechanism

Introduction

Notation

Pre-requisited

The LWE Problem

Definition

An LWE-based Encryption Scheme

Lattices

Encryption over Polynomial Rings

Polynomial Rings

Generalized-LWE Problems and Encryption

Optimizations

Number Theoretic Transform

Compression/Decompression function

Key Encapsulation Mechanism

Detail

Algorithms

CRYSTALS-Kyber CPA-secure Encryption Scheme

ML-KEM

Implementation

Benchmark

Conclusion

Resources

Read more

Blockchain Thingies

Garbled circuit

Writeup

Square Attack