FHE Schemes

This is a WIP.

Materials used for the notes below:

Summary of FHE Slides [1]
Summary of FHE Blog [2]
FHE survey paper 2022 [3]
Intro to FHE Video [4]
CHIMERA [5]
KPZ22 [6] gives many important formulas concretely in its appendix
TFHE-Tech and TFHE-Overview

Preliminaries on Number/Polynomial Representation

A number can be represented in "whole", radix, RNS, or tower form:

- "whole": the value is stored as a single element of the domain $[0, n - 1]$ for a modulus $n$; (native) prime-field modular arithmetic is an example
- radix: $x = \sum_{i = 0}^{l - 1} x_{i} \cdot b^{i}$, where $x_{i} \in [0, b - 1]$ and $b$ is the base; decimal and binary representations are examples
  - radix form is more general than the "whole" form: take $n$ as the base and keep only the digit $i = 0$
  - we can do additions digit by digit in parallel, but we have to accumulate the carries
  - we can do comparison fast by going from the most significant "bit" to the least significant "bit"
- RNS (residue number system, see more at RNS and apps): a more general form with different bases $p_{i}$, as long as they are pairwise coprime; $x$ is represented by its residues $(r_{0}, \dots, r_{l - 1})$ with $r_{i} = x \bmod p_{i}$
  - since the $p_{i}$'s are pairwise coprime, additions can be done in parallel and there is no carry
  - we have to convert to radix or "whole" form for comparison
- tower form: TBD

Nevertheless, when we have bases (radix or rns) we can do SIMD (packing).
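
To make the carry vs. no-carry difference concrete, here is a minimal Python sketch. The helper names (`to_radix`, `radix_add`, `to_rns`, `rns_add`) and the toy moduli are assumptions made for illustration only.

```python
def to_radix(x, b, l):
    """Little-endian base-b digits of x, padded to l digits."""
    digits = []
    for _ in range(l):
        x, d = divmod(x, b)
        digits.append(d)
    return digits

def radix_add(xs, ys, b):
    """Digit-wise addition; the carry has to ripple through the digits."""
    out, carry = [], 0
    for xd, yd in zip(xs, ys):
        carry, d = divmod(xd + yd + carry, b)
        out.append(d)
    return out  # the final carry is dropped: the result is modulo b**l

def to_rns(x, moduli):
    """Residues of x modulo each (pairwise coprime) modulus."""
    return [x % p for p in moduli]

def rns_add(xs, ys, moduli):
    """Each residue is handled independently: no carry between positions."""
    return [(xr + yr) % p for xr, yr, p in zip(xs, ys, moduli)]

moduli = [3, 5, 7]  # pairwise coprime, so the representable range is 3*5*7 = 105
x, y = 23, 58
radix_sum = radix_add(to_radix(x, 10, 3), to_radix(y, 10, 3), 10)
rns_sum = rns_add(to_rns(x, moduli), to_rns(y, moduli), moduli)
```

Both results encode 81: `radix_sum` as decimal digits (least significant first) and `rns_sum` as residues modulo 3, 5 and 7.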

A polynomial can be represented in coefficient or evaluation form:

- coefficient: $f(X) = \sum_{i = 0}^{d - 1} a_{i} \cdot X^{i}$
- evaluation: $\{(x_{i}, f(x_{i}))\}_{i = 0}^{d - 1}$

The coefficients of a polynomial are numbers, represented in one of the forms above.
Polynomials also have bases, $f(X) = q(X) \cdot z(X)$, for example:

- $z(X) = X^{N} + 1$ (used in (R)LWE [2])
- $z(X) = \prod_{i = 0}^{l - 1} (X - x_{i})$

When dealing with polynomials we rely on the FFT (NTT) for fast computation (as opposed to the radix or RNS parallelization used for number computation).
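
A small sketch of why the evaluation form matters: modulo $z(X) = X^{N} + 1$, multiplication in coefficient form is a (negacyclic) convolution, while at the roots of $z$ it is a pointwise product, which is what the NTT exploits. The toy parameters `Q = 17`, `N = 4` and the function names are assumptions for illustration; a real NTT would replace the naive evaluation used here.

```python
Q = 17  # toy prime modulus (assumption)
N = 4   # toy ring degree, z(X) = X^N + 1

def evaluate(coeffs, x, q=Q):
    """Horner evaluation of f at x, modulo q."""
    acc = 0
    for a in reversed(coeffs):
        acc = (acc * x + a) % q
    return acc

def negacyclic_mul(f, g, n=N, q=Q):
    """Schoolbook product of f and g modulo (X^n + 1, q)."""
    out = [0] * n
    for i, fi in enumerate(f):
        for j, gj in enumerate(g):
            k = i + j
            if k < n:
                out[k] = (out[k] + fi * gj) % q
            else:  # X^n = -1, so X^k = -X^(k - n)
                out[k - n] = (out[k - n] - fi * gj) % q
    return out

# Roots of X^N + 1 modulo Q: all x with x^N = -1 (mod Q).
roots = [x for x in range(Q) if pow(x, N, Q) == Q - 1]

f = [1, 2, 3, 4]
g = [5, 6, 7, 8]
h = negacyclic_mul(f, g)

# In evaluation form (at the roots of z), multiplication is pointwise.
f_eval = [evaluate(f, x) for x in roots]
g_eval = [evaluate(g, x) for x in roots]
h_eval = [evaluate(h, x) for x in roots]
```

The pointwise products `f_eval[i] * g_eval[i] % Q` agree with `h_eval[i]` for every root.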

Generations of FHE schemes

1st, 2nd, 3rd ([1]):

4th ([1]):

2nd gen has more efficient packing than 3rd gen, so it is better for depth-1 or depth-2 computation!
4th gen is not directly usable because it works on approximate numbers; however, its ideas should be useful for tweaks (later schemes).

Preliminaries on Encapsulation and Noise Growth Management

An encapsulation of a number (e.g. an encryption) can carry noise. For example, an encapsulation of 16 "bits" (think in radix or RNS form) can hold an actual value of up to 4 bits and noise of up to 12 bits, and we can always recover the actual value as long as the noise does not exceed that 12-bit threshold.

Noise increases "a bit" when adding two encapsulations: in our example, adding two encapsulations with 11 bits of noise each gives 12 bits of noise, which is still good, but we cannot go beyond that.

Noise increases a lot when multiplying an encapsulation by a number: e.g. multiplying an encapsulation with 8 bits of noise by a 4-bit number still gives a good encapsulation with 12 bits of noise.

Noise increases even more when multiplying two encapsulations: e.g. multiplying two encapsulations with a 4-bit value and 2 bits of noise each is safe, with 12 bits of noise afterwards, but we cannot go beyond that.

It is all about noise growth management when handling encapsulations.
If we trivially pick something like an 8-bit value with 1,000,000 bits of room for noise, we get a lot of multiplication depth, but this is expensive in terms of both storage and computation.
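
The bookkeeping in the examples above can be written down as a crude toy model. This is only a sketch that reproduces the bit counts used in these notes; the `Enc` class and the three growth rules are assumptions, not the noise analysis of any real scheme.

```python
from dataclasses import dataclass

TOTAL_BITS = 16  # size of the encapsulation in the example above

@dataclass
class Enc:
    value_bits: int  # bits reserved for the actual value
    noise_bits: int  # current noise, in bits

    def ok(self):
        """Recoverable while value and noise still fit in TOTAL_BITS."""
        return self.value_bits + self.noise_bits <= TOTAL_BITS

def add(x, y):
    # adding two noises of similar size costs about one extra bit
    return Enc(max(x.value_bits, y.value_bits),
               max(x.noise_bits, y.noise_bits) + 1)

def mul_plain(x, k_bits):
    # multiplying by a k-bit number adds about k bits of noise
    return Enc(x.value_bits, x.noise_bits + k_bits)

def mul(x, y):
    # crude rule matching the example: the full widths of both operands add up
    return Enc(max(x.value_bits, y.value_bits),
               x.value_bits + x.noise_bits + y.value_bits + y.noise_bits)
```

For instance, `mul(Enc(4, 2), Enc(4, 2))` lands exactly on the 12-bit threshold, while one more bit of input noise already breaks it.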

Noise Growth Management Strategy

Relinearization (a.k.a Key Switching, 2nd + 3rd gen FHE) [2]

Quite circular, and we need to keep the "degree" of the relinearization polynomial low so that the noise stays below the threshold.

Modulus Switching (2nd + 3rd gen FHE) [4]

Rescaling (4th gen FHE) [2]

Important Schemes

1st gen FHE [3]

2nd gen FHE [3]

3rd gen FHE [3]

4th gen FHE [3]

Comparison [3]

3rd gen is faster but requires more communication (larger ciphertext size), so we encrypt (and pack) with 2nd gen for communication and then switch (and unpack) to 3rd gen for computation! (ONLY WHEN BOOTSTRAPPING IS NEEDED; OTHERWISE NO CONVERSION IS NEEDED!)

For non-binary data we use 2nd gen, and for binary data we use 3rd gen.

A bit more detail on important Schemes

Notation

- $R_{q} = Z_{q}[X] / (X^{N} + 1)$ (packing $N$ elements inside a ciphertext);
- $N$ is a power of 2;
- $q$ is the modulus of the ciphertext space (ciphertext $c$);
- $t$ is the modulus of the plaintext space (message $m$), and $q \gg t$;
- $\chi_{\cdot}$ is the distribution for key and error;
- $s \in R_{q} \leftarrow \chi_{S}$ is the secret key;
- $e \in R_{q} \leftarrow \chi_{E}$ is the error;
- LWE: $c = (a, b) = (a_{0}, \dots, a_{N - 1}, b) \in Z_{q}^{N + 1}$ s.t. $\langle a, s \rangle + b = m + e$;
- RLWE: $m$ becomes $m(X)$, where $c = (a, b) \in R_{q}^{2}$ s.t. $a \cdot s + b = m(X) + e(X)$;
- $q$ is a simplification of $Q_{i} = q_{i} Q_{i - 1}$, where the $q_{i}$ form a chain of moduli of more or less the same bit width (e.g. 32- or 64-bit primes);
- noise has to be less than $q / 2$.
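
As a sanity check of the LWE relation $\langle a, s \rangle + b = m + e$, here is a minimal toy sketch. It is an illustration under assumptions: the message is encoded as $\Delta \cdot m$ with $\Delta = q / t$ (the scaling these notes mention for BFV), the secret is binary, and the parameters are far too small to be secure.

```python
import random

def keygen(n, rng):
    return [rng.randrange(2) for _ in range(n)]  # binary secret (assumption)

def encrypt(m, s, q, t, rng, noise_bound=4):
    delta = q // t                       # scaling factor (assumption, BFV-style)
    a = [rng.randrange(q) for _ in s]
    e = rng.randint(-noise_bound, noise_bound)
    inner = sum(ai * si for ai, si in zip(a, s))
    b = (delta * m + e - inner) % q      # so that <a, s> + b = delta*m + e (mod q)
    return a, b

def decrypt(c, s, q, t):
    a, b = c
    delta = q // t
    phase = (sum(ai * si for ai, si in zip(a, s)) + b) % q  # = delta*m + e
    return ((phase + delta // 2) // delta) % t              # round away the noise

def add(c1, c2, q):
    """Homomorphic addition: add component-wise modulo q."""
    (a1, b1), (a2, b2) = c1, c2
    return [(x + y) % q for x, y in zip(a1, a2)], (b1 + b2) % q

rng = random.Random(0)
q, t = 2**15, 16
s = keygen(16, rng)
c1 = encrypt(5, s, q, t, rng)
c2 = encrypt(9, s, q, t, rng)
```

Decrypting `add(c1, c2, q)` gives `5 + 9 = 14` as long as the accumulated noise stays below $\Delta / 2$.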

General

- Plaintext messages are vectors of $N$ elements (coefficients) modulo $t$ (prime $t = p$, but in general $t = p^{r}$ coprime to $2N$ for packing)
- Keygen: $s \leftarrow \chi_{S}$; $a' \leftarrow R_{q}$; $e \leftarrow \chi_{E}$; $b = a' s + 2 e$;
  - $sk = (1, s) \in R_{q}^{2}$ and $pk = a = (b, -a') \in R_{q}^{2}$
- Enc: $m \in R_{2}$ becomes $\mathbf{m} = (m, 0) \in R_{q}^{2}$; $r, e_{0}, e_{1} \leftarrow \chi$
  - $c = \mathbf{m} + 2 (e_{0}, e_{1}) + a r$
  - i.e. $c = (c_{0}, c_{1}) = (m + 2 e_{0} + b r, 2 e_{1} - a' r)$
- Dec (remember, all arithmetic below is $\bmod q$):
  - $m = \langle c, sk \rangle \bmod 2 = (c_{0} + c_{1} s) \bmod 2 = (m + 2 (e_{0} + e_{1} s + e r)) \bmod 2$
- Add: just add!
- Mul: just multiply (element-wise), but look at decryption; the result now needs a refresh:
  - $\langle c, sk \rangle \cdot \langle c', sk \rangle = (c_{0} + c_{1} s)(c_{0}' + c_{1}' s) = c_{0} c_{0}' + (c_{0} c_{1}' + c_{1} c_{0}') s + c_{1} c_{1}' s^{2} = d_{0} + d_{1} s + d_{2} s^{2}$
- The encoding of $m$ can be scaled with $\Delta = Q / t$ in BFV
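
The Keygen/Enc/Dec/Add/Mul steps above can be checked with a toy implementation. This is a sketch under assumptions: `N = 8`, an odd toy modulus `Q`, coefficients in $\{-1, 0, 1\}$ as a stand-in for $\chi$, and a three-component decryption (`decrypt3`) using $(1, s, s^{2})$ instead of a refresh. None of this is secure or optimized.

```python
import random

N, Q = 8, 2**20 + 1  # toy parameters (assumptions); Q is odd so "mod 2" is meaningful

def pmul(f, g):
    """Product in R_Q = Z_Q[X]/(X^N + 1)."""
    out = [0] * N
    for i, fi in enumerate(f):
        for j, gj in enumerate(g):
            k, sign = (i + j, 1) if i + j < N else (i + j - N, -1)
            out[k] = (out[k] + sign * fi * gj) % Q
    return out

def padd(f, g):
    return [(x + y) % Q for x, y in zip(f, g)]

def scale(f, k):
    return [(k * x) % Q for x in f]

def small(rng):
    """Small polynomial with coefficients in {-1, 0, 1} (stand-in for chi)."""
    return [rng.choice((-1, 0, 1)) for _ in range(N)]

def keygen(rng):
    s, e = small(rng), small(rng)
    a1 = [rng.randrange(Q) for _ in range(N)]
    b = padd(pmul(a1, s), scale(e, 2))              # b = a's + 2e
    return s, (b, scale(a1, -1))                    # sk = (1, s), pk = (b, -a')

def encrypt(m, pk, rng):
    b, neg_a1 = pk
    r, e0, e1 = small(rng), small(rng), small(rng)
    c0 = padd(padd(m, scale(e0, 2)), pmul(b, r))    # m + 2e0 + b r
    c1 = padd(scale(e1, 2), pmul(neg_a1, r))        # 2e1 - a' r
    return c0, c1

def centered(x):
    """Representative of x mod Q in (-Q/2, Q/2]."""
    return x - Q if x > Q // 2 else x

def decrypt(c, s):
    phase = padd(c[0], pmul(c[1], s))               # c0 + c1 s = m + 2(...)
    return [centered(x) % 2 for x in phase]

def add(c, d):
    return padd(c[0], d[0]), padd(c[1], d[1])

def mul(c, d):
    """Returns (d0, d1, d2), which decrypts under (1, s, s^2)."""
    d0 = pmul(c[0], d[0])
    d1 = padd(pmul(c[0], d[1]), pmul(c[1], d[0]))
    d2 = pmul(c[1], d[1])
    return d0, d1, d2

def decrypt3(d, s):
    phase = padd(padd(d[0], pmul(d[1], s)), pmul(d[2], pmul(s, s)))
    return [centered(x) % 2 for x in phase]

rng = random.Random(1)
s, pk = keygen(rng)
m1 = [1, 0, 1, 1, 0, 0, 1, 0]
m2 = [0, 1, 1, 0, 1, 0, 0, 1]
ct1, ct2 = encrypt(m1, pk, rng), encrypt(m2, pk, rng)
```

Note that the product ciphertext has three components, which is exactly why relinearization (key switching from $s^{2}$ back to $s$) is needed.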

BGV/BFV, non-binary, can be fixed point, i.e. Key Switching + Modulus Switching

- Key Switching
  - $BitDecomp(x \in R_{q}^{n}, q)$ decomposes $x = \sum_{j = 0}^{\lfloor \log q \rfloor} 2^{j} u_{j}$, where each $u_{j} \in R_{2}^{n}$, into $(u_{0}, \dots, u_{\lfloor \log q \rfloor}) \in R_{2}^{n \cdot \lceil \log q \rceil}$;
  - $Powersof2(x \in R_{q}^{n}, q)$ returns $(x, 2 \cdot x, \dots, 2^{\lfloor \log q \rfloor} \cdot x) \in R_{q}^{n \cdot \lceil \log q \rceil}$;
  - SwitchKeyGen($s_{1} \in R_{q}^{n_{1}}, s_{2} \in R_{q}^{n_{2}}$)
    - $A$ = KeyGen($s_{2}$, $N = n_{1} \cdot \lceil \log q \rceil$)
    - $B = A + Powersof2(s_{1})$
  - SwitchKey($B, c_{1}$) ($c_{1}$ under $s_{1}$ is switched to $c_{2}$ under $s_{2}$ using $B$)
    - $c_{2} = BitDecomp(c_{1})^{T} \cdot B \in R_{q}^{n_{2}}$
- Modulus Switching from modulus $q$ to $p$, i.e. $[\langle c, s \rangle]_{q} = [\langle c', s \rangle]_{p} \bmod 2$: we just need to scale (and round) $c$ by $p / q$ to get $c'$; this also scales the noise down (given $p$ sufficiently smaller than $q$)
- Refresh: switch the key from $s^{2}$ to $s$ and do the modulus switching to reduce noise, or, using the chain of moduli:
  - $c_{1} = Powersof2(c, q_{j})$
  - Scale $c_{1}$ in $q_{j}$ to $c_{2}$ in $q_{j - 1}$ (goes down the chain)
  - Now in modulo $q_{j - 1}$: $c_{2}$ under $s_{j}^{2}$ can be switched to $c_{3}$ under $s_{j - 1}$
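
The reason $BitDecomp$ and $Powersof2$ appear together is the identity $\langle BitDecomp(c), Powersof2(s) \rangle = \langle c, s \rangle \bmod q$: the inner product is preserved while the decomposed vector only has entries in $\{0, 1\}$. A minimal sketch, assuming plain integers modulo $q$ instead of ring elements (the function names and toy values are illustrative):

```python
def bit_decomp(x, q):
    """Bits of every entry of x, grouped by bit position: (u_0, ..., u_{l-1})."""
    l = q.bit_length()  # enough bits for any value in [0, q)
    return [(xi >> j) & 1 for j in range(l) for xi in x]

def powers_of_2(x, q):
    """(x, 2x, ..., 2^(l-1) x) mod q, in the same order as bit_decomp."""
    l = q.bit_length()
    return [(xi << j) % q for j in range(l) for xi in x]

def inner(u, v, q):
    return sum(a * b for a, b in zip(u, v)) % q

q = 97
c = [13, 55, 90]   # toy "ciphertext" vector (assumption)
s = [7, 21, 64]    # toy "key" vector (assumption)
lhs = inner(bit_decomp(c, q), powers_of_2(s, q), q)
rhs = inner(c, s, q)
```

Here `lhs == rhs`, and every entry of `bit_decomp(c, q)` is a bit, so multiplying it with a noisy encryption of `powers_of_2(s, q)` amplifies the noise only slightly.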

If represented in RNS form, we can also do hybrid key switching; the GHS method is efficient but needs to double $N$ or use $q / 2$.

TRLWE (Torus)

- The torus is roughly $m / q \bmod 1$ (the fractional part only);
- Message space is $T = \mathbb{R}[X] / (X^{N} + 1) \bmod 1$ and ciphertext space is $T^{(k + 1) l}$;
- Decryption requires computing a $\kappa$-Lipschitz function $\phi_{s} : T^{N} \times T \to T$ s.t. $\phi_{s}(a, b) = b - s \cdot a$
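
A scalar toy version of the phase function $\phi_{s}(a, b) = b - s \cdot a$ on the torus can be sketched as follows. Assumptions: a discretized torus (multiples of $1 / 2^{16}$, handled exactly with `Fraction`), a binary secret vector, and a message $m \in Z_{t}$ encoded as $m / t$; this is plain (T)LWE, not the polynomial TRLWE case.

```python
import random
from fractions import Fraction

DENOM = 2**16  # discretized torus: multiples of 1/DENOM (assumption)

def torus(x):
    """Reduce modulo 1, i.e. keep the fractional part only."""
    return x % 1

def encode(m, t):
    return Fraction(m, t)  # message m in Z_t -> m/t on the torus

def encrypt(mu, s, rng, noise=8):
    a = [Fraction(rng.randrange(DENOM), DENOM) for _ in s]
    e = Fraction(rng.randint(-noise, noise), DENOM)
    b = torus(sum(si * ai for si, ai in zip(s, a)) + mu + e)
    return a, b

def phase(c, s):
    """phi_s(a, b) = b - s.a (mod 1), which equals mu + e."""
    a, b = c
    return torus(b - sum(si * ai for si, ai in zip(s, a)))

def decode(ph, t):
    return round(ph * t) % t  # nearest multiple of 1/t

rng = random.Random(0)
t = 8
s = [rng.randrange(2) for _ in range(16)]
ct = encrypt(encode(3, t), s, rng)
```

`decode(phase(ct, s), t)` recovers `3` because the noise is much smaller than $1 / (2t)$.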

CKKS, approx., i.e. Rescaling

TFHE, binary, short, i.e. Programmable Bootstrapping

More recent advances

GBFV

Important concepts (whiteboxing the schemes)

Key Switching

No noise reset, but we can switch to a new (possibly smaller) key

Modulus Switching

Can reset noise and switch to a smaller modulus
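
A toy sketch of modulus switching for an LWE-style ciphertext with the message in the least significant bit ($\langle c, (1, s) \rangle = m + 2e \bmod q$). Assumptions: both moduli are odd, the secret is binary, and each entry is scaled by $p / q$ and rounded to the nearest integer with the same parity, so that the phase modulo 2 is preserved. The parameters are illustrative only.

```python
import random

def centered(x, q):
    """Representative of x mod q in (-q/2, q/2]."""
    x %= q
    return x - q if x > q // 2 else x

def encrypt(m, s, q, rng, noise=4):
    """Ciphertext c with <c, (1, s)> = m + 2e (mod q), m a bit."""
    a = [rng.randrange(q) for _ in s]
    e = rng.randint(-noise, noise)
    c0 = (m + 2 * e - sum(ai * si for ai, si in zip(a, s))) % q
    return [c0] + a

def phase(c, s, q):
    return centered(c[0] + sum(ci * si for ci, si in zip(c[1:], s)), q)

def mod_switch(c, q, p):
    """Scale each entry by p/q, rounding to the nearest integer of the same parity."""
    out = []
    for x in c:
        y = (x * p) // q        # floor of x*p/q
        if (y - x) % 2:         # wrong parity: floor + 1 is the nearest valid choice
            y += 1
        out.append(y)
    return out

rng = random.Random(0)
q, p = 2**20 + 1, 2**10 + 1     # both odd (assumption)
s = [rng.randrange(2) for _ in range(16)]
c = encrypt(1, s, q, rng, noise=2**14)
c_small = mod_switch(c, q, p)
```

The phase of `c` can be as large as about $2^{15}$, while the phase of `c_small` is bounded by roughly $(p / q) \cdot 2^{15}$ plus the $\ell_{1}$ norm of the key, and both have the same parity `m`.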

Packing/Unpacking

Pack and unpack: think about sending a packed ciphertext (on which multiplication is not possible) that can be unpacked (using some big key) into several ciphertexts (on which multiplication is possible)

Amortization

Doing things in a batch so that it costs less on average (caution: may not be parallelizable)

Rotation (Automorphism)

Cyclic rotation

$X \to X^{k}$
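
In coefficient form, the automorphism $X \to X^{k}$ (for odd $k$) on $Z[X] / (X^{n} + 1)$ is just a signed permutation of the coefficients; applied to a packed ciphertext it permutes the slots. A minimal sketch on plain polynomials (the function name and toy values are assumptions):

```python
def automorphism(f, k, n):
    """Apply X -> X^k to f in Z[X]/(X^n + 1), for odd k."""
    out = [0] * n
    for i, a in enumerate(f):
        e = (i * k) % (2 * n)   # X has order 2n because X^n = -1
        if e < n:
            out[e] += a
        else:                   # X^e = -X^(e - n)
            out[e - n] -= a
    return out

f = [1, 2, 3, 4]                # 1 + 2X + 3X^2 + 4X^3 with n = 4
g = automorphism(f, 3, 4)       # X -> X^3
back = automorphism(g, 3, 4)    # 3 * 3 = 9 = 1 (mod 8), so this undoes the map
```

Since $3 \cdot 3 \equiv 1 \pmod{2n}$ for $n = 4$, applying the map twice returns the original polynomial.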

Programmable Bootstrapping

Beyond multiplication, basically a lookup table
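
The lookup-table idea can be illustrated in the clear: put the table into the coefficients of a polynomial $v(X)$, multiply by $X^{-\mu}$ modulo $X^{n} + 1$, and read the constant coefficient. This is only a plaintext sketch with assumed names; as far as we understand, in TFHE the same rotation is done homomorphically with an encrypted $\mu$ (blind rotation).

```python
def mul_by_monomial(f, k, n):
    """Multiply f by X^k in Z[X]/(X^n + 1); k may be negative."""
    out = [0] * n
    for i, a in enumerate(f):
        e = (i + k) % (2 * n)   # exponents live modulo 2n because X^n = -1
        if e < n:
            out[e] += a
        else:
            out[e - n] -= a
    return out

def lookup(table, mu):
    """Read the table at mu as the constant coefficient of X^(-mu) * v(X)."""
    n = len(table)
    rotated = mul_by_monomial(table, -mu, n)  # v(X) has coefficients table[i]
    return rotated[0]

table = [i * i + 1 for i in range(8)]  # any function on [0, 8), here i^2 + 1
```

For `mu` in `[0, n)` this returns `table[mu]`; for `mu` in `[n, 2n)` the negacyclic wrap-around returns `-table[mu - n]`, which is why the input range has to be handled with care.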

Functional Bootstrapping

Is this similar to PB? Think about evaluating a specific hash such as SHA256.

Scheme Switching

CHIMERA [5] on LWE/RLWE schemes: think about switching from non-binary to binary, and maybe just extracting some bits.

Transciphering (or Hybrid HE)

Caution

All these conveniences may require a BIG one-time setup (but maybe that is fine: we do it once and it is free next time)

A bit more complex: Arithmetic Simulation

A whole number modulo $p$ can fit "natively" into an FHE ciphertext with prime base $p$, but this can be expensive depending on $p$.

A whole number modulo $p$ can fit "non-natively" into an FHE ciphertext with composite base $n$ in RNS form,

$n = \prod_{i = 0}^{l - 1} p_{i}$

resulting in many ciphertexts. This is cheaper in computation, but we have to do the modular reduction by $p$ ourselves, and this is expensive.

A byte array can fit "natively" into a binary ciphertext.
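
A plaintext sketch of the "non-native" case: a multiplication modulo $p$ is simulated in an RNS basis whose product $n$ is larger than $(p - 1)^{2}$, and the reduction modulo $p$ needs the whole value back (CRT reconstruction). The moduli, `p`, and the helper names are assumptions for illustration; in an FHE setting each residue would live in its own ciphertext.

```python
from math import prod

def to_rns(x, moduli):
    return [x % m for m in moduli]

def from_rns(residues, moduli):
    """CRT reconstruction of x in [0, n), with n = prod(moduli)."""
    n = prod(moduli)
    x = 0
    for r, m in zip(residues, moduli):
        n_i = n // m
        x += r * n_i * pow(n_i, -1, m)  # modular inverse via pow (Python 3.8+)
    return x % n

p = 97                    # the "non-native" modulus we want to simulate
moduli = [7, 11, 13, 17]  # n = 17017 > (p - 1)^2, so one product cannot wrap around
a, b = 45, 88

# Multiply residue-wise: one small (cheap) operation per RNS component.
prod_rns = [(x * y) % m
            for x, y, m in zip(to_rns(a, moduli), to_rns(b, moduli), moduli)]

# The reduction modulo p needs the whole value back: this is the expensive step.
result = from_rns(prod_rns, moduli) % p
```

Here `result` equals `(a * b) % p`; without the reduction, a second multiplication could already exceed `n` and wrap around.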
