The Sponge Construction in Symmetric Cryptography

Introduction

Block ciphers have been widely used in cryptography for many years. The idea is to separate a plaintext into fixed-sized blocks and perform transformations on these blocks in order to obtain a ciphertext. Block ciphers are also used in the construction of compression functions for the design of hash functions.

More recently, a new design paradigm for hash functions has been discovered a so-called sponge construction [2]. The cryptographic funtion keccak, which is widely used in the Ethereum protocol, was the first implementation of sponge constructions [3].

The goal of this article is to introduce a general algorithm for sponge constructions without presenting any practical use. The main focus will be in giving a presentation of the algorithm for any fixed-length function with some accompanying code snippets. The idea behind this approach is to present some concrete realizations of all parts of the sponge construction. Security will not be discussed here. For more details the reader is encouraged to refer to the original references at the end of this article. The code snippets that will appear will be written in Python for the sake of claritiy.

The Sponge Construction

The figure below (taken from https://keccak.team/sponge_duplex.html) presents a general view of the sponge construction.

Let us start explaining each part of this figure. Here $M$ is the input message and $Z$ is the output. Both $M$ and $Z$ are strings of bits. We will denote by $| M |$ and $| Z |$ the number of bits in $M$ and $Z$ , respectively.

A sponge function is divided in two stages: (1) absorbing and (2) squeezing.

Absorbing stage

The idea for the absorbing stage is to divide $M$ is groups of bits each of length $r$ where $r$ is a given positive integer. If $| M |$ is not a multiple of $r$ then, for security reasons as argued in [1, Theorem 10], we use the following padding rule refered to as a multi-rate padding.

#M is a string a bits and r is a positive integer
def pad(M,r):
    if len(M)%r != 0:
        paddingBits = len(M)//r*r+r-len(M)
        if paddingBits >= 2:
            return M + "1" + "0"*(paddingBits-2) + "1"
        else:
            return M + "1" + "0"*(paddingBits-2+r) + "1"
    else:
        return M

A multi-rate padding adds a string $10^{*} 1$ to the end of $M$ such that the resulting string is a multiple of $r$ . From now on, with no loss of generality, we will assume that $M$ if of length $n = k r$ where $k$ is the number of groups of bits of length $r$ in $M$ .

In the figure above note that there is a function $f$ that is applied iteratively to $b = r + c$ bits. The number $b$ is called the width, $r$ is called the rate, and $c$ is called the capacity. Thus $f$ is a function of the form $f : Z_{2}^{b} \to Z_{2}^{b}$ ; for cryptographic applications $f$ must be a one-way permutation. Each group of $r$ bits of $M$ is used $k$ times. Let $M_{i}$ denote the $i$ -th group of $k$ bits of $M$ with $1 \leq i \leq k$ .

Step 0 of the the absorbing stage is an initizialization of step before applying the function $f$ . We define the state $s_{0}$ of the sponge at step 0 as

s = M1 + 0*c

where M1 is the block $M_{1}$ of $M$ . An XOR operation is applied to the block $M_{1}$ with the string 0*r, which is just equal to $M_{1}$ , and then it is padedd with $c$ 0s. The total the number of bits of $s_{0}$ is $b$ .

Let $s_{i - 1}$ be the state just before applying the xoroperation and $f$ at step $i$ , where step $i$ is the $i$ -th time $f$ is used. The code below implements step $i$ .

#M is the i-th block of bits
#s is the state at the (i-1)-th step
#r is the rate
def step(M,s,r):
    xors = ""
    for(i in range(r)):
        xors += str(int(s[i])+int(M[i])-2*int(s[i])*int(M[i]))
    newS = xors + s[r:]
    return f(newS)

The for cycle computes de xor between $M_{i}$ and the first $r$ bits of $s_{i - 1}$ . Note that the last $b - r$ bits do not go through the xor operation and it is directly used in the call to $f$ .

This last subrutine is applied $k$ times thus obtaining a state $s_{k}$ . After computing $s_{k}$ we use it as input for the squeezing stage.

Squeezing stage

Let $s_{k}$ be the state after the absobing stage. Let $ℓ$ the the number of output bits, that is, the length of $Z$ . During the squeezing stage, the output $Z$ is constructed by iteratively concatenating blocks of $r$ bits obtained by a sequential application of the function $f$ on the state. The code below executes one step of the squeeze stage.

#s is the current state
#r is the rate
def squeeze_step(s,r):
    sNew = f(s)
    outterState = sNew[0:r]
    return outterState, sNew

The function squeeze_step is called until the length of the output is at least $ℓ$ .

#sk is the state at the end of the absorbe stage
#r is the rate
#l is the number of output bits
def squeeze(sk,r,l):
    Z = sk[0:r]
    s = sk
    while len(Z) < l:
        output_r, s = squeeze_step(s,r)
        Z += output_r
    return Z[0:l]

This concludes our brief explanation of the sponge construction.

References

Bertoni, G., Daemen, J., Peeters, M. & Van Assche, G. (2011). Cryptographic sponge functions. [link].
Bertoni, G., Daemen, J., Peeters, M., & Van Assche, G. (2007). Sponge functions. In ECRYPT hash workshop (Vol. 2007, No. 9). [link]
Bertoni, G., Daemen, J., Peeters, M., & Van Assche, G. (2008) Keccak specifications, NIST SHA-3 Submission. [link]

Introduction

The Sponge Construction

Absorbing stage

Squeezing stage

References

Read more

Final Dev Update

Week 14 Update 14.1

Week 13 Update 13.1

Week 12 Update 12.1