Penumbra DEX Engine Design

Overview

We follow the notation in Improved Price Oracles: Constant Function Market Makers.

Penumbra uses a hybrid, order-book-like AMM with automatic routing. Liquidity on Penumbra is recorded as many individual concentrated liquidity positions, akin to an order book. Each liquidity position is its own AMM, with its own fee tier, and that AMM has the simplest possible form, a constant-sum (fixed-price) market maker. These component AMMs are synthesized into a global AMM by the DEX engine, which optimally routes trades across the entire liquidity graph. Because each component AMM is of the simplest possible form, this optimization problem is easy to solve: it's a graph traversal.

Execution Phases

Penumbra has no intra-block trade ordering, so DEX execution operates at the end of the block in four phases:

All newly opened liquidity positions are added to the active set.
Trades are batched by liquidity pair and executed (in what order? random?).
The chain arbitrages all active positions using an infinite-sized flash loan and burns the arbitrage profits.
All newly closed positions are removed from the active set.

Because execution is phased, market-makers can create fill-or-kill positions with prices valid for exactly one block without having to compete for ordering within that block, by opening and then closing a liquidity position in the same transaction.
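The phase ordering can be sketched as follows. This is a minimal illustration rather than the engine's implementation: `PositionId`, `BlockActions`, and `end_block` are hypothetical names, and phases 2 and 3 are reduced to a snapshot of the set of positions they would execute against.

```rust
use std::collections::BTreeSet;

/// Hypothetical stand-in for a position ID.
type PositionId = u64;

/// Position actions collected from all transactions in one block (hypothetical type).
struct BlockActions {
    opened: Vec<PositionId>,
    closed: Vec<PositionId>,
}

/// Runs the phases in order and returns the positions that were tradable
/// during phases 2 and 3.
fn end_block(active: &mut BTreeSet<PositionId>, actions: &BlockActions) -> Vec<PositionId> {
    // Phase 1: newly opened positions join the active set.
    active.extend(actions.opened.iter().copied());

    // Phases 2 and 3 (batched trades, then arbitrage) would execute here,
    // against exactly this set of positions.
    let tradable: Vec<PositionId> = active.iter().copied().collect();

    // Phase 4: newly closed positions leave the active set.
    for id in &actions.closed {
        active.remove(id);
    }
    tradable
}
```

A position that is both opened and closed in the same block appears in the returned set but is absent from `active` afterwards, which is the fill-or-kill behavior described above.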

Trading Functions

Each position's trading function is of the form

$$φ(R) = p_{1} R_{1} + p_{2} R_{2}$$

with a fee parameter $γ$. A trade with input amounts $Δ = (Δ_{1}, Δ_{2})$ and output amounts $Λ = (Λ_{1}, Λ_{2})$ is accepted if and only if

$$φ(R) = φ(R + γ Δ - Λ).$$

The new reserves from this trade are $R' = R + γ Δ - Λ$. Conservation of value is recorded by the equation

$$R + Δ = (R + γ Δ - Λ) + Λ + (1 - γ) Δ.$$

On the left-hand side, we have the initial reserves $R$ (held by the AMM) and the trade input $Δ$ (held by the trader); on the right-hand side, we have the updated reserves $R' = R + γ Δ - Λ$ (held by the AMM), the trade output $Λ$ (held by the trader), and the fee $(1 - γ) Δ$. The fee is recorded by being accumulated into the position's reserves, which are updated as

$$R'' = R' + (1 - γ) Δ.$$

Explicit Formulas

In practice, $Δ_{1} Δ_{2} = 0$, i.e., one or the other input type is zero.

When $Δ_{2} = 0$, we have $Λ_{1} = 0$ and the trader is trading $Δ_{1}$ for $Λ_{2}$. The formulas become

$$φ(R) = φ(R + γ Δ - Λ)$$
$$p_{1} R_{1} + p_{2} R_{2} = p_{1} (R_{1} + γ Δ_{1}) + p_{2} (R_{2} - Λ_{2})$$

so we can compute $Λ_{2}$:

$$p_{2} Λ_{2} = p_{1} (R_{1} + γ Δ_{1}) - p_{1} R_{1} + p_{2} R_{2} - p_{2} R_{2}$$
$$Λ_{2} = \frac{p_{1}}{p_{2}} γ Δ_{1}$$

Similarly, when trading $Δ_{2}$ for $Λ_{1}$, we have

$$Λ_{1} = \frac{p_{2}}{p_{1}} γ Δ_{2}.$$

These formulas clarify the role of $p_{1}$ and $p_{2}$ as controlling the price, and $γ$ as controlling the spread.
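The explicit formulas can be checked numerically with a small sketch. The `ConstantSum` type and its methods are hypothetical names introduced for illustration, and floating-point arithmetic is used only for readability; it says nothing about how the engine would round.

```rust
/// Hypothetical constant-sum position: phi(R) = p1*R1 + p2*R2 with fee parameter gamma.
struct ConstantSum {
    p1: f64,
    p2: f64,
    gamma: f64,
}

impl ConstantSum {
    /// Value of the trading function at reserves (r1, r2).
    fn phi(&self, r: (f64, f64)) -> f64 {
        self.p1 * r.0 + self.p2 * r.1
    }

    /// Output of asset 2 for an input `delta1` of asset 1: (p1 / p2) * gamma * delta1.
    fn swap_1_for_2(&self, delta1: f64) -> f64 {
        self.p1 / self.p2 * self.gamma * delta1
    }

    /// Output of asset 1 for an input `delta2` of asset 2: (p2 / p1) * gamma * delta2.
    fn swap_2_for_1(&self, delta2: f64) -> f64 {
        self.p2 / self.p1 * self.gamma * delta2
    }
}
```

For example, with $p_{1} = 1$, $p_{2} = 2$, and $γ = 0.997$, an input of 100 units of asset 1 yields $0.5 \cdot 0.997 \cdot 100 = 49.85$ units of asset 2, and `phi` takes the same value on the reserves before the trade and on $R + γ Δ - Λ$.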

Composition of AMMs

Given two AMMs, $φ(R_{1}, R_{2}) = p_{1} R_{1} + p_{2} R_{2}$ with fee $γ$ trading between assets $1$ and $2$, and $ψ(S_{2}, S_{3}) = q_{2} S_{2} + q_{3} S_{3}$ with fee $δ$ trading between assets $2$ and $3$, we can compose $φ$ and $ψ$ to obtain a synthetic position $χ$ trading between assets $1$ and $3$ that first trades along $φ$ and then $ψ$ (or along $ψ$ and then $φ$).

We want to write the trading function of this AMM as

$$χ(T_{1}, T_{3}) = r_{1} T_{1} + r_{3} T_{3}$$

with fee $ε$, prices $r_{1}, r_{3}$, and reserves $T_{1}, T_{3}$.

First, write the trade inputs and outputs for each AMM as $Δ^{χ} = (Δ_{1}^{χ}, Δ_{3}^{χ})$, $Λ^{χ} = (Λ_{1}^{χ}, Λ_{3}^{χ})$, $Δ^{φ} = (Δ_{1}^{φ}, Δ_{2}^{φ})$, $Λ^{φ} = (Λ_{1}^{φ}, Λ_{2}^{φ})$, $Δ^{ψ} = (Δ_{2}^{ψ}, Δ_{3}^{ψ})$, $Λ^{ψ} = (Λ_{2}^{ψ}, Λ_{3}^{ψ})$, where the subscripts index the asset type and the superscripts index the AMM. We want $Δ^{χ} = Δ^{φ} + Δ^{ψ}$ and $Λ^{χ} = Λ^{φ} + Λ^{ψ}$, meaning that

$$(Δ_{1}^{χ}, Δ_{3}^{χ}) = (Δ_{1}^{φ}, Δ_{3}^{ψ}), \quad (Λ_{1}^{χ}, Λ_{3}^{χ}) = (Λ_{1}^{φ}, Λ_{3}^{ψ}), \quad (Δ_{2}^{φ}, Δ_{2}^{ψ}) = (Λ_{2}^{ψ}, Λ_{2}^{φ}).$$

The reserves $T_{1}$ are precisely the maximum possible output $Λ_{1}^{χ}$. On the one hand, we have $Λ_{1}^{χ} = Λ_{1}^{φ} \leq R_{1}$, since we cannot obtain more output from $φ$ than its available reserves. On the other hand, we also have

$$Λ_{1}^{χ} = Λ_{1}^{φ} = \frac{p_{2}}{p_{1}} γ Δ_{2}^{φ} = \frac{p_{2}}{p_{1}} γ Λ_{2}^{ψ} \leq \frac{p_{2}}{p_{1}} γ S_{2},$$

since we cannot input more into $φ$ than we can obtain as output from $ψ$. Both bounds must hold at once, so the maximum possible output is the smaller of the two, and we have

$$T_{1} = \min \left\{ R_{1}, \frac{p_{2}}{p_{1}} γ S_{2} \right\}, \qquad T_{3} = \min \left\{ S_{3}, \frac{q_{2}}{q_{3}} δ R_{2} \right\},$$

using similar reasoning for $T_{3}$ as for $T_{1}$.

On input $Δ_{1}^{χ}$, the output $Λ_{3}^{χ}$ is

$$Λ_{3}^{χ} = Λ_{3}^{ψ} = \frac{q_{2}}{q_{3}} δ Δ_{2}^{ψ} = \frac{q_{2}}{q_{3}} δ Λ_{2}^{φ} = \frac{q_{2} p_{1}}{q_{3} p_{2}} δ γ Δ_{1}^{φ} = \frac{q_{2} p_{1}}{q_{3} p_{2}} δ γ Δ_{1}^{χ},$$

and similarly on input $Δ_{3}^{χ}$, the output $Λ_{1}^{χ}$ is

$$Λ_{1}^{χ} = Λ_{1}^{φ} = \frac{p_{2}}{p_{1}} γ Δ_{2}^{φ} = \frac{p_{2}}{p_{1}} γ Λ_{2}^{ψ} = \frac{p_{2} q_{3}}{p_{1} q_{2}} γ δ Δ_{3}^{ψ} = \frac{p_{2} q_{3}}{p_{1} q_{2}} γ δ Δ_{3}^{χ},$$

so we can write the trading function $χ$ of the composition as

$$χ(T_{1}, T_{3}) = r_{1} T_{1} + r_{3} T_{3}$$

with $r_{1} = p_{1} q_{2}$, $r_{3} = p_{2} q_{3}$, fee $ε = γ δ$, and reserves $T_{1}$, $T_{3}$.
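A minimal sketch of the composition, assuming floating-point values for readability. The `Amm` type and `compose` function are hypothetical names. Each synthetic reserve is taken as the smaller of the two upper bounds derived above, since the output can exceed neither of them.

```rust
/// Hypothetical constant-sum position with its current reserves.
#[derive(Clone, Copy, Debug)]
struct Amm {
    /// Price coefficients of the two assets, in pair order.
    p: (f64, f64),
    /// Fee parameter.
    fee: f64,
    /// Reserves of the two assets, in pair order.
    reserves: (f64, f64),
}

/// Composes `phi` (assets 1 and 2) with `psi` (assets 2 and 3) into a
/// synthetic position `chi` between assets 1 and 3.
fn compose(phi: &Amm, psi: &Amm) -> Amm {
    let (p1, p2) = phi.p;
    let (q2, q3) = psi.p;
    let (r1, r2) = phi.reserves;
    let (s2, s3) = psi.reserves;
    Amm {
        // r_1 = p_1 q_2, r_3 = p_2 q_3
        p: (p1 * q2, p2 * q3),
        // epsilon = gamma * delta
        fee: phi.fee * psi.fee,
        // Each synthetic reserve is capped both by the reserves of the AMM
        // that pays it out and by what the other AMM can feed into that AMM.
        reserves: (
            r1.min(p2 / p1 * phi.fee * s2),
            s3.min(q2 / q3 * psi.fee * r2),
        ),
    }
}
```

With $φ$ given by $p = (1, 2)$, $γ = 0.5$, $R = (100, 40)$ and $ψ$ given by $q = (1, 4)$, $δ = 0.5$, $S = (30, 50)$, the composition has $r = (1, 8)$, $ε = 0.25$, and $T = (30, 5)$. The fee values here are chosen only so that the arithmetic is exact.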

Liquidity Positions

A liquidity position consists of:

A trading pair $(a_{1}, a_{2})$ recording the asset IDs of the assets in the pair. The asset IDs are $F_{q}$ elements, and the pair is made order-independent by requiring that $a_{1} < a_{2}$.
A trading function $φ$, specified by $p_{1}, p_{2}, γ$.
A random, globally-unique 32-byte nonce $n$.

This data is hashed to form the position ID, which uniquely identifies the position. The position nonce ensures that it is not possible to create two positions with colliding position IDs.

The reserves are pointed to by the position ID and recorded separately, as they change over time as trades are executed against the position. One way to think of this is to think of the position ID as an ephemeral account content-addressed by the trading function whose assets are the reserves and which is controlled by bearer NFTs recorded in the shielded pool.

Positions have four position states, and can only progress through them in sequence:

an opened position has reserves and can be traded against;
a closed position has been deactivated and cannot be traded against, but still has reserves;
a withdrawn position has had reserves withdrawn;
a claimed position has had any applicable liquidity incentives claimed.

Control over a position is tracked by a liquidity position NFT (LPNFT) that records both the position ID and the position state. Having the LPNFT record both the position state and ID means that the transaction value balance mechanism can be used to enforce state transitions:

the PositionOpen action debits the initial reserves and credits an opened position NFT;
the PositionClose action debits an opened position NFT and credits a closed position NFT;
the PositionWithdraw action debits a closed position NFT and credits a withdrawn position NFT and the final reserves;
the PositionRewardClaim action debits a withdrawn position NFT and credits a claimed position NFT and any liquidity incentives.
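The allowed transitions can be written down as a small table. The sketch below is only an illustration of that table: in the design above, enforcement comes from the value balance mechanism (debiting one LPNFT and crediting the next), not from a function like the hypothetical `apply`.

```rust
/// The four position states, in the only order they can be visited.
#[derive(Clone, Copy, Debug, PartialEq, Eq)]
enum PositionState {
    Opened,
    Closed,
    Withdrawn,
    Claimed,
}

/// Actions that move an existing position to its next state.
/// (PositionOpen is omitted because it creates the position.)
#[derive(Clone, Copy, Debug)]
enum Action {
    PositionClose,
    PositionWithdraw,
    PositionRewardClaim,
}

/// Returns the state of the credited LPNFT, or `None` if the action does not
/// debit an LPNFT in the given state.
fn apply(state: PositionState, action: Action) -> Option<PositionState> {
    use Action::*;
    use PositionState::*;
    match (state, action) {
        (Opened, PositionClose) => Some(Closed),
        (Closed, PositionWithdraw) => Some(Withdrawn),
        (Withdrawn, PositionRewardClaim) => Some(Claimed),
        _ => None,
    }
}
```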

Separating closed and withdrawn states is necessary because phased execution means that the exact state of the final reserves may not be known until the closure is processed and the position is removed from the active set.

However, having to wait for the next block to withdraw funds does not necessarily cause a gap in available capital: a market-maker wishing to update prices block-by-block can stack the PositionWithdraw for the last block's position with a PositionOpen for their new prices and a PositionClose that expires the new position at the end of the next block.

Separating withdrawn and claimed states allows retroactive liquidity incentives (e.g., $X$ rewards over some time window, allocated pro rata to liquidity provided, etc.). As yet there are no concrete plans for liquidity incentives, but it seems desirable to build a hook for them, and to allow them to be funded permissionlessly (e.g., so some entity can decide to subsidize liquidity on X pair of their own accord).

Routing

To route a trade, perform a graph traversal from the source asset $s$ to the target asset $t$, building up a path that tracks its component positions as well as the trading function of the whole path. Select the path with the lowest price, and fill as much of the trade as possible. If not all of the trade can be filled, repeat the process with the remaining quantity.

To perform the graph traversal, use a variant of Bellman-Ford. Maintain a mapping BTreeMap<asset::Id, Path> recording the optimal route from $s$ to $a$ for each intermediate asset $a$ we've considered. Starting at $s$, iterate over all neighboring assets $a$.

Fill in description

Each Path contains:

a "whole-path" trading function and synthetic reserves
data implying the list of positions we would execute against if we were to use the path
- do we want to track a list of positions here, or do we want to track just the list of intermediate assets?

The Path API would primarily be:

Path::extend(&self, edge: TradingFunction) -> Result<Path> extends self along edge or errors if the asset types don't line up
impl PartialOrd for Path where comparison is None if the start and end assets aren't equal, and otherwise compares by price
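One possible shape for this API is sketched below. Everything beyond the names `Path`, `TradingFunction`, and `extend` is an assumption made for illustration: the `AssetId` alias, the fields, the `String` error type, and the choice to compare paths by an effective output rate (so that the greater path gives more output per unit of input). Reserves and the list of positions are omitted.

```rust
use std::cmp::Ordering;

/// Hypothetical asset identifier (the document's asset IDs are field elements).
type AssetId = u32;

/// Hypothetical edge: a constant-sum trading function from `start` to `end`.
#[derive(Clone, Copy, Debug)]
struct TradingFunction {
    start: AssetId,
    end: AssetId,
    /// Effective output per unit of input, fee included.
    rate: f64,
}

#[derive(Clone, Debug)]
struct Path {
    start: AssetId,
    end: AssetId,
    /// Whole-path effective rate: the product of the rates of its edges.
    rate: f64,
    /// Assets visited after `start`, in order (the last one is `end`).
    hops: Vec<AssetId>,
}

impl Path {
    /// The empty path at `start`, with rate 1.
    fn new(start: AssetId) -> Path {
        Path { start, end: start, rate: 1.0, hops: Vec::new() }
    }

    /// Extends `self` along `edge`, or errors if the asset types don't line up.
    fn extend(&self, edge: TradingFunction) -> Result<Path, String> {
        if edge.start != self.end {
            return Err(format!("edge starts at {} but path ends at {}", edge.start, self.end));
        }
        let mut hops = self.hops.clone();
        hops.push(edge.end);
        Ok(Path { start: self.start, end: edge.end, rate: self.rate * edge.rate, hops })
    }
}

impl PartialOrd for Path {
    /// `None` if the endpoints differ; otherwise compares by rate.
    fn partial_cmp(&self, other: &Path) -> Option<Ordering> {
        if self.start != other.start || self.end != other.end {
            return None;
        }
        self.rate.partial_cmp(&other.rate)
    }
}

impl PartialEq for Path {
    /// Defined through `partial_cmp` so that equality and ordering agree.
    fn eq(&self, other: &Path) -> bool {
        self.partial_cmp(other) == Some(Ordering::Equal)
    }
}
```

Defining equality through `partial_cmp` keeps the two traits consistent: two paths with the same endpoints and rate compare as equal even if they pass through different intermediate assets.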

Data Types

TradingPair: a pair of assets, made logically order-independent by always choosing a canonical ordering on asset IDs
BatchSwapOutputData: describes the results of a batched swap
TradingFunction: describes the AMM, consisting of
- TradingPair declaring the trading pair
- BareTradingFunction with $R_{1}$, $R_{2}$, $p_{1}$, $p_{2}$
Position: contains a TradingFunction, and a nonce that should be globally unique (so that we can treat position NFTs as bearer assets).
Reserves: a pair of Amounts
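As an illustration of the canonical ordering on `TradingPair`, a constructor can sort the two asset IDs. The 32-byte `AssetId` alias and the byte-wise comparison are assumptions: the document only says that asset IDs are $F_{q}$ elements with $a_{1} < a_{2}$, and the actual order depends on how those elements are encoded.

```rust
/// Hypothetical 32-byte asset ID encoding.
type AssetId = [u8; 32];

/// A pair of distinct assets stored in canonical order, so that
/// `(a, b)` and `(b, a)` name the same pair.
#[derive(Clone, Copy, Debug, PartialEq, Eq, PartialOrd, Ord)]
struct TradingPair {
    asset_1: AssetId,
    asset_2: AssetId,
}

impl TradingPair {
    /// Returns `None` if both assets are the same.
    fn new(a: AssetId, b: AssetId) -> Option<TradingPair> {
        match a.cmp(&b) {
            std::cmp::Ordering::Less => Some(TradingPair { asset_1: a, asset_2: b }),
            std::cmp::Ordering::Greater => Some(TradingPair { asset_1: b, asset_2: a }),
            std::cmp::Ordering::Equal => None,
        }
    }
}
```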

DEX State

Consensus State

We need to record:

A global, append-only set of nonces used by existing positions
The set of positions for each trading pair (grouped by position state)
The current reserves for each position (only needed for open and closed positions)
The volume executed against each position (can be deleted at some point?)

Scratch:

dex/position_nonce/{nonce}
- Records nonces used by existing liquidity positions
- On PositionOpen, check that the nonce is not present, then add it
dex/positions/{trading_pair}/opened/{position_id}
- Records opened positions for the given trading_pair
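The nonce check on PositionOpen might look like the sketch below. It uses an in-memory `BTreeMap` as a stand-in for the consensus state; the hex encoding of the nonce in the key and the helper names `nonce_key` and `register_nonce` are assumptions.

```rust
use std::collections::BTreeMap;

/// Builds the state key `dex/position_nonce/{nonce}`, hex-encoding the nonce
/// (the encoding is an assumption).
fn nonce_key(nonce: &[u8; 32]) -> String {
    let hex: String = nonce.iter().map(|b| format!("{:02x}", b)).collect();
    format!("dex/position_nonce/{}", hex)
}

/// On PositionOpen: check that the nonce is not present, then add it.
/// Returns an error if the nonce was already used.
fn register_nonce(state: &mut BTreeMap<String, Vec<u8>>, nonce: &[u8; 32]) -> Result<(), String> {
    let key = nonce_key(nonce);
    if state.contains_key(&key) {
        return Err(format!("nonce already used: {}", key));
    }
    // Only the presence of the key matters, so the value is left empty.
    state.insert(key, Vec::new());
    Ok(())
}
```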

Nonconsensus State

Additional indices on position data (what would be useful?)
Cached routing data?
Pre-compute some common routes on liquidity position creation? i.e. to penumbra, atom, btc, usdc etc.
Store effective prices per block height and pair (tricky because they can change, is this even useful?)

scratch (the following is partly wrong)

Informally but slightly more precisely, a path consists of a list of liquidity positions, a fillable quantity $Δ$, and a price $q$.

To route an input trade $Δ_{a}$ from $a$ to $b$, initialize an empty path with $Δ = Δ_{a}$, $p = 1$, and an empty list of positions.

To extend a path along the liquidity graph (say via asset $c$), construct (or query) the list of all liquidity positions with trading pair $(a, c)$ with nonzero reserves $R_{a}$ of asset $a$, sorted by the effective price $γ p / q$ [1]. Select the position with minimum price, push it to the list, and update

$$Δ \leftarrow \min \{Δ, R_{a}\}, \qquad r \leftarrow r γ p / q.$$

How do we formulate this in terms of exact integer arithmetic?

Exact version of one hop:

pair $(a, c)$ with positions $\{(φ^{(0)}, R^{(0)}), (φ^{(1)}, R^{(1)}), \dots\}$ sorted by effective price $γ^{(i)} p^{(i)} / q^{(i)}$ with $R_{c}^{(i)} \neq 0$
- how do we construct byte strings such that the lex order on those byte strings is the numeric order on $γ^{(i)} p^{(i)} / q^{(i)}$?
- we have an ordered k/v store for nonconsensus indexes, and we want the key to be some byte string with the correct ordering and the value to be data we use for working with the position
- big-endian integer encodings have the property that the ordering on encodings is the ordering on integers, but we have a fraction
- set the key as big-endian repr of $⌊ 2^{128} γ^{(i)} p^{(i)} / q^{(i)} ⌋$?
- set the value as $(φ^{(i)}, R^{(i)})$? or as the position ID? if position data, we have what we need immediately, if position ID, we have to query again, but we can then name the position when we want to execute against it later. or, we could save both
Best price will be $(φ^{(0)}, R^{(0)})$, reserves are $R^{(0)} = (R_{a}^{(0)}, R_{c}^{(0)})$
We want to fill as much of the trade as possible against the reserves. There are two cases: either the reserves of the best position are enough to execute the entire trade, or they aren't, in which case we want to consume the entire available reserves, setting $R_{c} = 0$ exactly rather than a dust amount so that we'll exclude it from the list in future iterations.
The maximal output occurs when $Λ = R_{c}$. Since (with exact arithmetic) $Λ = γ \frac{q}{p} Δ$, we have $Δ = \frac{p}{q} \frac{1}{γ} Λ$, and we can write the input amount that causes the maximal output as $Δ_{\max} = ⌈ \frac{p}{q} \frac{1}{γ} R_{c} ⌉$. (Here, the correct way to ensure conservation of value is to round up, so that we consume more of the input amount).
- If $Δ_{\max} \leq Δ$, the maximal output is less than the desired output, so we should consume the entire reserves, setting $Λ \leftarrow R_{c}$ and $Δ \leftarrow Δ_{\max}$
- If $Δ_{\max} > Δ$, the maximal output is greater than the desired output, so we should
Suppose we do the following:
- Set $Δ' = \min \{Δ, R_{c}^{(0)}\}$
- Compute $Λ = \mathrm{round}(γ \frac{q}{p} Δ')$ ("correct" defn of round tbd)
- Update reserves $R \leftarrow R + (Δ, 0) - (0, Λ)$
  - this will accumulate fees into position
  - common case is that $Δ > Δ'$, in this case we want to execute so that $R_{c}^{(0)}$ is set to $0$ exactly (rather than a dust amount), so that in the future, we exclude it from the list
  - can we get that by working "forwards" from $Δ'$ to $Λ$ or do we need to work "backwards"?
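The big-endian key idea from the notes above can be tried out at a smaller scale. The sketch below is a simplification and not the proposed $2^{128}$ encoding: it scales by $2^{64}$ so that everything fits in a `u128`, and it treats the whole effective price $γ p / q$ as a single fraction `num / den` (for example, with the fee folded into the numerator and denominator). The function name `price_key` is hypothetical.

```rust
/// Encodes the fraction `num / den` as a 16-byte key whose lexicographic
/// order matches the numeric order of floor(2^64 * num / den).
/// Returns `None` if `den` is zero.
fn price_key(num: u64, den: u64) -> Option<[u8; 16]> {
    if den == 0 {
        return None;
    }
    // (num << 64) fits in a u128 because num < 2^64.
    let scaled = ((num as u128) << 64) / (den as u128);
    // Big-endian bytes compare lexicographically in the same order as the integers.
    Some(scaled.to_be_bytes())
}
```

Because of the floor, two distinct prices that differ by less than $2^{-64}$ map to the same key, so a real index would likely need something extra in the key (for instance the position ID) to keep entries distinct.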
tmp

[1] If the canonical ordering of the pair is $(c, a)$, this would instead be sorting by $γ q / p$, but we ignore this case for simplicity of presentation.

