# Defragmenting Liquidity with Asynchronous and Synchronous Composability
L2s have enriched the Ethereum ecosystem by horizontally scaling throughput, enabling new execution environments, lowering transaction costs, and reducing transaction latency. But L2s today act as isolated systems, instead of one unified layer on top of Ethereum’s L1. In particular, this has resulted in *liquidity* *fragmentation* across chains. It is [Espresso’s mission](https://medium.com/@espressosys/introducing-espresso-confirmations-946dbcc1c176) to fix this fragmentation — to build a future where **applications across all L2 chains can interoperate as if they were a part of one unified chain,** without compromising the benefits of having many unique chains.
This post examines how L2 fragmentation can be addressed through faster bridging of native assets, fast message passing, or to a greater extent, through making chains **synchronously composable.** Whether asynchronous or synchronous, bridging tokens between chains must be both direct (rather than through third-party liquidity providers) and provide the canonical (as opposed to bridge-wrapped) version of the token on the destination chain. L2 fragmentation can also be addressed by redesigning applications to act upon cross-chain messages alone, operating on tokens that live on other chains, and eliminating the need for cross-chain movement of assets entirely.
We also offer an original protocol, CIRC, for both fast asynchronous and synchronous composability among L2 chains.
### The Problems with L2 Bridging Today and Liquidity Fragmentation
For most assets (like ETH or DAI), moving them between Ethereum L2 chains today — without ending up with a wrapped asset on the destination chain — is both costly and takes a long time. It requires first withdrawing the asset from one chain onto the L1 and then depositing it into the second chain. This takes several days for optimistic rollups and several hours for ZK rollups. Even for future chains that promise to have fast ZK settlement proofs this will take at least 12-15 minutes due to the latency of Ethereum finality. Exceptions are special tokens like USDC, WBTC, or Ethena, which have centralized asset issuers. However these issuers do not work with all L2 chains. Bridging USDC over [CCTP](https://www.circle.com/cross-chain-transfer-protocol) still takes 12-15 minutes as it waits for L1 finality.
The friction of moving assets between L2 chains pushes each chain to replicate the services offered by every other chain, leading to liquidity fragmentation among other consequences. One such basic function that most chains need to offer their users is the ability to swap assets (i.e., provide decentralized exchanges like AMMs).
AMMs offer one of the most acute examples of liquidity fragmentation across L2s. When AMMs have greater liquidity (i.e., larger reserve pools) the trade slippage is lower, which is better for users. When every L2 has its own copy of an AMM, that forces liquidity providers to spread their capital across AMMs' liquidity pools instead of concentrating it all in one place. This redundancy results in all AMMs providing a worse experience for users.
Fast bridges should in theory help solve this problem. If users can quickly bridge assets to an L2 with high AMM liquidity, trade them there, and bridge the new assets back, that would eliminate the need for each L2 to have its own AMM.
The most widely used bridges today rely on third-party liquidity providers called solvers to address the latency and cost issues with bridging through the L1. However, these so-called *intent-based bridges* have a liquidity fragmentation problem of their own as they require solvers to hold native versions of assets on both sides of the bridge, rebalancing as needed. This is the sequence:
1. The user who wishes to bridge assets from one chain to another first authorizes a transaction escrowing tokens on the source chain.
2. The tokens are locked in a special escrow contract that will only release the tokens to the solver after it uses a cross-chain messaging protocol to verify that the solver paid the requisite amount of tokens (minus a fee) to the user on the second chain.
3. The solver pays the user on the second chain when it feels confident the user's escrow transaction has succeeded. If it trusts the sequencer of the first chain, it may fulfill the user's intent in a matter of seconds, though it takes on risk in order to provide a better user experience.
4. After paying the user, the solver can claim the tokens from the escrow contract on the first chain, but may need to wait a long time depending on the latency of the messaging protocol in use.
The safest messaging protocol for a contract on an Ethereum L2 chain is to read a message from another Ethereum L2 chain through the L1 itself. This does not require an L1 transaction. Most L2 chains periodically post their state Merkle roots to the L1 and also have access to the latest L1 state Merkle root. Thus, a contract on any one of these L2 chains can verify a Merkle proof about the past state of another L2 chain, including any messages it created. An example implementation is [RIP-7755](https://ethereum-magicians.org/t/rip-7755-contract-standard-for-cross-l2-calls-facilitation/20776). Passing messages securely in this way requires waiting for settlement of the sending chain’s state root, which is at least 12-15 minutes (just for the L1 to finalize this information) and up to several days for most rollups today.
Other messaging protocols like UMA, LayerZero, Hyperlane, and Wormhole rely on additional security assumptions, like offchain oracles or multisignature committees, but ultimately cannot securely attest to messages faster than the 12-15 minutes required for L1 finality.
Bridges that rely on solvers do not adequately address DEX liquidity fragmentation. To understand why, consider a scenario where a user holds ETH on a small rollup, but wants to swap their ETH for DAI. Becase the rollup doesn’t have much liquidity, the user is forced to bridge their ETH to a larger rollup (let's use Base in this example) to access liquidity. The sequence would look like this:
1. User pays a solver to bridge their ETH to Base
2. On Base, the user swaps their ETH for DAI
3. User then pays a solver to bridge the DAI back to their small rollup
In this case the solver needs to hold sufficiently large reserves of ETH and DAI on both the small rollup and Base, and is effectively running an exchange, acting as a market maker, and using the trade price on Base as a price oracle! The “bridging” is contrived. The solver might as well facilitate the trade directly, receiving ETH from the user and giving DAI to the user, all on the small rollup. The solver can choose how it wants to asynchronously rebalance in other exchanges elsewhere, or how it wants to use information from other exchanges to determine the rates it is willing to offer on specific L2s (which, for example, could be done using centralized exchanges in addition to L2 DEX protocols).
### Towards Faster Bridging and Defragmenting Liquidity
Passing messages at latency faster than 15 minutes, and ideally within seconds, either requires chains to trust each other's sequencers or adoption of an alternative protocol for fast confirmations of transaction completion on the respective L2s. Examples of these alternatives are L2s with economically staked sequencers that will be slashed for providing incorrect confirmations, or even better, a BFT consensus protocol run by economically staked validators, such as the Espresso Network). Paired with verifiable state proofs (e.g., ZK or TEEs), the BFT consensus validators do not need to execute transactions, only finalize their order in a way that prevents the L2 sequencers from equivocating when ultimately posting to the Ethereum L1. Introducing a BFT protocol in this way is a change to the way L2 chains settle today, but is also a first step towards sequencer decentralization.
It is also possible for the chains themselves to optimistically accept messages at low-latency from untrusted sequencers, and to catch discrepancies at settlement time — this is the design principle behind CIRC, our aforementioned stack-agnostic standard for coordinated and predictive message-passing between chains.
CIRC, which stands for Coordinated Inter-Rollup Communication, was inspired by Brendan Farmer’s earlier [post](https://mirror.xyz/0xfa892B19c72c2D2C6B10dFce8Ff8E7a955b58A61/TXMyZhhRFa-bjr7YHwmJpKBwt2-_ysirbh_VpNy3qZY) on Aggregated Blockchains in support of [AggLayer](https://github.com/AggLayer), and also reflected in the new [OP Interoperability protocol](https://specs.optimism.io/interop/overview.html). It pushes the problem of real-time message verification onto the rollup operators (who are incentivized to prevent a reorg) rather than being handled through the chain logic itself or through introducing new security assumptions. It is still important for rollup operators to check verifiable state proofs and fast BFT confirmations so that they have high confidence in the state of their rollup in real-time (as they generally do today instead of waiting for settlement time).
We’ll describe our design of CIRC in more detail later on. It enables faster asynchronous message passing, and even synchronous message passing when combined with a mechanism for coordinated block building (e.g., shared sequencing), all ultimately secured by the L1.
An immediate consequence of faster message passing is that solvers in intent-based bridges can recuperate their funds faster on the source chain. This allows them to rebalance faster and improves their capital efficiency. However, as discussed, faster message passing on its own does not immediately solve liquidity fragmentation.
One way to address liquidity fragmentation is through a shared L1 bridge. For example, this could be implemented as a dedicated L2 chain of its own, to which all assets are deposited from the L1 and withdrawn to the L1. This dedicated “bridge chain” maintains an accounting table for each deposited asset, recording which L2 chain has current custody of that asset. When it receives a deposit from the L1 destined for a particular L2 user account, it updates the table and sends a message instructing the destination L2 to mint the asset into the user’s account. A user sending an asset to an account on a different L2 creates a transaction that triggers a message to the bridge chain, which updates the accounting table and sends a message instructing the new destination L2 to mint the asset. Any L1 withdrawals of assets from one of these connected L2s similarly go through the bridge chain. This way, assets received on one L2 either directly from the L1 or from any other L2 participating in the shared bridge are all fungible. In particular, if only one of these L2 chains operates a DEX, then users on any other L2 can quickly bridge their assets over, swap, and bridge back, without relying on third-party solvers to provide liquidity for the trade.
All the L2s sharing this L1 bridge would be dependent on the security of the L1 bridge chain, both for the safety of assets they handle and for the liveness of withdrawals. For this reason, given the diversity of rollup designs today and the security/UX tradeoffs they make, it is hard to imagine all L2s across all rollup stacks sharing an L1 bridge for all token deposits.
However, the design goal of an L1 bridge can be generalized in the following way. With sufficiently fast message passing, assets may not need to be bridged between chains at all. Just as every asset on the L1 is governed by a unique token contract, every L2 asset could live on a unique L2 chain chosen by the token contract developers. For example, while OP would naturally live on Optimism, ARB on Arbitrum, and ZK on ZKSync, the MakerDAO developers would choose (or deploy) one chain to be the canonical L2 for DAI. An L2 chain hosting a DEX would not need to have any native token contracts at all! Instead, the DEX could list trading pairs of tokens that live on other L2 chains.
As an example, an AMM application on the Polygon zkEVM could manage a reserve of ARB tracked by Arbitrum One and a reserve of OP tracked by OP mainnet. To swap ARB for OP, a user would send the transaction to the Polygon AMM with the signatures required to authorize a transaction depositing ARB into the special account on Arbitrum One controlled by the external Polygon AMM. This transaction would cause the Polygon AMM to calculate a swap price and send a message to Arbitrum One triggering the deposit, wait for a message back confirming success, and finally send a message to Optimism to move the calculated amount of OP from the AMM’s account there into the user’s account. Technically, the Polygon AMM does not wait for a message response, rather, a transaction in the subsequent Polygon zkEVM block would consume the message response from Arbitrum One and generate the message for OP. In this example, the role of the Polygon AMM is to linearize and order all user swap transactions, while OP mainnet and Arbitrum One handle the actual movement of tokens between reserve accounts to user accounts.
Finally, there are ways in which liquidity fragmentation manifests that can only be solved with synchronous composability. An example is liquidity for flash loans. In a flash loan assets are borrowed from a liquidity pool, used to fund actions in other applications (e.g., arbitrage trades in AMMs), and returned in the same transaction. Getting this to work across liquidity pools and applications on different L2 chains requires all the involved actions on these different chains (including the messages passed between them) to occur as one [ACID](https://en.wikipedia.org/wiki/ACID) transaction, which is how we define synchronous composability next.
### Differences between synchronous and asynchronous composability
We define synchronous composability as the ability to execute a single, logical ACID transaction among applications deployed on different chains. ACID properties are the gold standard for interoperation between independent executions in databases, making them useful for composing independently executing L2s.
There are two key differences between synchronous and asynchronous composability under the ACID framework: isolation and atomicity.
#### Isolation
We define a *logical transaction* as an abstract set of user operations (i.e., a collection of transactions) needed to perform an intended task (or intent). These transactions do not all need to be associated with a single chain.
*Physical transactions*, on the other hand, are the individual transactions submitted to particular chains, which execute the user operations in a certain way. A single logical transaction may consist of several physical transactions.
In the case of asynchronous composition, a single logical transaction must be split into two or more parts that are sequenced separately and sequentially. Any number of arbitrary transactions can be sequenced between different parts, and these in-between transactions can read the intermediate state of the asynchronous intent. Thus, asynchronous composition forgoes an important tenet of ACID transactional guarantees: *Isolation*, the idea that even if separate transactions run simultaneously, the resulting state is as if each transaction were executed individually in sequence without the ability of outside parties to read any intermediate state of a transaction.
Consider the following example: Alice wants to send Bob X amount of ETH in return for Y USDC. Alice’s funds are on chain A, and Bob’s funds are on chain B.
In the asynchronous case, this logical swap transaction must be separated into several physical transactions:
1. Alice must first lock her funds on chain A, stipulating that she can reclaim these funds after some predetermined time period if they have not already been claimed.
2. Bob must send Y USDC to Alice’s address (on chain B).
3. Bob can claim his ETH.
However, consider what could happen because this logical transaction does not have Isolation:
1. Alice locks X ETH as agreed, which causes the price of ETH to go up.
2. Bob sees this price change and decides to sell X ETH for a higher price than Y/X, and then “buy back” ETH from Alice at the agreed swap price Y/X.
This arbitrage opportunity is possible because exchanges could observe and react to the change in available ETH in the middle of a logical transactional swap in an asychronous context.
Synchronous composability, on the other hand, guarantees Isolation by ensuring that no outside physical transactions can be sequenced in the middle of a logical transaction. In the case of our example above, under synchronous composition no one would be able to observe the intermediate state of the swap. Exchanges would not be able to observe the change in ETH supply caused by Alice’s transaction until the entire logical transaction had completed.
#### Atomicity
While it is possible to achieve some level of atomicity using asynchronous composition through complicated cross-chain flows, asynchronous composability often does not enforce atomicity (the A in ACID). Instead, transactions on individual L2 chains which are part of cross-chain flows are executed *sequentially* in asynchronous composability, with no guarantee that the entire sequence will be successful. Additionally, without a fast confirmation layer, executing cross-chain transactions asynchronously leads to much slower confirmation times than synchronous execution.
Instead, synchronous composability enforces atomic settlement of each L2 chain’s state. Blocks across chains settle within a single L1 transaction, guaranteeing that an entire logical transaction executes successfully or reverts.
### How to Achieve Synchronous Composability
There are three components needed for synchronous composability:
#### 1. Aggregated Settlement
First and foremost, synchronous composability requires chains to settle *atomically* — meaning the settlement of multiple transactions spanning multiple L2s occurs within a single L1 transaction. This requires aggregating the settlement decisions across multiple L2s into a *single all-or-nothing decision*. Aggregate settlement is the key component to creating atomic cross-chain transactions.
An aggregate settlement[^1] layer performs two validity checks before settling L2s: First, it checks the execution validity of each L2 with regards to a specific input (i.e., a specific transaction ordering). Second, it checks inter-rollup consistency (i.e., that input and output pairs across chains match). Only if both these conditions pass does the aggregate settlement layer settle each L2.[^2]
#### 2. Coordination
The aggregated settlement layer *verifies* conditional dependencies among chains, but coordination determines what these conditional dependencies are. Coordination occurs before aggregated settlement.
The hallmark of synchronous composition over atomic execution is the ability for chains to rely on state from other chains. Specifically, it is the ability for one chain’s state transition function to take as input the output of another chain’s state transition function. Coordination is the process of passing the correct inputs into each chain to determine the outputs.
Synchronous composability creates an abstraction of a shared state. L2s can be modeled as processes in traditional concurrent programming. These processes use channels to pass state between each other, but each process has its own local state. In this analogy, coordination is like the kernel that executes the entire program \- it records the state passed between processes during execution, and outputs this data for later use.
The most natural coordinators are shared sequencers. Shared sequencers already order and execute transactions for multiple chains at the same time, meaning they are already recording the input and output data for each chain. However, there are protocols, such as CIRC, that can be adapted to allow synchronous composition between chains that do not share a sequencer.
#### 3. Confirmation Layer
Parties involved in synchronous composability do not necessarily trust each other. This is why the aggregate settlement layer must verify the correctness of each chain and verify the consistency of state between chains. But first, the data required for these checks must be confirmed in an irreversible, neutral place.
A basic approach would be to use Ethereum as this confirmation layer. The shared sequencer for chains A and B would post to the L1 the sequenced block data along with input/output messages for each chain. Once the L1 has finalized, chains A and B will use this data to execute their blocks independently of one another, and, finally, post execution proofs to the aggregate settlement layer. Chain sequencers could not start building their next block until the previous block was *confirmed* on Ethereum, increasing L2 block times by an order of magnitude.
Ethereum works as a confirmation layer, but it is slow and expensive. Instead, we can greatly improve latency and cost by using a fast, yet neutral, confirmation layer (e.g., the Espresso Network). A performant confirmation layer is able to offer synchronous composability and faster asynchronous composability.
### Introducing CIRC
Finally, we propose CIRC, our Coordinated Inter-Rollup Communication protocol. Composability requires chains to use a standard protocol to communicate with each other. This is analogous to TCP, HTTP, or SSH communication standards that underpin the internet today. CIRC defines such a messaging standard and offers a potential algorithm that uses the messaging standard to enforce asynchronous or synchronous composability. CIRC has the following desirable properties:
* L2s do not need to alter their existing VM. Enforcement and message handling can be done via smart contracts deployed on each chain.
* The protocol allows parallel proving among chains. L2s interoperating between each other can execute and prove state transitions separately.
* The protocol is cryptographically enforceable. It does not rely on economic security or slashing mechanisms for safety.
* The messaging protocol does not impose any format on the messages’ content, nor any constraints on how those messages are interpreted. As a consequence, CIRC naturally supports a diversity of VMs.
Below we briefly introduce the protocol. First we describe the messaging standard itself, which is a tool used to achieve composability when combined with another workflow. Second, we describe such a workflow that uses this standard to achieve both synchronous or asynchronous composability.
For a more detailed explanation of CIRC, [read our short paper](https://espresso.discourse.group/t/circ-coordinated-inter-rollup-communication/43) on the Espresso Research forum.
![][image1]
#### Messaging Standard
Each synchronously composable chain uses a *mailbox* smart contract to house input and output messages. The inbox stores all messages sent to that chain, and the outbox stores all messages sent from that chain.
Messages in the mailbox are stored in a hashmap. The key of this map is composed of: *source chain id, destination chain id, sender address, receiver address, session identifier,* and *label*. The first four fields route messages to the correct place. The *session identifier* field represents a specific execution context shared among all chains, while the *label* field differentiates messages within the same execution context. The value of the hashmap is the message contents itself, which has no specified format.
Transactions themselves have permission to write directly to the outbox smart contract via function calls. The inbox smart contract is written to by a coordinator, such as a shared sequencer. Any address is allowed to read from the inbox. All messages that are sent from one chain must end up in the mailbox of the other chain. However, it is not required that any smart contract actually reads every message in its inbox.
So long as each chain’s mailbox conforms to the CIRC standard, mailboxes can be written in any language or VM and maintain compatibility.
#### Workflow for Synchronous and Asynchronous Composability
The inbox for each rollup block is filled by a coordinator, and the outbox for each rollup is filled by transactions’ calls to the outbox smart contract. To do this, the coordinator executes rollup blocks together using a standard parallel execution algorithm. When the coordinator reaches a transaction that sends a message to another chain, the coordinator writes this message to the inbox of the other chain.
The coordinator publishes each chain’s transaction data and the total set of inboxes across all chains to the confirmation layer (e.g., the Espresso Network).
All chains share a settlement contract on the L1 as required by aggregate settlement (step 5 in Figure 1). This contract:
1. Verifies each chain’s execution is a valid result using the block data posted to the confirmation layer as input. This includes verifying that the claimed outbox is a result of this execution.
2. Verifies that the set of inboxes and outboxes are consistent across all chains. More precisely, the settlement contract verifies that the set of the union of all inboxes equals the set of the union of all outboxes.
If either of these conditions is untrue, the settlement contract refuses to settle blocks for all chains involved.
To illustrate the intuition behind the CIRC design, consider an atomic, cross-chain, train-hotel booking scenario: train tickets are sold on rollup A, and hotel rooms are sold on rollup B. Users want all-or-nothing booking on both. Let’s walkthrough how this example works with the CIRC protocol:
1. The user sends two transactions to the shared sequencer of chains A and B:
* Tx1 on chain A purchases a train ticket and then writes to the outbox of chain A that the purchase was successful. In that *same* transaction, chain A also reads a message from chain B indicating that the hotel booking was also successful.
* Tx2 on chain B reads the message that the train ticket was successfully booked. It then books a hotel room and writes a message back to chain A saying that the hotel was successfully booked.
2. A shared sequencer builds a block for both rollups. While executing both transactions above, the shared sequencer is able to determine the contents of the messages passed between chains.
3. The blocks built by the shared sequencer are sent to a fast confirmation layer like the Espresso Network.
4. Once the block data is confirmed, provers and full nodes from each rollup can read the data from the confirmation layer and execute the transactions.
5. Finally, the aggregated settlement contract on L1 accepts rollup proofs from provers or full nodes and mailbox states from the confirmation layer. It verifies the proofs, and conducts mailbox consistency checks. Upon successful verification, the aggregate settlement layer updates the state of both rollups on the L1.
The above example illustrates using CIRC in a synchronously composable scenario that requires atomicity. However, CIRC can be adapted to enable asynchronous composability as well.
To read the CIRC short paper and join the discussion, visit [the Espresso Research forum](https://espresso.discourse.group/t/circ-coordinated-inter-rollup-communication/43).
[^1]: Aggregate settlement layers can work for both ZK rollups and optimistic rollups.
[^2]: In this post we describe aggregate settlement as rejecting an entire L2 block if verification fails. Espresso is researching ways to reject only the inconsistent state instead of the entire block.
[image1]: <>