Pikachu RFP: Checkpointing Filecoin onto Bitcoin

Motivation

Blockchains based on a reusable resource (such as proof-of-stake or proof-of-space) are not as secure as those based on proof-of-work. Specifically, they are vulnerable to long-range attacks (LRA), where an adversary can create a long fork very cheaply.

Long-range attacks rely on the inability of a user who disconnects from the system at time \(t_1\) and reconnects at a later time to tell that validators who were legitimate at time \(t_1\) and left the system (by e.g., transferring their stake) are not to be trusted anymore.

In a PoS system, where the creation of blocks is costless and timeless, these validators could create a fork that starts from the past, i.e., at time \(t_1\), and runs until the present. This is unlike, for example, proof-of-work (PoW) systems, where creating blocks requires time and money (e.g., performing actual computation) and not just using cryptographic keys. A user would be unable to recognize the attack as they are presented with a "valid" chain fork. Because the past keys do not hold value in the present, previous validators can easily be bribed by an adversary intending on performing this attack.

We propose to design a checkpointing mechanism that leverages the security of proof-of-work blockchains by anchoring the state of the Filecoin chain onto the Bitcoin blockchain. In the case of long-range attacks, the Bitcoin chain can be used to determine the honest chain. We present our current design below, before explaining its limitations and some associated open problems.

Initial Design

Intuition

We aim to design a solution to LRA, inspired by Steinhoff et al. (who showed how to do this on Ethereum (Eth 1.0)) using Bitcoin’s PoW, as Ethereum is moving to proof-of-stake. However, the implementation and design of such a scheme on Bitcoin is more challenging, compared to the implementation on Eth 1.0 of Steinhoff et al., because Bitcoin’s expressivity is considerably more limited. Besides, the approach designed by Steinhoff et al. leverages multi-signatures for anchoring, which can quickly bloat the transaction size, making it, at worst, impossible to anchor PoS networks with large number of validators, and, at best, very costly to do so.

To address this constraint, our approach is to use the capabilities enabled by the recent Taproot upgrade to Bitcoin, which allows for more efficient Schnorr threshold signatures. As Bitcoin does not allow for stateful smart contracts, we will instead use an aggregated public key to represent the set of validators in the PoS system. When the set changes, the aggregated key must be updated in the Bitcoin blockchain. This is done by having a transaction transferring the funds associated with the aggregated key of the previous validators to the new aggregated key. Instead of having each validator send a transaction to the Bitcoin network, this transaction is signed interactively, off-chain, and all the signatures are aggregated into one constant-size signature.

We note that since our work is based on Schnorr threshold signatures and uses Bitcoin's Taproot, it could be of independent interest to any project looking to implement large-scale threshold signing transactions on Bitcoin (for example, sidechains).

High-level Protocol Description

Each configuration \(C_i\) (i.e. set of participants) is associated with a Taproot public key \(Q_i\) that consists of an internal key, in this case an aggregate public key \(pk_i\), that participants computed with an interactive DKG protocol and a tweaked part (see figure below).

We choose to tweak the internal key using a commitment to the PoS chain (i.e., the hash of the state of the PoS blockchain), i.e., we have: \(Q_{i} = pk_{i} + H_{TapTweak}(pk_{i}||ckpt)G\). Each player \(j\) in the configuration then knows a share of the secret key associated with \(pk_i\), \(s_{i,j}\), such that \(t_i\) of the shares are enough to compute a valid signature on any message, but fewer than \(t_i\) participants cannot compute a signature.

Configuration \(C_i\) is responsible for anchoring the state of the PoS chain at this point in time in the Bitcoin blockchain, which also includes updating the new configuration. In order to do so, the new configuration \(C_{i+1}\) must first compute their aggregated public key \(pk_{i+1}\) using the DKG algorithm.

This key is then tweaked using a commitment \(ckpt\) to the PoS chain (i.e., the hash of the PoS chain at that time). The tweaked key becomes \(Q_{i+1} = pk_{i+1} + H_{TapTweak}(pk_{i+1}||ckpt)G\).

Note that only the tweaked key will appear on the blockchain so the hash \(ckpt\) will not be visible by anyone looking at the blockchain without external knowledge. However, anyone who has access to \(pk_{i+1}\) and \(ckpt\) can easily reconstruct \(Q_{i+1}\) to verify that their view of the PoS chain is correct.

To update the configuration from \(C_i\) to \(C_{i+1}\), a transaction from \(Q_i\) to \(Q_{i+1}\) must be included in the Bitcoin blockchain. The transaction needs to be signed by \(t_i\) participants from configuration \(C_i\) where \(t_i\) is chosen to be strictly more than \(f|C_i|\) as this ensures that at least one honest participant signs, preventing an adversary from signing an illegitimate transaction. We will use the FROST algorithm for signing.

Since we assume that online validators can distinguish a LRA chain, it is enough to have the transaction signed by \(t_i\) participants as no honest validators can be fooled into signing an illegitimate transaction. If forks were allowed in the case of an adversary with only \(f\) fraction of the power (i.e., outside of LRA forks), this would be more problematic, as two conflicting transactions could then be signed, and we would require at least two thirds of the participants to sign the transaction, for \(f=1/3\) (this could be fixed by considering a block in the past, i.e., one that has been finalized).

In addition to the transfer of coins from \(Q_i\) to \(Q_{i+1}\), the transaction spent by configuration \(C_i\) will have a second output that does not receive any bitcoins and that is unspendable, but that contains an identifier \(cid\) used to retrieve the full details of the configuration. This is done using the \(OP\_RETURN\) opcode of Bitcoin that allows storing of extra information in the chain.

This identifier will be useful in the case where a user does not have access to the right PoS chain (i.e., does not have the correct value for \(pk_{i+1}\) and \(c\) due to a LRA). In this case, the content identifier \(cid\) can be used, together with a content-addressable decentralized storage, for example IPFS (or content-addressable storage implemented on PoS network validators) to retrieve the identities of the nodes in the correct configuration.

The transaction updating the configuration will look as follows:
\(tx_i:Q_i\rightarrow((\textsf{amount},Q_{i+1}),(0,OP\_RETURN=cid))\), meaning that \(\textsf{amount}\) is transferred to \(Q_{i+1}\) and 0 is transferred to \(OP\_RETURN=cid\) (unspendable output).
This information is then publicly available.

Limitations and open problems

The approach presented above is limited mostly due to the heavy communication cost of the threshold DKG, where each participant must share a secret with the rest of the participants. This limitation is even more serious when considering a non-flat model, where a participant must hold a number of keys proportional to their amount of stake/power in the system. With a blockchain like Filecoin, we could easily end up with hundreds of thousands or even millions of keys in total. This solution is hence not viable for large blockchains. We are thus interested in developing a solution that could scale to hundreds of thousands of keys or incorporate the power without dramatically increasing the number of keys necessary.

Although scalable threshold DKG and signatures schemes have recently been proposed, none of them are currently compatible with Bitcoin (e.g. BLS signature, Mithril). Solving the following open problems would help scale our solution to thousands or millions of nodes:

Non-interactive DKG compatible with Schnorr threshold signing schemes on the secp256 curve and scalable to thousands of nodes.
Parallelized DKG: group the participants into different subgroup of some "size" (i.e., power) and have them compute a DKG inside this subgroup, then merge the keys of all the subgroups together; this approach is an open problem as the interactions between subgroups to compute the final threshold key would be highly complex.
Efficient share aggregation: aggregate the shares associated with one participant, such that the complexity of our protocol is quadratic in the number of players and not the number of "unit of power" (i.e., moving from a flat to non-flat model should not significantly increase the communication complexity of the algorithm); some shares can currently be aggregated but not to the point of being equivalent to a flat model.
Sampling: similar to the approach used in Mithril, one could elect a subset of the participants to perform the signing instead of having everyone contribute; the issue with this approach is that, in our case, the sample elected to perform the signing must be known ahead of time (as they need to compute the DKG beforehand) and hence could be corrupted by an adversary prior to the signature. This is unlike the approach in Mithril, where the sample elected is revealed at the time of creating the signature.

Marko Vukolic

2022/03/16 15:39:03

We present our current design below, before explaining its

This does not read like an RFP, perhaps we should focus on open problems with simply pointers to this content, which should be elsewhere (Edited)

2022/03/16 15:40:10

ecently, [Steinhoff et al.](https://arxiv.org/abs/2109.03913) proposed an approach to deal with LRA by anchoring the PoS membership into Ethereum’s proof-of-work (PoW) blockchain (Eth 1.0), which is not vulnerable to this type of attack. The main idea of their work is to have a smart contract on the Ethereum blockchain that keeps track of the state of the membership of the underlying PoS system. In a typical Byzantine Fault-Tolerant (BFT) protocol underlying PoS, the smart contract on Ethereum would only be updated if two thirds of the current staking power (or blockchain members in case of uniform voting rights) instruct the smart contract to do so. In the approach of Steinhoff et al. each validator will send a transaction to the smart contract that indicates a vote for a new set of validators. As soon as two thirds of the votes for the same set have been received, the smart contract automatically updates its state to the new set. From this moment on, the members of the new set are in charge of voting for the next set and so forth. Every user that needs to verify the set of validators can do so by simply checking the smart contract. An adversary cannot change the state of the smart contract, even with the keys of former validators. A LRA will never succeed, as any user can resort to the Ethereum smart contract to verify the correct state of the PoS chain. Unfortunately, as Ethereum is abandoning PoW, this approach is no longer viable. PoS of Eth 2.0 cannot be used instead of PoW for anchoring as it too suffers from the LRA vulnerability.

I'd remove this Ethereum discussion and give a reference (which comes in the next paragrph, perhaps also with our spec). These 3 paragraphs do not contribute much. (Edited)

Jorge Soares

2022/03/16 21:24:43

It depends on how broad we want it to be. Some of our previous RFPs were /very/ specific in what directions they wanted to pursue. (Edited)

2022/03/16 15:37:39

and timeless,

In light of this last talk of Joachim Neu and e.g., Algorand, is this timeless the right word? (Edited)

2022/03/16 21:28:01

(ignore the above if you meant "elsewhere in this document", which seems to be the current state) (Edited)

Syntax	Example	Reference
# Header	Header	基本排版
- Unordered List	Unordered List
1. Ordered List	Ordered List
- [ ] Todo List	Todo List
> Blockquote	Blockquote
Bold font	Bold font
Italics font	Italics font
~~Strikethrough~~	~~Strikethrough~~
19^th^	19^th
H~2~O	H₂O
++Inserted text++	Inserted text
==Marked text==	Marked text
[link text](https:// "title")	Link
![image alt](https:// "title")	Image
`Code`	`Code`	在筆記中貼入程式碼
```javascript var i = 0; ```	`var i = 0;`	在筆記中貼入程式碼
:smile:		Emoji list
{%youtube youtube_id %}	Externals
$L^aT_eX$	L^aT_eX
:::info This is a alert area. :::	This is a alert area.