Rust: Command-Line Options for CFI

References, and existing work: - [Original Zulip thread in t-compiler](https://rust-lang.zulipchat.com/#narrow/stream/131828-t-compiler/topic/.60-Zbranch-protection.60.20stability) - [... continued in project-exploit-mitigations](https://rust-lang.zulipchat.com/#narrow/stream/343119-project-exploit-mitigations/topic/CFI.20option.20stabilisation) - [`branch-protection` tracking issue](https://github.com/rust-lang/rust/issues/93754) - [`cf-protection` (CET) tracking issue](https://github.com/rust-lang/rust/issues/93754) - [Tracking issue for sanitizer support](https://github.com/rust-lang/rust/issues/39699) - [Tracking Issue for LLVM Control Flow Integrity (CFI) Support for Rust](https://github.com/rust-lang/rust/issues/89653) # Summary |Options like this... |Become... | |-----------------------------------------|------------------------------------------------------| |_Various target-specific CFI choices..._ |`-Cabi-variant=cfi` | |`-Zbranch-protection=pac-ret,bti` |`-Zabi-variant=aarch64:branch-protection:pac-ret,bti` | |`-Ccontrol-flow-guard=yes` |`-Cabi-variant=windows:control-flow-guard:on` | # Objectives Our long-term objectives are **to make CFI mitigations easy to enable in Rust/Cargo, project-wide.** However, the scope of _this_ proposal is simply **to define a command-line interface for controlling CFI options**, as a step towards the overall objective. In particular, whilst the command-line interface for stable and unstable options is considered, no additional stabilisation is considered here today. ## Background and scope Typically, CFI (control-flow integrity) mitigations are implemented as target-specific ABI variants. Rust already has some support for enabling these variants, for example as target-specific CodeGen options: - [`-Zbranch-protection`][] (for AArch64), - [`-Zcf-protection`][] (for x86), - [`-Ccontrol-flow-guard`][] (for Windows). _In addition, this proposal overlaps somewhat with [`-Zsanitizer`][]. See the [open questions (at the end)](#-Zsanitizer), for a discussion._ However, they aren't straightforward to use: 1. All but `-Ccontrol-flow-guard` are unstable. 2. The unstable (Cargo) `build-std` option is required to get a `std` built with the same flags. 3. `cc-rs` does not forward these settings to the C compiler, so included C code (including some examples in `std`) will not include mitigations. A similar, but technically distinct problem occurs with the C library, which is usually dynamically linked with Rust binaries. Since this is already an external dependency, we might consider this out of Rust's scope, but that at least requires documentation. Note that, to be effective, most of these CFI techniques require that _all_ code in a process is protected. Even unreachable binary code can be targeted by CFI attacks if it is loaded into an executable memory region. Usually, this is a security concern, but we also consider that correct deployment may be required for the _correctness_ of some techniques. We hope to find solutions to these problems, perhaps by stabilising `build-std`, but such solutions are out of scope of this proposal. One minor inconsistency with the existing options is the handling of errors: `control-flow-guard` is ignored for targets other than Windows, whilst `branch-protection` fails if the target is not AArch64. This proposal aims to unify the behaviours here. [`-Zbranch-protection`]: https://doc.rust-lang.org/unstable-book/compiler-flags/branch-protection.html [`-Zcf-protection`]: https://doc.rust-lang.org/unstable-book/compiler-flags/cf-protection.html [`-Ccontrol-flow-guard`]: https://doc.rust-lang.org/rustc/codegen-options/index.html#control-flow-guard [`-Zsanitizer`]: https://doc.rust-lang.org/unstable-book/compiler-flags/sanitizer.html ## Use-cases and motivation ### Rust-C interoperability C (or C++) projects might already deploy CFI mitigations, and this might even be mandated by a system's security analysis team. If they introduce Rust code, they will need a way to ensure that it implements the same standard of mitigation. This may also be required for some mixed-language LTO use-cases. **Rust should provide options that map, in a simple and easy-to-audit fashion, onto common C compiler options.** In addition, if new options are added to future C compilers, we should be able to support them in Rust too. ### Pure-Rust applications A notable disadvantage of matching C compiler options is that they are target-specific, which makes them difficult to use in a project that is otherwise target-agnostic Rust. **Rust should provide simple, target-agnostic Rust options that map, in a target-specific manner, onto some ABI variant.** The specific mitigations used will vary from one target to another, and for some targets may not be possible at all. The behaviour should be "best-effort": targets that don't implement the option can simply ignore it, perhaps with a warning. ### Fine control where required Fine control is required for Rust-C interoperability, but may also apply to pure-Rust programs, providing very fine control over the mitigations used, at expense of portability. For example, the `pac-ret` scheme for AArch64 can be deployed with one of two keys ("A" or "B"). **Rust should allow this flexibility**, even if it is not exposed by default. This is quite unlike the general, best-effort approach. If fine control is requested, it would be misleading for Rust to ignore it, so Rust should instead refuse to build the project. ### Room for future growth C compilers typically have to be concerned about compatibility, so that compiled functions can be linked together without recompilation. Rust's approach of statically linking (almost) everything somewhat frees it from this restriction, and potentially allows it to deploy mitigations very quickly, at least when using the Rust ABI. In addition, we observe that whilst all mitigations listed here concern CFI, that need not always be the case. For example, AArch64 Pointer Authentication can also be used for data protection. Ideally, the option name should be chosen with this in mind. We propose no new ABI variants today, but the command-line interface proposal should allow for such additions. ### Turning things off Currently, none of these features are enabled by default. However, if that were to change (or if a default was set earlier on the command line), it can be useful to have a means to turn them off. This is possible today for `cf-protection` and `control-flow-guard`, but not for `branch-protection`, which is purely additive. It would be useful to make these consistent. # The proposal ## `-Cabi-variant=...` / `-Zabi-variant=...` 1. This is principally a [CodeGen option][] so it makes sense under `-C` rather than at the top level. 2. Keeping both `-C` and `-Z` forms provides a deployment path for new options that avoids them being instantly stable after `abi-variant` itself is stable. 3. Grouping everything under one option makes similar options for different targets easier to find, and makes it easier to keep the interface consistent. 4. The existing, stable `-Ccontrol-flow-guard` should remain, for backwards-compatibility, but becomes an alias for `abi-variant` options. 5. The existing, unstable `-Zbranch-protection` and `-Zcf-protection` should also remain for some transition period, but may be removed in the future. 6. The name `abi-variant` is not currently in use, so should not come with any particular behaviour expectations. 7. The name is not specific to CFI. [CodeGen option]: https://doc.rust-lang.org/rustc/codegen-options/index.html The option's value is a comma-separated list of zero or more `<variant>` names, preceded by zero or more `<group>:` selectors. However, behaviour differs slightly if no `<group>` is provided. ### `<variant>` Top-level variants (without any `<group>`) are high-level, best-effort protections. Suggested flags include: - `backward-edge-cfi`, which enables best-effort backward-edge CFI. - This might enable `aarch64:branch-protection:pac-ret`, `x86:cf-protection:return`, something else, or nothing at all. - `forward-edge-cfi`, which enables best-effort forward-edge CFI. - This might enable `aarch64:branch-protection:bti`, `x86:cf-protection:branch`, something else, or nothing at all. - `cfi` enables both of the above. - `none` (the default) disables all top-level protections. All of these top-level variants work on a best-effort basis. It is an error to name a variant that doesn't exist, but if a known variant cannot be implemented for the compiler target, it is simply ignored. In addition, Rust may change the exact meanings of these flags, for example if new techniques are introduced in future versions. Top-level variants are treated as simple shortcuts for target-specific variants _that the target platform supports_, and are intended to avoid target-specific configuration in build infrastructure. ### `<group>:[<group>:]...[<variant>,]...` `<group>`-specific variants are treated as precise, target-specific controls. Typically, we expect the first `<group>` to specify the target (or group of targets), and the next `<group>` to match the name of the de-facto C compiler option (if there is one). For example: - `-Cabi-variant=aarch64:branch-protection:pac-ret,bti` - `-Cabi-variant=x86:cf-protection:branch` - `-Cabi-variant=x86:cf-protection:branch,return` (or equivalently, `x86:cf-protection:full`) - `-Cabi-variant=windows:control-flow-guard:on` Groups may be named loosely, but are chosen as a compromise between verbosity and specificity. Notably, we don't expect to see full target triples here. For example, "aarch64" would group features that only make sense on `aarch64*` targets, but we don't require that _all_ `aarch64*` targets can actually implement the included `<variants>`. It is an _error_ to specify a variant under a named group unless that variant is actually supported by the target. ### Combinations It is valid to combine target-specific variants with a more general top-level baseline, but we provide no forward-compatibility guarantees for this use-case. For example, the following will work according to this proposal, but if `cfi` is later updated to use something other than `pac-ret`, then `b-key` will be invalid: `-Cabi-variant=cfi -Cabi-variant=aarch64:branch-protection:b-key` However, it may be useful to permit these combinations for use-cases where we want a "good" baseline, but want fine control over some aspects: `-Cabi-variant=forward-edge-cfi -Cabi-variant=aarch64:branch-protection:pac-ret,b-key` **Is this behaviour reasonable? Should we instead forbid mixing these at all?** ### Evaluation Evaluation should be intuitive, but the exact rules are described here to avoid misunderstandings. The existing options have slightly different behaviours, so different readers may have different intuitions. Variants are evaluated in command-line order, then first to last in the comma-separated list. Variants are typically additive, but that isn't required. We expect `none` and `off` to have the effect of disabling the whole `<group>` (but nothing outside it). We don't check validity until the last `abi-variant` has been parsed. Note that this differs from today's `branch-protection`, for example, which checks validity as it goes (and therefore permits `pac-ret,leaf` but not `leaf,pac-ret`). `<group>` selections can only be specified once per `-Cabi-variant`, but in all other respects, multiple `abi-variant` options are equivalent to a single option with multiple `<variant>`s. For example, the following are all equivalent (assuming that `none` is the default): - `-Cabi-variant=aarch64:branch-protection:pac-ret,bti` - `-Cabi-variant=aarch64:branch-protection:pac-ret -Cabi-variant=aarch64:branch-protection:bti` - `-Cabi-variant=aarch64:branch-protection:b-key -Cabi-variant=aarch64:branch-protection:none,bti -Cabi-variant=aarch64:branch-protection:pac-ret` ## `-Zabi-variant` The unstable form of the option should remain permanently, and behave identically to the stable form. The only difference is that the unstable `abi-variant` can accept unstable `<variant>`s. This allows us to extend the interface gracefully, without breaking stability rules. Notably, it is possible to combine both forms. The following are all equivalent (with a nightly compiler): - `-Cabi-variant=forward-edge-cfi -Zabi-variant=backward-edge-cfi` - `-Cabi-variant=cfi` - `-Zabi-variant=cfi` ## Initially-supported variants |`-Zabi-variant=...` |Meaning | |---------------------------------|---------------------------------------------------| |`aarch64:branch-protection:...` |`-Zbranch-protection=...` | |`x86:cf-protection:...` |`-Zcf-protection=...` | |`windows:control-flow-guard:...` |`-Ccontrol-flow-guard=...` | |`forward-edge-cfi` |_Varies with target._ | |`backward-edge-cfi` |_Varies with target._ | |`cfi` |`-Zabi-variant=backward-edge-cfi,forward-edge-cfi` | Once `abi-variant` itself is stabilised: |`-Cabi-variant=...` |Meaning | |---------------------------------|---------------------------------------------------| |`aarch64:branch-protection:...` |_Rejected until `branch-protection` is stable._ | |`x86:cf-protection:...` |_Rejected until `cf-protection` is stable._ | |`windows:control-flow-guard:...` |`-Ccontrol-flow-guard=...` | |`forward-edge-cfi` |_Varies with target._ | |`backward-edge-cfi` |_Varies with target._ | |`cfi` |`-Cabi-variant=backward-edge-cfi,forward-edge-cfi` | # Open questions ## `-Zsanitizer` There is considerable overlap between this proposed `{-Z,-C}abi-variant` and the existing (unstable) `-Zsanitizer`, because many of the available sanitizers could be viewed as ABI variants. **Question: should this proposal be merged with `-Zsanitizer`?** `{-Z,-C}abi-variant` has three notable properties that `-Zsanitizer` lacks: 1. Target-specific variants are distinguished, at least partially. 2. Best-effort options can be described. 3. Individual variants can be stabilised without affecting other unstable variants (though the same technique could be applied to `{-Z,-C}sanitizer`). It is clearly undesirable to maintain `abi-variant` mappings for every `sanitizer` option. Would it instead be acceptable to _replace_ `sanitizer` with `abi-variant`? Alternatively, if the `abi-variant` behaviours described here are deemed useful, should we attempt to fit them under the existing `sanitizer` option? Also note that the naming might require further attention. `abi-variant` currently covers technologies that are designed to be used in release builds, in production. They will have a performance and code-size overhead, but should be "light". Conversely, the term `sanitizer` suggests more comprehensive debug and test tools, though many of its options are clearly designed for deployment too. # Answers to common or expected questions ## Can't this be a target feature? The ABI variant describes what the compiler should _do_, whilst the target features describe the set of tools it has available to do it. For example: - Variants might be backwards-compatible, operating in a reduced form (or as no-ops) if some target features are unavailable. For example, `pac-ret` generates instructions that were no-ops before Armv8.3 Pointer Authentication was introduced. - Conversely, better code generation might be possible if the compiler _knows_ that a given target feature is available. For example, if `paca` is enabled at compile-time, Rust might implement `pac-ret` using instructions that weren't interpreted as no-ops prior to Armv8.3 Pointer Authentication. We also consider the possibility that an ABI variant might _require_ a target, or target feature. Specifying such a variant should cause a compile-time error, as noted earlier. For example, [`-Zsanitizer=shadow-call-stack`][] is only available on `aarch64-linux-android`. [`-Zsanitizer=shadow-call-stack`]: https://doc.rust-lang.org/unstable-book/compiler-flags/sanitizer.html#shadowcallstack

Syntax	Example	Reference
# Header	Header	基本排版
- Unordered List	Unordered List
1. Ordered List	Ordered List
- [ ] Todo List	Todo List
> Blockquote	Blockquote
Bold font	Bold font
Italics font	Italics font
~~Strikethrough~~	~~Strikethrough~~
19^th^	19^th
H~2~O	H₂O
++Inserted text++	Inserted text
==Marked text==	Marked text
[link text](https:// "title")	Link
![image alt](https:// "title")	Image
`Code`	`Code`	在筆記中貼入程式碼
```javascript var i = 0; ```	`var i = 0;`
:smile:		Emoji list
{%youtube youtube_id %}	Externals
$L^aT_eX$	L^aT_eX
:::info This is a alert area. :::	This is a alert area.