- 
                Notifications
    You must be signed in to change notification settings 
- Fork 314
[DRAFT] v0 mangling on nightly #1730
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: main
Are you sure you want to change the base?
Changes from all commits
File filter
Filter by extension
Conversations
Jump to
Diff view
Diff view
There are no files selected for viewing
| Original file line number | Diff line number | Diff line change | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| @@ -0,0 +1,171 @@ | ||||||||||
| +++ | ||||||||||
| path = "2025/10/28/switching-to-v0-mangling-on-nightly" | ||||||||||
| title = "Switching to Rust's own mangling scheme on nightly" | ||||||||||
| authors = ["David Wood"] | ||||||||||
|  | ||||||||||
| [extra] | ||||||||||
| team = "the compiler team" | ||||||||||
| team_url = "https://www.rust-lang.org/governance/teams/compiler" | ||||||||||
| +++ | ||||||||||
|  | ||||||||||
| **TL;DR:** rustc will use its own "v0" mangling scheme by default on nightly | ||||||||||
| versions | ||||||||||
|  | ||||||||||
| #### Context | ||||||||||
|  | ||||||||||
| When Rust is compiled into object files and binaries, each item (functions, | ||||||||||
| statics, etc) must have a globally unique "symbol" identifying it. | ||||||||||
|  | ||||||||||
| In C, the symbol name of a function is just the name that the function was | ||||||||||
| defined with, such as `strcmp`. This is straightforward and easy to | ||||||||||
| understand, but requires that each item have a globally unique name | ||||||||||
| that don't overlap with any symbols from shared libraries that might be linked | ||||||||||
| There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. 
        Suggested change
       
 There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. 
        Suggested change
       
 | ||||||||||
| against. If two items had the same symbol then when the linker tried to resolve | ||||||||||
| a symbol to an address in memory (of a function, say), then it wouldn't know | ||||||||||
| which symbol is the correct one. | ||||||||||
|  | ||||||||||
| Languages like Rust and C++ define "symbol mangling schemes", leveraging information | ||||||||||
| from the type system to give each item a unique symbol name. Otherwise every | ||||||||||
| instantiation of a generic or templated function (or an overload in C++), which has | ||||||||||
| the same name in the surface language would end up with clashing symbols. | ||||||||||
| There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Feels like this should mention paths vs identifiers, e.g. Rust lets you have  | ||||||||||
|  | ||||||||||
| Rust originally used a symbol mangling scheme based on the | ||||||||||
| [Itanium ABI's name mangling scheme][itanium-mangling] used by C++ (sometimes). Over | ||||||||||
| the years, it was extended in an inconsistent and ad-hoc way to support Rust | ||||||||||
| features that the mangling scheme wasn't originally designed for. Rust's current legacy | ||||||||||
| mangling scheme has a number of drawbacks: | ||||||||||
|  | ||||||||||
| - Information about generic parameter instantiations is lost during mangling | ||||||||||
| - It is internally inconsistent - some paths use an Itanium ABI-style encoding | ||||||||||
| but some don't | ||||||||||
| - Symbol names can contain `.` characters which aren't supported on all platforms | ||||||||||
| - Symbol names depend on compiler internals and can't be easily replicated by | ||||||||||
| other compilers or tools | ||||||||||
|  | ||||||||||
| If you've ever tried to use Rust with a debugger or a profiler and found it hard | ||||||||||
| to work with because you couldn't work out which functions were which, it's probably | ||||||||||
| because information was being lost in the mangling scheme. | ||||||||||
|  | ||||||||||
| Rust's compiler team started working on our own mangling scheme back in 2018 | ||||||||||
| with [RFC 2603][rfcs#2603] (see the ["v0 Symbol Format"][v0-mangling] chapter in | ||||||||||
| rustc book for our current documentation on the format). Our "v0" mangling scheme has | ||||||||||
| multiple advantageous properties: | ||||||||||
|  | ||||||||||
| - An unambigious encoding for everything that can end up in a binary's symbol table | ||||||||||
| There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. 
        Suggested change
       
 | ||||||||||
| - Information about generic parameters are encoded in a reversible way | ||||||||||
| - Mangled symbols are decodable such that it should be possible to identify concrete | ||||||||||
| instances of generic functions | ||||||||||
| - It doesn't rely on compiler internals | ||||||||||
| - Symbols are restricted to only `A-Z`, `a-z`, `0-9` and `_`, helping ensure | ||||||||||
| compatibility with tools on varied platforms | ||||||||||
| - It tries to stay efficient and avoid unnecessarily long names and | ||||||||||
| computationally-expensive decoding | ||||||||||
|  | ||||||||||
| However, rustc is not the only tool that interacts with Rust symbol names: the | ||||||||||
| aforementioned debuggers, profilers and other tools all need to be updated to | ||||||||||
| understand Rust's v0 symbol mangling scheme so that Rust's users can continue | ||||||||||
| to work with Rust binaries using all the tools they're used to. Furthermore, all | ||||||||||
| There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. 
        Suggested change
       
 | ||||||||||
| of those tools need to have new releases cut and then those releases need to be | ||||||||||
| picked up by distros. This takes time! | ||||||||||
|  | ||||||||||
| Fortunately, the compiler team now believe that support for our v0 mangling | ||||||||||
| scheme is now sufficiently widespread that it can start to be used by default by | ||||||||||
| rustc. | ||||||||||
|  | ||||||||||
| #### Benefits | ||||||||||
|  | ||||||||||
| Reading Rust backtraces, or using Rust with debuggers, profilers and other | ||||||||||
| tools that operate on compiled Rust code, will be able to output much more | ||||||||||
| useful and readable names. This will especially help with async code, | ||||||||||
| closures and generic functions. | ||||||||||
|  | ||||||||||
| It's easy to see the new mangling scheme in action, consider the following | ||||||||||
| example: | ||||||||||
|  | ||||||||||
| ```rust | ||||||||||
| fn foo<T>() { | ||||||||||
| panic!() | ||||||||||
| } | ||||||||||
|  | ||||||||||
| fn main() { | ||||||||||
| foo::<Vec<(String, &[u8; 123])>>(); | ||||||||||
| } | ||||||||||
| ``` | ||||||||||
|  | ||||||||||
| With the legacy mangling scheme, all of the useful information about the generic | ||||||||||
| instantiation of `foo` is lost in the symbol `f:foo`.. | ||||||||||
| There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. 
        Suggested change
       
 | ||||||||||
|  | ||||||||||
| ``` | ||||||||||
| thread 'main' panicked at f.rs:2:5: | ||||||||||
| explicit panic | ||||||||||
| stack backtrace: | ||||||||||
| 0: std::panicking::begin_panic | ||||||||||
| at /rustc/d6c...582/library/std/src/panicking.rs:769:5 | ||||||||||
| 1: f::foo | ||||||||||
| 2: f::main | ||||||||||
| 3: core::ops::function::FnOnce::call_once | ||||||||||
| note: Some details are omitted, run with `RUST_BACKTRACE=full` for a verbose backtrace. | ||||||||||
| ``` | ||||||||||
|  | ||||||||||
| ..but with the v0 mangling scheme, the useful details of the generic instantiation | ||||||||||
| are preserved with `f::foo::<alloc::vec::Vec<(alloc::string::String, &[u8; 123])>>`: | ||||||||||
|  | ||||||||||
| ``` | ||||||||||
| thread 'main' panicked at f.rs:2:5: | ||||||||||
| explicit panic | ||||||||||
| stack backtrace: | ||||||||||
| 0: std::panicking::begin_panic | ||||||||||
| at /rustc/d6c...582/library/std/src/panicking.rs:769:5 | ||||||||||
| 1: f::foo::<alloc::vec::Vec<(alloc::string::String, &[u8; 123])>> | ||||||||||
| 2: f::main | ||||||||||
| 3: <fn() as core::ops::function::FnOnce<()>>::call_once | ||||||||||
| note: Some details are omitted, run with `RUST_BACKTRACE=full` for a verbose backtrace. | ||||||||||
| ``` | ||||||||||
|  | ||||||||||
| #### Possible drawbacks | ||||||||||
|  | ||||||||||
| Symbols using the v0 mangling scheme can be larger than symbols with the | ||||||||||
| legacy mangling scheme, which can result in a slight increase in linking | ||||||||||
| times. Fortunately this impact should be minor, especially with modern | ||||||||||
| There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. 
        Suggested change
       
 | ||||||||||
| linkers like lld, which Rust [will now default to on some targets][switch-to-lld]. | ||||||||||
|  | ||||||||||
| Some old versions of tools/distros or niche tools that the compiler team are | ||||||||||
| unaware of may not have had support for the v0 mangling scheme added. | ||||||||||
|  | ||||||||||
| There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. It should be mentioned explicitly that the consequences here are having to look at mangled symbols, which isn't great but also not completely terrible. There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. And maybe mention https://github.com/luser/rustfilt as a fallback? I think it can handle v0 mangling? | ||||||||||
| In any case, using the new mangling scheme can be disabled if any problem | ||||||||||
| occurs: use the `-Csymbol-mangling-version=legacy -Zunstable-options` flag | ||||||||||
| There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Why is the  There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. The legacy mangling scheme is unstable (and might stay unstable, I guess to be decided). There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. It feels very odd that the mangling scheme that has been used forever is unstable. Maybe that warrants more explanation? | ||||||||||
| to revert to using the legacy mangling scheme. | ||||||||||
|  | ||||||||||
| #### Summary | ||||||||||
| There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. I think we should have a section for tool owners that want to add support pointing at rustc-demangle which has a C and Rust implementation of the v0 demangler, which hopefully makes it easier for people to quickly add support if they find it's missing. (I can try to write some text later, don't have time right this moment). | ||||||||||
|  | ||||||||||
| rustc will use our "v0" mangling scheme on nightly for all targets | ||||||||||
| starting in tomorrow's rustup nightly (`nightly-2025-XX-XX`). | ||||||||||
|  | ||||||||||
| Let us know if you encounter problems, by [opening an | ||||||||||
| issue](https://github.com/rust-lang/rust/issues/new/choose) on GitHub. | ||||||||||
|  | ||||||||||
| If that happens, you can use the legacy mangling scheme with | ||||||||||
| the `-Csymbol-mangling-version=legacy -Zunstable-options` flag. Either by | ||||||||||
| adding it to the usual `RUSTFLAGS` environment variable, or to a | ||||||||||
| project's [`.cargo/config.toml`] configuration file, like so: | ||||||||||
|  | ||||||||||
| ```toml | ||||||||||
| [build] | ||||||||||
| rustflags = ["-Csymbol-mangling-version=legacy", "-Zunstable-options"] | ||||||||||
| ``` | ||||||||||
|  | ||||||||||
| If you like the sound of the new symbol mangling version and would | ||||||||||
| like to start using it on stable or beta channels of Rust, then you can | ||||||||||
| similarly use the `-Csymbol-mangling-version=v0` flag today via | ||||||||||
| `RUSTFLAGS` or [`.cargo/config.toml`]: | ||||||||||
|  | ||||||||||
| ```toml | ||||||||||
| [build] | ||||||||||
| rustflags = ["-Csymbol-mangling-version=v0"] | ||||||||||
| ``` | ||||||||||
|  | ||||||||||
| [`.cargo/config.toml`]: (https://doc.rust-lang.org/cargo/reference/config.html) | ||||||||||
| [rfcs#2603]: https://rust-lang.github.io/rfcs/2603-rust-symbol-name-mangling-v0.html | ||||||||||
| [itanium-mangling]: https://refspecs.linuxbase.org/cxxabi-1.86.html#mangling | ||||||||||
| [v0-mangling]: https://doc.rust-lang.org/nightly/rustc/symbol-mangling/v0.html | ||||||||||
| [switch-to-lld]: https://blog.rust-lang.org/2025/09/01/rust-lld-on-1.90.0-stable/ | ||||||||||
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I don't think we should say "compatible" because someone might read it as this makes us incompatible with C++. Maybe just "instead of the previous default scheme which reused C++ mangling"?