Move punctuation index to an appendix, and expand for other syntaxes #2061

ehuss · 2025-10-24T15:24:30Z

This adds a new syntax index which shows some common, potentially hard-to-search syntax examples along with links to the definitions for those forms.

This is based on the syntax index that is in the book (https://doc.rust-lang.org/1.90.0/book/appendix-02-operators.html). There have been occasional requests for this, with links in the reference.

I decided to organize it by concepts instead of by syntax. For example (…) could be a grouped expression, type, or pattern. However, I think the amount of overlap between concepts is small, and I liked the idea of emphasizing the distinction of these concepts to really convey that they should not be conflated. But I would also be open to a more syntax-based organization.

This is not intended to be 100% exhaustive. There are many permutations and qualifiers, and I felt it would just make the list too long to try to cover everything.

Closes #211

This adds a new appendix caled "Syntax index" which is an index of various syntactical elements of the language. This initial commit just moves the index of punctuation symbols as-is without modification.

This adjusts the capitalization just because I felt the mixed and inconsistent capitalization looked odd, and it felt simpler to just keep it lowercase. I also didn't feel like there was a particular need to sentence case these.

None of the other lists use "and" for the final element. I'm not sure why this was here.

This adds a pipe at the end of the line because it is the recommended style for gfm tables.

The pipe symbol wasn't updated when or-patterns were stabilized. This updates it for that new grammar.

When use bounds were stabilized, I forgot to add the angle brackets to the index.

This adds let statements to the `=` index entry. It really is distinct from assignment, and I probably just overlooked that.

This switches all the links to rule links for consistency. It's easier to link to specific rules this way. Some of the links were adjusted to go to more specific rules.

This pollutes the search index, since the reader probably wants to go to the things that these point to.

Unfortunately the comments chapter isn't organized in a way that it is easy to link to specific kinds of comments.

"The inferred type" and "the wildcard pattern" are singular things. Let's not pluralize these.

We have now not just the inferred type but also the inferred const, so let's include that in the syntax index.

The placeholder lifetime is the lifetime equivalent of the inferred const and the inferred type (and, in fact, we should probably rename it to the inferred lifetime). Let's mention it alongside these similar things in the syntax index. We could ask whether it should really appear in the "type expressions" section. After all, it's not exactly a type expression. However, neither is `for<...>`, but that appears in this section. We can justify both by observing that they may appear within `impl Trait` and `dyn Trait` -- and these are type expressions.

In the patterns section of the syntax index, we had listed other kinds of patterns, and we had listed `..` for rest patterns, but we hadn't noted its use for range patterns or listed `..=`. Let's do that.

The text referred to `_` as applying to "unnamed items in constants", but this doesn't really speak to me. Let's say instead, "unnamed constant items".

We had described `as` as applying to an "extern crate alias" or "use alias". While I gather the meaning, and we do use this term in some code examples, I'm not sure in the index we'd necessarily want to treat these as canonical names. The term "use alias" seems particularly tenuous; I'd probably at least say, e.g. "use declaration alias" or "use item alias" or similar. Other items in the list often just refer to the top-level feature; let's do that here. We'll keep (and inline) the link to the specific section.

We had listed many other things as being related to `unsafe`, but not unsafe blocks, so let's list those.

For the semicolon, we had listed "array types", but we had not listed "array expressions" even though the semicolon is used for those as well. Let's add that.

For `->`, we had listed "function return type" and "closure return type". For me, I read this table as "the thing on the left is used in the thing on the right". But that's not really true here; the `->` is not used *in* the function or closure return type. It's probably better just to say that `->` is used in functions and closures, so let's do that.

For `#`, we had noted that it was used in attributes, but it's also used in raw string literals, raw byte string literals, and raw C string literals, so let's list those as well.

For `?`, in this branch, we had listed "questionably sized" in the syntax index. I'm not a fan of this terminology. As I wrote elsewhere, in Rust PR 145924: > I don't think we should use the word "maybe" to refer to "?X" bound > relaxations. Either there's a predicate stating that a type > parameter implements some trait or there isn't. While of course I > get it -- the type argument provided may or may not implement the > trait -- I just think "maybe" is speaking to the wrong thing here. > It focuses on the type argument when what makes more sense to focus > on is the type parameter. And the type parameter is not in a > "maybe" state. In this regard, I feel the same about "questionably" as I do "maybe". We don't use this terminology elsewhere in the Reference, so let's not use it here. Let's instead refer simply to "relaxed trait bounds", as we do elsewhere in the syntax index.

In writing RFC 3531, I came to believe that the most correct fully qualified term for what `kind` is in `$ident:kind` is a "macro matcher fragment specifier", and I still think that's right. It's unfortunate that it's such a mouthful, but let's use that term in the syntax index.

On the branch, in the syntax index, we had called `$ident` a "macro fragment substitution". The way that I think about it is that `$ident` is a macro metavariable that is bound to a fragment. That is, the metavariable is the parameter while the fragment is the argument. That's why a "macro matcher fragment specifier" makes sense -- we're specifying the kind of the fragment that can be bound to this macro metavariable in the matcher. In this light, let's call `$ident` simply a "macro metavariable" in the syntax index.

In closures without an ascribed return type, an expression follows the vertical bars. In closures with an ascribed return type, a block follows. Let's add this second syntax to the syntax index.

Elsewhere in this table, we spell `Type` and `Trait` in uppercase, so let's do that here as well.

On the branch, we had an entry in the expressions table for `Type<ident=Type>`. This was notated as "explicit associated type bounds" and referenced a TODO item about default type parameters not being documented. Perhaps what this means to describe are associated type bindings and associated type bounds. Let's document those in the type expressions table. We'll remove the TODO note, as it's not relevant to this.

The syntax `Type<..>`, without turbofish, is only valid in a type expression. Let's move this to that table and document `Trait<..>` along with it.

On the branch, we had pluralized other related expressions, so let's also pluralize "single element tuple expressions".

For macro invocations, struct expressions, and struct patterns, we had shown the opening curly brace without a space in front of it. This can make it visually difficult to distinguish from an opening parenthesis. Using the common Rust style of putting a space in front helps with that, so let's do that here.

On the branch, in the syntax index, items were described as "declarations in a crate". Some items may be declarations, but many items are actually definitions. It's probably better to not use either term here. Let's instead say that items are the components of a crate as that's the language that's used in the items chapter.

On the branch, in the index, in the table for items, we had written `mod ident;` and `use path;` but we hadn't similarly indicated a trailing semicolon for type aliases, const items, and static items. Let's be consistent and indicate the trailing semicolon for those.

src/syntax-index.md

In the syntax index, in the type expressions table, we list some primitive types but not all of them. Let's use the ellipses to indicate that the list is not exhaustive.

On the branch, the syntax index had used "compound type bounds" to refer to a bound with a `+` in it. This isn't a term we use elsewhere; let's not use it here.

On the branch, in the syntax index, we had notated a slice type so as to include a type and then ellipses, but it's more correct for a slice type to just have a type in the square brackets, so let's do that.

In the syntax index, in a list of things to which type paths can refer, let's refer to "type aliases" rather than just to "aliases" for better clarity.

In the syntax index, in the table about type expressions, we have a list of things that can be referred to by type paths. Let's link to each of the items in that list.

Like "the wildcard pattern", there is only one "rest pattern". Let's singularize this and add the appropriate redirect.

In the keywords table in the syntax index, every entry except the one for constant items for `_` consists entirely of a link body. For that one, though, we had left "unnamed" out of the link body. Visually, that just stands out too much. We could just put "unnamed" in the link body, but most of the other entries aren't really that specific. It seems OK to just say "constant items" for this, so let's do that.

In each table in the syntax index, we have some bit of syntax on the left and then on the right we have features in which that bit of syntax is used. Should we title the column on the right "use" (or maybe "uses") or "usage"? One could almost, superficially, make a case for "usage". As relevant here, Webster's defines it as "the way in which a word, phrase, etc. is used to express a particular idea". That sounds close. The trouble is that it's nearly the wrong way around. The "idea" here is actually the thing on the right. The "usage" would be the place or way in which the syntax is used to express that idea. Conversely, definitions of "use" apply cleanly: to "employ for or apply to a given purpose", "an instance or way of using", etc. As Garner says: > Whenever *use* is possible, *usage* shouldn't appear. Let's say "use".

ehuss added 16 commits October 24, 2025 05:51

Create a syntax index

2c5f759

This adds a new appendix caled "Syntax index" which is an index of various syntactical elements of the language. This initial commit just moves the index of punctuation symbols as-is without modification.

Lowercase punctuation index links

12678cb

This adjusts the capitalization just because I felt the mixed and inconsistent capitalization looked odd, and it felt simpler to just keep it lowercase. I also didn't feel like there was a particular need to sentence case these.

Remove a stray "and"

89294c2

None of the other lists use "and" for the final element. I'm not sure why this was here.

Add a trailing pipe for the punctuation table

f0a259b

This adds a pipe at the end of the line because it is the recommended style for gfm tables.

Update pipe for or-patterns

f8b53c7

The pipe symbol wasn't updated when or-patterns were stabilized. This updates it for that new grammar.

Add use bounds for punctuation index

cb0194f

When use bounds were stabilized, I forgot to add the angle brackets to the index.

Add let statements for =

bc145b8

This adds let statements to the `=` index entry. It really is distinct from assignment, and I probably just overlooked that.

Switch links to rule links

81f798a

This switches all the links to rule links for consistency. It's easier to link to specific rules this way. Some of the links were adjusted to go to more specific rules.

Add link forwarding to the syntax index

98ae16b

Disable indexing for the syntax index

120227e

This pollutes the search index, since the reader probably wants to go to the things that these point to.

Add keywords to the syntax index

1a1c5bd

Add comments to syntax index

56826a3

Unfortunately the comments chapter isn't organized in a way that it is easy to link to specific kinds of comments.

Add other tokens to the syntax index

fc33501

Add macros to the syntax index

b49facd

Add attributes to syntax index

41e7de9

Add expressions, items, types, and patterns to syntax index

16c8bbc

rustbot added the S-waiting-on-review Status: The marked PR is awaiting review from a maintainer label Oct 24, 2025

traviscross added 3 commits October 25, 2025 04:24

Singularize "wildcard pattern", "inferred type"

34ccba1

"The inferred type" and "the wildcard pattern" are singular things. Let's not pluralize these.

Add inferred const to keyword index

addbb94

We have now not just the inferred type but also the inferred const, so let's include that in the syntax index.

traviscross force-pushed the syntax-index branch 6 times, most recently from 1ec735f to 6a61838 Compare October 28, 2025 19:17

traviscross approved these changes Oct 28, 2025

View reviewed changes

traviscross added 3 commits October 28, 2025 19:24

Add range patterns to patterns section of index

29a0852

In the patterns section of the syntax index, we had listed other kinds of patterns, and we had listed `..` for rest patterns, but we hadn't noted its use for range patterns or listed `..=`. Let's do that.

Clarify role of _ with constants

4ae18da

The text referred to `_` as applying to "unnamed items in constants", but this doesn't really speak to me. Let's say instead, "unnamed constant items".

traviscross added 15 commits October 28, 2025 19:24

Add "unsafe blocks" to syntax index

59b8fa7

We had listed many other things as being related to `unsafe`, but not unsafe blocks, so let's list those.

Add "array expressions" for semicolon in index

11133a9

For the semicolon, we had listed "array types", but we had not listed "array expressions" even though the semicolon is used for those as well. Let's add that.

List raw string literals for # in index

02c7d47

For `#`, we had noted that it was used in attributes, but it's also used in raw string literals, raw byte string literals, and raw C string literals, so let's list those as well.

Add |..| -> Type { .. } syntax for closures to index

7ae6d27

In closures without an ascribed return type, an expression follows the vertical bars. In closures with an ascribed return type, a block follows. Let's add this second syntax to the syntax index.

Fix Type as Trait capitalization

256eb2f

Elsewhere in this table, we spell `Type` and `Trait` in uppercase, so let's do that here as well.

Move Type<..> to type expressions in index

a4ea4a1

The syntax `Type<..>`, without turbofish, is only valid in a type expression. Let's move this to that table and document `Trait<..>` along with it.

Pluralize "single element tuple expressions"

4946e96

On the branch, we had pluralized other related expressions, so let's also pluralize "single element tuple expressions".

traviscross force-pushed the syntax-index branch from 6a61838 to a2b6ab5 Compare October 28, 2025 19:24

ehuss commented Oct 28, 2025

View reviewed changes

src/syntax-index.md Outdated Show resolved Hide resolved

traviscross added 8 commits October 28, 2025 19:35

Indicate list of primitive types is non-exhaustive

33cb862

In the syntax index, in the type expressions table, we list some primitive types but not all of them. Let's use the ellipses to indicate that the list is not exhaustive.

Remove "compound type bounds" from syntax index

f5b74cf

On the branch, the syntax index had used "compound type bounds" to refer to a bound with a `+` in it. This isn't a term we use elsewhere; let's not use it here.

Fix slice types syntax in syntax index

fc3a743

On the branch, in the syntax index, we had notated a slice type so as to include a type and then ellipses, but it's more correct for a slice type to just have a type in the square brackets, so let's do that.

Refer to "type aliases" rather than "aliases"

bca3f52

In the syntax index, in a list of things to which type paths can refer, let's refer to "type aliases" rather than just to "aliases" for better clarity.

Link to items listed for type paths

a1cd147

In the syntax index, in the table about type expressions, we have a list of things that can be referred to by type paths. Let's link to each of the items in that list.

Singularize "the rest pattern"

4ac6626

Like "the wildcard pattern", there is only one "rest pattern". Let's singularize this and add the appropriate redirect.

traviscross force-pushed the syntax-index branch from a2b6ab5 to e93dce5 Compare October 28, 2025 19:35

traviscross added this pull request to the merge queue Oct 28, 2025

Merged via the queue into rust-lang:master with commit a48cad1 Oct 28, 2025
5 checks passed

rustbot removed the S-waiting-on-review Status: The marked PR is awaiting review from a maintainer label Oct 28, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Move punctuation index to an appendix, and expand for other syntaxes #2061

Move punctuation index to an appendix, and expand for other syntaxes #2061

ehuss commented Oct 24, 2025

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Move punctuation index to an appendix, and expand for other syntaxes #2061

Move punctuation index to an appendix, and expand for other syntaxes #2061

Conversation

ehuss commented Oct 24, 2025

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants