Add tokio metrics by sfackler · Pull Request #236 · palantir/witchcraft-rust-server

sfackler · 2025-02-21T02:20:22Z

Before this PR

We didn't have any metrics tracking the state of tasks in the Tokio runtime.

After this PR

==COMMIT_MSG==
Added metrics tracking the state of the Tokio runtime.
==COMMIT_MSG==

Unfortunately, many of these rely on unstable Tokio APIs. As a result, you have to opt-in both with the standard tokio_unstable cfg and a tokio_unstable Cargo feature in this crate.

changelog-app · 2025-02-21T02:20:25Z

Generate changelog in `changelog/@unreleased`

What do the change types mean?

feature: A new feature of the service.
improvement: An incremental improvement in the functionality or operation of the service.
fix: Remedies the incorrect behaviour of a component of the service in a backwards-compatible way.
break: Has the potential to break consumers of this service's API, inclusive of both Palantir services
and external consumers of the service's API (e.g. customer-written software or integrations).
deprecation: Advertises the intention to remove service functionality without any change to the
operation of the service itself.
manualTask: Requires the possibility of manual intervention (running a script, eyeballing configuration,
performing database surgery, ...) at the time of upgrade for it to succeed.
migration: A fully automatic upgrade migration task with no engineer input required.

Note: only one type should be chosen.

How are new versions calculated?

❗The break and manual task changelog types will result in a major release!
🐛 The fix changelog type will result in a minor release in most cases, and a patch release version for patch branches. This behaviour is configurable in autorelease.
✨ All others will result in a minor version release.

Type

Description

Added metrics tracking the state of the Tokio runtime.

Check the box to generate changelog(s)

Generate changelog entry

sfackler · 2025-02-21T02:24:18Z

+                .precision_exact(0)
+                .min_value(Duration::from_micros(100))
+                .max_value(Duration::from_secs(10))
+                .build(),


I am a bit unsure about what values are appropriate here. This setup gives us 20 buckets with these ranges:

0ns..65.536µs 65.536µs..131.072µs 131.072µs..262.144µs 262.144µs..524.288µs 524.288µs..1.048576ms 1.048576ms..2.097152ms 2.097152ms..4.194304ms 4.194304ms..8.388608ms 8.388608ms..16.777216ms 16.777216ms..33.554432ms 33.554432ms..67.108864ms 67.108864ms..134.217728ms 134.217728ms..268.435456ms 268.435456ms..536.870912ms 536.870912ms..1.073741824s 1.073741824s..2.147483648s 2.147483648s..4.294967296s 4.294967296s..8.589934592s 8.589934592s..17.179869184s 17.179869184s..18446744073.709551615s

The intent is to avoid spamming the metrics infrastructure with a huge number of buckets, while still giving us enough information to go off of. Future polls under 100us or so are in a totally good place and I don't think we really care about splitting those out, and above a few seconds the poll is so unreasonably long the specific length doesn't matter too much.

sfackler · 2025-03-03T00:18:55Z

+//!
+//! * `tokio.blocking.threads` (gauge) - The number of threads in Tokio's blocking pool.
+//! * `tokio.blocking.threads.idle` (gauge) - The number of threads in Tokio's blocking pool that are idle.
+//! * `tokio.tasks.polls` (gauge) - The number of individual poll calls to tasks.


This is mostly useful to normalize the values of tokio.tasks.poll-duration-bucket into percentages.

stale · 2025-06-27T04:31:26Z

This PR has been automatically marked as stale because it has not been touched in the last 14 days. If you'd like to keep it open, please leave a comment or add the 'long-lived' label, otherwise it'll be closed in 7 days.

stale · 2025-10-18T05:06:49Z

This PR has been automatically marked as stale because it has not been touched in the last 14 days. If you'd like to keep it open, please leave a comment or add the 'long-lived' label, otherwise it'll be closed in 7 days.

sfackler added 3 commits February 20, 2025 19:57

clippy fixes

8d2252b

Add the stable tokio.tasks metric

92fb004

Add unstable tokio metrics

8bced81

sfackler commented Feb 21, 2025

View reviewed changes

Comment thread witchcraft-server/src/metrics/tokio.rs Outdated

sfackler added 4 commits February 20, 2025 21:27

fix ci

9c5bcdf

remove println

dc4b906

reorder for a bit more build reuse

69f4f09

Update bucket tags to be closer to prometheus

145ca6a

sfackler force-pushed the tokio-metrics branch from 40698a5 to 145ca6a Compare February 21, 2025 18:54

sfackler marked this pull request as ready for review February 23, 2025 19:45

sfackler requested a review from a team February 23, 2025 19:45

svc-changelog and others added 3 commits February 23, 2025 19:46

Add generated changelog entries

030b17e

add tokio.tasks.polls

873c390

Merge remote-tracking branch 'origin/develop' into tokio-metrics

9975e35

sfackler commented Mar 3, 2025

View reviewed changes

stale bot added the stale label Jun 27, 2025

sfackler removed the stale label Jun 27, 2025

stale bot added the stale label Oct 18, 2025

sfackler removed the stale label Jan 5, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add tokio metrics#236

Add tokio metrics#236
sfackler wants to merge 10 commits intodevelopfrom
tokio-metrics

sfackler commented Feb 21, 2025

Uh oh!

changelog-app bot commented Feb 21, 2025 •

edited by sfackler

Loading

Uh oh!

sfackler Feb 21, 2025

Uh oh!

Uh oh!

sfackler Mar 3, 2025

Uh oh!

stale bot commented Jun 27, 2025

Uh oh!

stale bot commented Oct 18, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

sfackler commented Feb 21, 2025

Before this PR

After this PR

Uh oh!

changelog-app bot commented Feb 21, 2025 • edited by sfackler Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Generate changelog in changelog/@unreleased

Uh oh!

sfackler Feb 21, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

sfackler Mar 3, 2025

Choose a reason for hiding this comment

Uh oh!

stale bot commented Jun 27, 2025

Uh oh!

stale bot commented Oct 18, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

changelog-app bot commented Feb 21, 2025 •

edited by sfackler

Loading

Generate changelog in `changelog/@unreleased`