[WIP] Support Intel GPU (DRM) by Bowen951209 · Pull Request #127 · PolpOnline/gpu-usage-waybar

Bowen951209 · 2026-02-11T09:10:39Z

Overview

Adds monitoring of Intel GPU render and video engine utilization by processing DRM client file descriptors.

Utilization Computation

The engine utilization is calculated using the formula

$$ \frac {\text{engine used duration within delta time}} {\text{delta time}}. $$

The overall GPU utilization is defined as the maximum of the two engines' utilization values. (#101 (comment))
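As a rough illustration of the formula, here is a minimal sketch assuming each engine exposes a cumulative busy-time counter in nanoseconds. The `Sample` type and both function names are hypothetical and are not taken from this PR's code.

```rust
use std::time::Instant;

/// One reading of an engine's cumulative busy time (hypothetical type).
#[derive(Clone, Copy)]
struct Sample {
    busy_ns: u64,
    taken_at: Instant,
}

/// Busy fraction between two samples: used duration / delta time.
/// Returns `None` when no time has elapsed, which avoids dividing by zero.
fn engine_utilization(prev: Sample, curr: Sample) -> Option<f64> {
    let delta_time = curr.taken_at.duration_since(prev.taken_at).as_nanos();
    if delta_time == 0 {
        return None;
    }
    // A counter that appears to go backwards is treated as "no work done".
    let delta_busy = curr.busy_ns.saturating_sub(prev.busy_ns);
    Some(delta_busy as f64 / delta_time as f64)
}

/// Overall GPU utilization: the larger of the two engine values.
fn overall_utilization(render: f64, video: f64) -> f64 {
    render.max(video)
}
```

For example, an engine that accumulates 0.5 ms of busy time over a 1 ms interval yields a utilization of 0.5.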

Refactoring

With the set of supported GPU brands growing from two (NVIDIA and AMD) to three, I have introduced a GpuStatus trait. It replaces the previous universal struct, which was becoming sparse with None values, particularly since utilization is the only information we can get for Intel GPUs.
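The idea can be sketched as follows. This is a simplified illustration, not the trait from this PR: the method names, `FieldError`, and both backend structs are hypothetical.

```rust
/// Why a field could not be read (hypothetical error type).
#[derive(Debug, PartialEq)]
enum FieldError {
    /// The backend never exposes this field.
    Unsupported,
}

/// Each backend implements only the getters it can actually serve,
/// instead of filling a shared struct with `None`.
trait GpuStatus {
    fn gpu_utilization(&self) -> Result<u8, FieldError>;

    /// Default implementation: unsupported unless a backend overrides it.
    fn temperature_celsius(&self) -> Result<f32, FieldError> {
        Err(FieldError::Unsupported)
    }
}

/// A backend that only knows utilization (as described for Intel).
struct UtilizationOnly {
    utilization: u8,
}

impl GpuStatus for UtilizationOnly {
    fn gpu_utilization(&self) -> Result<u8, FieldError> {
        Ok(self.utilization)
    }
}

/// A backend that also reports temperature.
struct FullSensors {
    utilization: u8,
    temperature: f32,
}

impl GpuStatus for FullSensors {
    fn gpu_utilization(&self) -> Result<u8, FieldError> {
        Ok(self.utilization)
    }

    fn temperature_celsius(&self) -> Result<f32, FieldError> {
        Ok(self.temperature)
    }
}

/// Callers work against the trait object and handle missing fields uniformly.
fn describe(gpu: &dyn GpuStatus) -> String {
    let util = gpu
        .gpu_utilization()
        .map_or_else(|_| "n/a".to_string(), |u| format!("{u}%"));
    let temp = gpu
        .temperature_celsius()
        .map_or_else(|_| "n/a".to_string(), |t| format!("{t:.1}C"));
    format!("util={util} temp={temp}")
}
```

With this shape, adding a brand means adding one `impl` block rather than widening a shared struct.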

Credits

The computation logic is heavily based on qmassa.

Closes #101.

Tracker

Refactor the architecture to make supporting Intel easier.
Select the GPU to monitor via the CLI argument --gpu <index>; if not specified, it defaults to index 0. The GPU index is derived from /dev/dri/cardN. (Closes #106, "[Feature Request] Add Function to Display Single or Multiple GPU Data".)
Compute engine utilization via processing file descriptors.
- Render engine
- Video engine
Update README.
qmassa credits.
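To illustrate how a GPU index could be derived from a /dev/dri/cardN node, here is a minimal sketch. The function name `dri_card_index` is hypothetical; the `u8` index type follows the review discussion in this PR.

```rust
use std::path::Path;

/// Extracts N from a DRM node path such as `/dev/dri/cardN`.
/// Returns `None` for non-card nodes, non-UTF-8 names, or values above `u8::MAX`.
fn dri_card_index(path: &Path) -> Option<u8> {
    path.file_name()?
        .to_str()?
        .strip_prefix("card")?
        .parse::<u8>()
        .ok()
}
```

Under this sketch, `/dev/dri/card1` maps to index 1, while render nodes such as `/dev/dri/renderD128` are ignored.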

Summary by CodeRabbit

New Features
- Added Intel GPU support and DRM-based GPU discovery; prints GPU model and nodes at startup.
- Per-field unit selection and decimal-precision formatting for memory, temperature, power, TX/RX.
- New text format option alongside tooltip format; selectable GPU index via config.
Configuration
- Updated default tooltip/text formats to include unit specifiers and precision controls.

coderabbitai · 2026-02-11T09:10:47Z

📝 Walkthrough

This PR refactors GPU status into a trait-based design, adds DRM-based device discovery and Intel iGPU support, introduces unit-aware formatting with precision, updates NVIDIA and AMD backends to per-field getters, and adjusts CI/build deps for udev-related tooling.

Changes

| Cohort / File(s) | Summary |
|---|---|
| System Dependencies & CI: `Cargo.toml`, `.github/workflows/ci.yml` | Added `uom`, `udev` (hwdb feature), and `libc` crates; CI now installs `libudev-dev` and `pkg-config`. |
| Formatter: new unit-aware system: `src/formatter/mod.rs`, `src/formatter/units.rs`, (deleted) `src/formatter.rs` | Replaced old formatter with a unit-aware parser/State (`Chunk`, `State`, `FormatSegments`), unit conversions (Mem/Temperature/Power), regex-based parsing, precision/trailing-zero handling, and tests. |
| GPU status: trait-based redesign: `src/gpu_status/mod.rs`, `src/gpu_status/fields.rs`, (deleted) `src/gpu_status.rs` | Removed monolithic `GpuStatusData`; added `GpuStatus` trait with per-field getters, `GpuHandle` wrapper, `Field`/U8/Mem enums, and unit/precision parse errors. |
| Backends: NVidia, AMD, Intel: `src/nvidia.rs`, `src/amd.rs`, `src/intel.rs` | Updated NVIDIA and AMD to implement `GpuStatus` per-field APIs; added `IntelGpuStatus` using DRM client manager for render/video utilization and process tracking. |
| DRM discovery & clients: `src/drm.rs`, `src/drm/client.rs`, `src/drm/device.rs` | New DRM module: device scanning (udev/HWDB -> `DrmDevice`) and per-process engine tracking (`DrmClient`, `ClientManager`, `EngineStats`) to feed Intel backend and device metadata. |
| Config & formatting defaults: `README.md`, `config.example.toml`, `src/config/structs.rs` | Tooltip/text default formats updated to unit specifiers (e.g., `mem_used:MiB.0`, `temperature:c`, `power:w`, `tx:MiB.3`); `TooltipConfig::retain_lines_with_values` now accepts `&GpuHandle` and resolves fields via parser segments. |
| Main app flow & args: `src/main.rs` | Switched to DRM-based GPU selection, added `--gpu` index and `text_format`, replaced `Instance` with `get_handle` producing `GpuHandle`, and integrated per-interval handle updates and formatting. |
| Large deletions / replacements: `src/gpu_status.rs`, `src/formatter.rs` | Removed legacy monolithic modules (gpu_status and old formatter); their functionality replaced by the new modular trait-and-formatter subsystems. |

Sequence Diagram

sequenceDiagram
    participant Main as main.rs
    participant DRM as DRM Scanner
    participant Hwdb as udev Hwdb
    participant GPU as GPU Backend<br/>(GpuStatus impl)
    participant Handle as GpuHandle
    participant Formatter as Formatter

    Main->>DRM: scan_drm_devices()
    DRM->>Hwdb: query model/vendor (Hwdb)
    DRM-->>Main: Vec<DrmDevice>

    Main->>Main: select GPU by index
    Main->>GPU: instantiate backend (Nvidia/Amd/Intel)
    Main-->>Handle: wrap backend in GpuHandle

    loop per interval
        Main->>Handle: update(procs)
        Handle->>GPU: update(procs)
        GPU-->>Handle: cache field values

        Main->>Formatter: State::try_from_format(format_str)
        Formatter->>Formatter: parse FormatSegments (field, unit, precision)

        Main->>Handle: get_text/get_tooltip(state)
        Handle->>GPU: get_u8_field/get_mem_field/get_temperature/get_power
        GPU-->>Handle: field values or Unavailable

        Handle->>Formatter: assemble using field values
        Formatter-->>Main: rendered output
    end

Estimated code review effort

🎯 4 (Complex) | ⏱️ ~50 minutes

Poem

🐰 I hopped through /dev/dri to learn your name,
Units and fields parsed, no two outputs the same.
Intel joined the garden, DRM led the way,
Handles clutch the data — precise values at play.
Hop, format, render — a carrot-coded hooray!

🚥 Pre-merge checks | ✅ 4 | ❌ 2

❌ Failed checks (2 warnings)

| Check name | Status | Explanation | Resolution |
|---|---|---|---|
| Docstring Coverage | ⚠️ Warning | Docstring coverage is 64.44% which is insufficient. The required threshold is 80.00%. | Write docstrings for the functions missing them to satisfy the coverage threshold. |
| Merge Conflict Detection | ⚠️ Warning | ❌ Merge conflicts detected (10 files): ⚔️ `.github/workflows/ci.yml` (content) ⚔️ `CHANGELOG.md` (content) ⚔️ `Cargo.lock` (content) ⚔️ `Cargo.toml` (content) ⚔️ `README.md` (content) ⚔️ `config.example.toml` (content) ⚔️ `src/amd.rs` (content) ⚔️ `src/config/structs.rs` (content) ⚔️ `src/main.rs` (content) ⚔️ `src/nvidia.rs` (content) These conflicts must be resolved before merging into `master`. | Resolve conflicts locally and push changes to this branch. |

✅ Passed checks (4 passed)

| Check name | Status | Explanation |
|---|---|---|
| Description Check | ✅ Passed | Check skipped - CodeRabbit’s high-level summary is enabled. |
| Title check | ✅ Passed | The title '[WIP] Support Intel GPU (DRM)' is specific and directly reflects the main objective of adding Intel GPU support via DRM, which is the primary feature in this changeset. |
| Linked Issues check | ✅ Passed | The PR successfully addresses both issue `#101` (Intel GPU support) and `#106` (GPU selection via --gpu). Code changes implement DRM-based GPU discovery, Intel GPU monitoring via render/video engine utilization computation, and CLI parameter for GPU selection. |
| Out of Scope Changes check | ✅ Passed | All major changes align with objectives: refactored architecture (GpuStatus trait), DRM device scanning, Intel GPU support via ClientManager, NVIDIA/AMD updates for trait compatibility, and CLI --gpu parameter. No unrelated modifications detected. |

coderabbitai

Actionable comments posted: 19

Caution

Some comments are outside the diff and can’t be posted inline due to platform limitations.

⚠️ Outside diff range comments (1)

README.md (1)
9-9: ⚠️ Potential issue | 🟡 Minor

Intel GPU support is not mentioned in the compatibility description.

This PR adds Intel iGPU monitoring, but line 9 still reads "compatible with both NVIDIA and AMD cards." This should be updated to include Intel.
Proposed fix
-It is compatible with both NVIDIA and AMD cards
+It is compatible with NVIDIA, AMD, and Intel cards

🤖 Fix all issues with AI agents

In `@README.md`:
- Around line 88-103: Add an "Intel" support column to the Available fields
table and update each row (for fields like `gpu_utilization`, `mem_used`,
`mem_total`, `mem_rw`, `mem_utilization`, `decoder_utilization`,
`encoder_utilization`, `temperature`, `power`, `p_state`, `p_level`,
`fan_speed`, `tx`, `rx`) to reflect Intel support; at minimum mark
`gpu_utilization` as supported (✅) and set unknown/unsupported fields to ❌ (or
leave blank) until their support is confirmed.

In `@src/amd.rs`:
- Around line 25-31: Replace direct index access to hw_monitors (currently using
&self.handle.hw_monitors[0]) with a safe lookup (e.g.
self.handle.hw_monitors.first().ok_or(/* appropriate amdgpu_sysfs::error::Error
*/)?) in fan_percentage, get_temperature, and get_power; if no monitor exists
return an appropriate amdgpu_sysfs::error::Error instead of panicking so each
function (fan_percentage, get_temperature, get_power) gracefully handles empty
hw_monitors.

In `@src/drm/client.rs`:
- Around line 112-119: The should_manage method can panic because it calls
target.file_name().unwrap(); update should_manage (matching FDTarget::Path and
using self.devnames) to handle the Option from Path::file_name() safely instead
of unwrapping. Replace the unwrap usage with a safe check (e.g., map_or or if
let Some(fname) = target.file_name()) that returns false when file_name() is
None, then compare fname to each entry in self.devnames; keep the FDTarget::Path
and devnames symbols to locate the change.
- Around line 22-47: In update_engines, avoid the unwraps when parsing fdinfo
lines for RENDER_ENGINE_KEY and VIDEO_ENGINE_KEY: replace the current
line.split_whitespace().nth(1).unwrap().parse().unwrap() with a safe parse flow
that checks nth(1) and parse::<u64>().ok(), skip the line (continue) on None or
parse failure, and only create EngineSample::new(value) and call
self.render_engine.update_utilization(...) or
self.video_engine.update_utilization(...) when parsing succeeds; ensure
is_render_updated/is_video_updated are set only after a successful parse so
malformed lines are ignored rather than causing a panic.
- Around line 159-165: The read_id function currently uses unwrap() twice and
will panic for malformed "drm-client-id" lines; change the chain to safely
handle missing tokens and parse errors by replacing the unwraps with fallible
combinators (e.g., use .split_whitespace().nth(1).and_then(|s|
s.parse::<u32>().ok()) or .filter_map(...) so a missing second token or parse
failure returns None instead of panicking), keeping the function signature and
behavior of returning Option<u32> intact and only returning Some(id) when a
valid u32 is present.
- Around line 128-142: In EngineStats::update_utilization, avoid unsigned
underflow and divide-by-zero by computing delta_used with saturating_sub (or
checked_sub and treating negative/None as 0) on sample.value and
last_sample.value, and skip or set utilization to None if delta_sample (computed
from sample.sample_finished_at.duration_since(...).as_nanos()) is 0; update
self.utilization only when delta_sample > 0 and delta_used > 0, then cast to f64
for the division. Use the symbols update_utilization, sample.value,
last_sample.value, delta_used, delta_sample, and utilization when applying the
fix.
- Around line 66-78: The call to client.update_engines().unwrap() in
drm::client::Client::update can panic if a process exits between
scan_process_fds and update_engines; replace the unwrap with proper error
handling in update (e.g., match or if let Err(e)) so IO errors from
update_engines() are logged and that client is skipped/left unchanged rather
than crashing the process. Locate the update method and the update_engines
method on the client struct, catch and log errors returned by update_engines(),
and ensure the loop continues even when update_engines() fails (do not propagate
the panic).
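The panic-free parsing requested for `src/drm/client.rs` could look roughly like the sketch below. It is not the PR's code: `FdInfo`, `first_number`, and `parse_fdinfo` are hypothetical names, and the engine keys `drm-engine-render:` and `drm-engine-video:` are assumed from the kernel's DRM fdinfo format, since the review only refers to them as RENDER_ENGINE_KEY and VIDEO_ENGINE_KEY.

```rust
/// Values pulled out of one DRM fdinfo file (hypothetical type).
#[derive(Debug, Default, PartialEq)]
struct FdInfo {
    client_id: Option<u32>,
    render_busy_ns: Option<u64>,
    video_busy_ns: Option<u64>,
}

/// Parses the first whitespace-separated token after a key.
/// A missing or malformed token yields `None` instead of panicking.
fn first_number<T: std::str::FromStr>(rest: &str) -> Option<T> {
    rest.split_whitespace().next()?.parse().ok()
}

/// Scans fdinfo text line by line; unknown keys are skipped.
fn parse_fdinfo(text: &str) -> FdInfo {
    let mut info = FdInfo::default();
    for line in text.lines() {
        if let Some(rest) = line.strip_prefix("drm-client-id:") {
            info.client_id = first_number(rest);
        } else if let Some(rest) = line.strip_prefix("drm-engine-render:") {
            // Assumed key name, see the note above.
            info.render_busy_ns = first_number(rest);
        } else if let Some(rest) = line.strip_prefix("drm-engine-video:") {
            // Assumed key name, see the note above.
            info.video_busy_ns = first_number(rest);
        }
    }
    info
}
```

A caller would then update an engine's statistics only when the corresponding field is `Some`, so malformed lines are ignored rather than fatal.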

In `@src/drm/device.rs`:
- Around line 44-46: The error messages for hwdb query failures contain a typo
"exits" instead of "exists"; update the strings used in the two query_one calls
that produce those errors (the model_name assignment that calls
query_one(modalias, "ID_MODEL_FROM_DATABASE") and the vendor_name assignment
that calls query_one(modalias, "ID_VENDOR_FROM_DATABASE")) to replace "No
model/vendor name result exits in database" with "No model/vendor name result
exists in database".
- Around line 29-36: get_dri_card_index currently uses to_str().unwrap() and
parse().unwrap() which can panic on non-UTF-8 sysnames or numbers > u8::MAX;
change the implementation to be fully fallible by replacing unwraps with safe
conversions (dev.sysname().to_str() already returns an Option, so chain it with
strip_prefix("card") and parse::<u8>().ok()), e.g., iterate self.children and
use find_map or filter_map chaining to return None on invalid UTF-8 or parse
failures so the function returns an Option<u8> without panicking.
- Around line 93-107: PciId::from_device currently uses unwraps that will panic
on malformed PCI_ID; change it to propagate None safely: replace
to_str().unwrap() with to_str()? (to_str() already returns an Option), replace
split_once(':').unwrap() with split_once(':')?, and parse the numbers with
u16::from_str_radix(..., 16).ok()?. That is, obtain vendor_str/device_str via
split_once(':')? (use ? for Option-producing operations) and apply .ok()? to
vendor_id/device_id before constructing Some(Self { vendor_id, device_id }).
This keeps the function returning None for any invalid PCI_ID instead of
panicking.
- Around line 17-26: DrmDevice::new currently calls
PciId::from_device(...).unwrap(), which will panic for non-PCI DRM devices;
change the constructor to propagate or handle the absence of a PCI id by either
(A) returning Result<DrmDevice, E> from DrmDevice::new and returning an
appropriate error when PciId::from_device returns None, or (B) change the struct
field pci_id: PciId to pci_id: Option<PciId> and store PciId::from_device(...)
directly (no unwrap), then update callers (notably scan_drm_devices) to filter
out or handle devices with None pci_id; update any code that assumed pci_id to
handle the Result or Option accordingly.
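As a sketch of the fallible PCI ID parsing described above, assuming the udev PCI_ID property has the form `VVVV:DDDD` in hexadecimal (for example `8086:9A49`). The `parse` constructor here takes a plain string and is hypothetical; the PR's `PciId::from_device` reads the property from a udev device instead.

```rust
/// Vendor and device ID pair, mirroring the field names used in the review.
#[derive(Debug, PartialEq)]
struct PciId {
    vendor_id: u16,
    device_id: u16,
}

impl PciId {
    /// Parses a `VVVV:DDDD` hex pair; any malformed input yields `None`.
    fn parse(pci_id: &str) -> Option<Self> {
        let (vendor_str, device_str) = pci_id.split_once(':')?;
        let vendor_id = u16::from_str_radix(vendor_str, 16).ok()?;
        let device_id = u16::from_str_radix(device_str, 16).ok()?;
        Some(Self { vendor_id, device_id })
    }
}
```

Every failure mode (missing colon, non-hex digits, values that do not fit in 16 bits) collapses to `None`, so callers can decide whether to skip the device or report an error.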

In `@src/gpu_status/fields.rs`:
- Line 98: Fix the typo in the doc comment that currently reads "Render engine
utilization in perccent." — update the comment text to "Render engine
utilization in percent." to correct the spelling for the field whose doc comment
contains that exact phrase.

In `@src/intel.rs`:
- Around line 45-57: The get_u8_field function currently multiplies a decimal
utilization (from compute_render_utilization and compute_video_utilization) by
100 and casts to u8, which can yield >100 (or saturate to 255); clamp the
decimal to the 0.0–1.0 range before scaling so the returned percentage is always
0–100: compute the decimal as you already do (using compute_render_utilization /
compute_video_utilization and the U8Field match), then replace the direct
scaling/cast with a clamped value (e.g., let pct = (decimal.clamp(0.0, 1.0) *
100.0).round() and cast pct to u8), ensuring the function get_u8_field always
returns a 0–100 u8 for GpuUtilization/RenderUtilization/VideoUtilization.
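The clamping step can be sketched in isolation. The helper name `fraction_to_percent` is hypothetical; it only demonstrates the clamp, scale, round, cast order suggested above.

```rust
/// Converts a utilization fraction to a whole percentage in 0..=100.
/// Out-of-range input is clamped first, so the cast can never produce
/// a value above 100. NaN input ends up as 0 because a NaN-to-integer
/// `as` cast yields 0.
fn fraction_to_percent(fraction: f64) -> u8 {
    (fraction.clamp(0.0, 1.0) * 100.0).round() as u8
}
```

Without the clamp, a fraction such as 1.7 would be cast to 170, and anything above 2.55 would saturate at 255.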

In `@src/main.rs`:
- Around line 24-53: get_handle currently constructs NvidiaGpuStatus via
NvidiaGpuStatus::new() which internally always selects NVML device index 0;
change the constructor usage and implementation so the DRM device's PCI bus ID
is passed through and used to look up the NVML device (use
nvml.device_by_pci_bus_id()) instead of device_by_index(0). Specifically, modify
get_handle to extract the DRM device PCI bus id from the DrmDevice (for the
NVIDIA branch) and call a new NvidiaGpuStatus::with_pci(pci_bus_id: String) (or
add a parameter to NvidiaGpuStatus::new) and update src/nvidia.rs to resolve the
NVML device via nvml.device_by_pci_bus_id(pci_bus_id) so the selected --gpu DRM
index maps to the correct NVML device.
- Around line 25-29: The code currently calls
gpu.get_vendor_name(...).into_string().unwrap() which will panic if the OsString
is non-UTF-8; change this to convert safely (e.g., use to_string_lossy()) and
then lowercase to produce a String. Locate the vendor_name assignment (variable
vendor_name and the call gpu.get_vendor_name) and replace the
into_string().unwrap() usage with a non-panicking conversion such as
.to_string_lossy().into_owned(), which substitutes U+FFFD for invalid sequences
(or otherwise handle the Err from into_string()) before calling .to_lowercase()
so non-UTF-8 vendor names do not panic.
- Around line 128-143: print_gpu currently unwraps nodes and OsStr conversions
and can panic if gpu.children is empty or sysnames are non-UTF-8; modify
print_gpu to collect child names safely (e.g., map each
dev.sysname().to_string_lossy().into_owned()), join them with commas, and handle
the empty case by printing something like "<none>" instead of calling
nodes.next().unwrap(); keep using gpu.get_model_name(hwdb)? for model lookup but
replace all .to_str().unwrap() with to_string_lossy() conversions so no panics
occur when sysnames are non-UTF-8.
- Around line 112-118: The loop currently early-continues when
procfs::process::all_processes() returns Err, causing a tight spin; modify the
loop around procfs::process::all_processes() (the Ok(procs) pattern) so that on
error you call std::thread::sleep for a short duration (e.g., a few hundred
milliseconds) before continuing, rather than immediately continuing; ensure you
keep the existing error handling for gpu_status_handle.data.update and only add
the sleep in the error branch that handles procfs failures to avoid burning CPU.

In `@src/nvidia.rs`:
- Around line 215-235: The function has_running_processes currently hardcodes
"/dev/nvidia0"; change its signature to accept the target device (e.g.,
device_path: &str or device_index: usize) and use that value when comparing
FDTarget::Path in the fd loop, so that the comparison uses the actual GPU device
(format "/dev/nvidia{index}" or a path derived from the GPU's bus ID supplied by
NvidiaGpuStatus::new()). Update callers (including NvidiaGpuStatus::new) to pass
the correct device path/index derived from the selected GPU instead of relying
on index 0, and keep the existing proc.fd() and FDTarget::Path checks intact.
- Around line 43-45: The device() helper currently unwraps NVML lookup and will
panic if the GPU is unavailable; change fn device(&self) -> Device<'_> to return
a Result<Device<'_>, GetFieldError> (or appropriate error type) and convert the
nvml.device_by_index(0).unwrap() into a fallible call that maps NVML errors to
GetFieldError::Unavailable; then update all callers (the field getters) to use
self.device()? so the error is propagated instead of panicking; ensure signature
changes and error mapping are applied to functions that call device() so they
return Result and use the ? operator.

🧹 Nitpick comments (9)

src/intel.rs (1)
45-54: Both utilizations are recomputed on every get_u8_field call, even if only one is needed.

Lines 46-47 always compute both render_utilization and video_utilization regardless of which U8Field variant is requested. This iterates the clients list twice per call. For RenderUtilization or VideoUtilization, one of the two sums is wasted.
Proposed refactor: compute lazily
     fn get_u8_field(&self, field: U8Field) -> Result<u8, GetFieldError> {
-        let render_utilization = self.compute_render_utilization();
-        let video_utilization = self.compute_video_utilization();
-
         let decimal = match field {
-            U8Field::GpuUtilization => render_utilization.max(video_utilization),
-            U8Field::RenderUtilization => render_utilization,
-            U8Field::VideoUtilization => video_utilization,
+            U8Field::GpuUtilization => {
+                self.compute_render_utilization().max(self.compute_video_utilization())
+            }
+            U8Field::RenderUtilization => self.compute_render_utilization(),
+            U8Field::VideoUtilization => self.compute_video_utilization(),
             _ => return Err(GetFieldError::BrandUnsupported),
         };
src/gpu_status/fields.rs (1)

112-123: MemField conflates memory quantities with PCIe throughput.

Tx and Rx are throughput rates, not memory amounts. Grouping them under MemField works because they share the Information UOM type, but the name is misleading. Consider renaming to InformationField or similar if this grows. Fine for now in a WIP PR.
src/config/structs.rs (1)
93-97: Parse errors in format segments are silently treated as "unavailable", removing the line.

On Line 96, if Field::try_from fails (e.g., user wrote {temperature:xyz} — an invalid unit), map_or(true, ...) silently drops that line instead of surfacing the parse error. This could mask typos in user-defined tooltip formats.

Consider logging a warning here for Err cases, similar to how parse() in src/formatter/mod.rs prints a warning for unknown fields.
Proposed change
             let has_unavailable = re.captures_iter(line).any(|caps| {
                 let format_segments = FormatSegments::from_caps_unchecked(&caps);
-                Field::try_from(format_segments).map_or(true, |f| handle.is_field_unavailable(f))
+                match Field::try_from(format_segments) {
+                    Ok(f) => handle.is_field_unavailable(f),
+                    Err(e) => {
+                        eprintln!("Warning: failed to parse field `{}`: {e}", format_segments.field);
+                        true
+                    }
+                }
             });
src/amd.rs (1)
62-69: Unnecessary eyre! allocation — error is immediately discarded.

The eyre!(format!(...)) on Lines 65-68 creates an error that is immediately mapped away by .map_err(|_| GetFieldError::Unavailable). Simplify to ok_or(GetFieldError::Unavailable) directly.
Proposed fix
         let temp = temps
             .iter()
             .find(|t| t.0 == TEMP_SENSOR_NAME)
-            .ok_or(eyre!(format!(
-                "No \"{}\" temperature sensor found",
-                TEMP_SENSOR_NAME
-            )))
-            .map_err(|_| GetFieldError::Unavailable)?;
+            .ok_or(GetFieldError::Unavailable)?;
         let temp = temp.1.current.ok_or(GetFieldError::Unavailable)?;
src/gpu_status/mod.rs (3)
54-56: Consider restricting data field visibility to pub(crate).

GpuHandle.data is public, exposing the internal Box<dyn GpuStatus> to external consumers. Since main.rs accesses handle.data.update(procs), pub(crate) would be sufficient to limit exposure.
Proposed change
 pub struct GpuHandle {
-    pub data: Box<dyn GpuStatus>,
+    pub(crate) data: Box<dyn GpuStatus>,
 }
137-159: is_field_unavailable calls the same getters that write_field will call later — double work.

During tooltip rendering, retain_lines_with_values calls is_field_unavailable for each field, then assemble calls write_field which calls the same getters again. For fields backed by NVML or sysfs calls, this doubles the I/O. Since this only happens once at startup (not in the hot loop), it's acceptable, but worth noting if this is ever used in the render loop.

161-171: compute_mem_usage is correct but lacks a defensive clamp.

Line 170: (ratio * 100.0).round() as u8. If mem_used ever exceeds mem_total (e.g., shared/swap memory accounting), the result would exceed 100, because a float-to-integer cast saturates (at 255 here) rather than producing a valid percentage. A .clamp(0.0, 100.0) before the cast would be defensive.
Proposed fix
         let ratio: f32 = (mem_used / mem_total).into();
-        Some((ratio * 100.0).round() as u8)
+        Some((ratio * 100.0).round().clamp(0.0, 100.0) as u8)
src/formatter/mod.rs (1)
71-73: get_regex() recompiles the regex on every call.

While it's only called a few times (not in the hot loop), regex compilation isn't free. Consider using std::sync::LazyLock (stable since Rust 1.80) or once_cell to compile once.
Example using LazyLock
+use std::sync::LazyLock;
+
+static FORMAT_RE: LazyLock<Regex> = LazyLock::new(|| {
+    Regex::new(r"\{(\w+)(?::(\w+)(?:\.(\d+))?)?\}").unwrap()
+});
+
 pub fn get_regex() -> Regex {
-    Regex::new(r"\{(\w+)(?::(\w+)(?:\.(\d+))?)?\}").unwrap()
+    FORMAT_RE.clone()
 }
Or return &'static Regex and update callers to use a reference.
src/nvidia.rs (1)

48-78: Multiple self.device() calls per field query adds NVML overhead.

Each field getter calls self.device() which performs nvml.device_by_index(0). While NVML handle lookups are fast, this is called many times per update cycle (once per field in the format string). Consider caching the Device in update() if profiling shows this matters.

Use Nvml::device_by_pci_bus_id() instead of Nvml::device_by_index().

coderabbitai

Actionable comments posted: 1

🤖 Fix all issues with AI agents

In `@src/nvidia.rs`:
- Around line 232-238: The loop uses path.file_name().unwrap() which can panic
for paths like ".."; replace the unwrap with a safe Option check similar to
client.rs: test the file_name() via is_some_and (or map/and_then) and only
compare when present. Update the FDTarget::Path match in this loop to use
path.file_name().is_some_and(|fname| devnames.iter().any(|n| n == fname))
(referencing FDTarget::Path, path.file_name(), and devnames) so the function
returns true only when a file_name exists and matches.
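A standalone sketch of the safe check follows. The function name `targets_managed_device` is hypothetical, and `devnames` is assumed to hold node names such as `card0` and `renderD128`.

```rust
use std::ffi::OsString;
use std::path::Path;

/// Returns true only when the path has a final component and that
/// component matches one of the managed device names.
fn targets_managed_device(target: &Path, devnames: &[OsString]) -> bool {
    target
        .file_name()
        .is_some_and(|fname| devnames.iter().any(|name| name.as_os_str() == fname))
}
```

Paths without a file name, such as `..` or `/`, simply evaluate to false instead of panicking.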

🧹 Nitpick comments (1)

src/intel.rs (1)

45-57: Both utilization values are computed even when only one is needed.

compute_render_utilization() and compute_video_utilization() are both called unconditionally on Lines 46-47, even for RenderUtilization or VideoUtilization which only use one. Consider computing lazily inside the match arms.

Proposed refactor

     fn get_u8_field(&self, field: U8Field) -> Result<u8, GetFieldError> {
-        let render_utilization = self.compute_render_utilization();
-        let video_utilization = self.compute_video_utilization();
-
         let decimal = match field {
-            U8Field::GpuUtilization => render_utilization.max(video_utilization),
-            U8Field::RenderUtilization => render_utilization,
-            U8Field::VideoUtilization => video_utilization,
+            U8Field::GpuUtilization => {
+                self.compute_render_utilization().max(self.compute_video_utilization())
+            }
+            U8Field::RenderUtilization => self.compute_render_utilization(),
+            U8Field::VideoUtilization => self.compute_video_utilization(),
             _ => return Err(GetFieldError::BrandUnsupported),
         };

Bowen951209 and others added 30 commits January 16, 2026 15:50

Implement customizable memory units

7c8bd8a

- Add byte-unit dependency.
- Parse display units from template suffix, e.g., `{mem_used_MiB}`.
- Small optimizations.

Add doc

2fe24e0

Fix tests with new format

6edf14b

Support temperature units

d17757c

Add `uom` dependency and support degree C, degree F, and Kelvin for temperature.

Support power unit (Watt and kilowatt)

780fa74

Explicit use Temperature::new::<degree_celsius>

17b8697

Update README with available units

003b3df

Update tooltip arg doc with available units

3da46d3

Update config.example.toml with available units

f6ae09f

Format README

d963ed3

Co-authored-by: coderabbitai[bot] <136622811+coderabbitai[bot]@users.noreply.github.com>

Merge branch 'master' into mem-unit-PolpOnline#117

2f66d75

- Replace `byte_unit` crate with `uom` for memory and PCIe units.
- Change field-unit separator from `_` to `:` in templates.

Note: This is a large merge refactor.

Add more memory units

8cbe1a8

Added KB, MB, GB, Kib, Mib, Gib, Kb, Mb, Gb.

Move units enums to formatter::units module

8b96c57

Optimize field display by directly write

8783461

Remove the `to_string()`s and directly write displays to buffer.

Rewrite DisplayValue enum for static dispatch

307918b

Also renamed `DisplayValue` to `SimpleDisplay`.

Move field things in formatter module to formatter::fields

210f08c

Simplify code

4586fed

Adding docs and implementing todo

99fba6c

Update README with supported memory units

72f82a9

Fix power unit strum serialize

6a9fc22

Co-authored-by: coderabbitai[bot] <136622811+coderabbitai[bot]@users.noreply.github.com>

More docs and comments

f774648

Implement Configurable Decimal Places

e82eeb6

- Support configurable decimal places for memory, temperature, and power fields.
- Users can specify decimal places using a format like `{temperature:f.2}`.
- The intended precision is specified after the *dot*.
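To make the `{field:unit.precision}` shape concrete, here is a hand-rolled sketch of parsing one placeholder and applying the precision. The project itself uses a regex-based parser; `Placeholder`, `parse_placeholder`, and `render` are hypothetical names used only for illustration.

```rust
/// One parsed template placeholder (hypothetical type).
#[derive(Debug, PartialEq)]
struct Placeholder<'a> {
    field: &'a str,
    unit: Option<&'a str>,
    precision: Option<usize>,
}

/// True for a non-empty run of alphanumerics and underscores.
fn is_word(s: &str) -> bool {
    !s.is_empty() && s.chars().all(|c| c.is_alphanumeric() || c == '_')
}

/// Accepts only a non-empty run of ASCII digits.
fn parse_precision(digits: &str) -> Option<usize> {
    if digits.is_empty() || !digits.bytes().all(|b| b.is_ascii_digit()) {
        return None;
    }
    digits.parse().ok()
}

/// Parses `{field}`, `{field:unit}`, or `{field:unit.precision}`.
fn parse_placeholder(s: &str) -> Option<Placeholder<'_>> {
    let inner = s.strip_prefix('{')?.strip_suffix('}')?;
    let Some((field, spec)) = inner.split_once(':') else {
        return is_word(inner).then_some(Placeholder { field: inner, unit: None, precision: None });
    };
    let (unit, precision) = match spec.split_once('.') {
        Some((unit, digits)) => (unit, Some(parse_precision(digits)?)),
        None => (spec, None),
    };
    (is_word(field) && is_word(unit)).then_some(Placeholder { field, unit: Some(unit), precision })
}

/// Formats with the requested precision, or all available digits if none is given.
fn render(value: f64, precision: Option<usize>) -> String {
    match precision {
        Some(p) => format!("{:.*}", p, value),
        None => value.to_string(),
    }
}
```

For instance, `{temperature:f.2}` parses to the field `temperature`, the unit `f`, and a precision of 2, so a value of 36.6666 would render as `36.67`.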

Fix doc

3e277f3

Update README, doc, and example config

352074f

Revert "Update README, doc, and example config"

bd0cd10

This reverts commit 352074f.

Move UnitParseError struct from units module to fields module

f24b9b7

Add test_trim_trailing_zeros_without_decimal

5b69801

Fix trim_trailing_zeros modifying previous buffer content

4f23a59

trim_trailing_zeros now takes a protected_len to avoid trimming earlier segments.

Fix trim_trailing_zeros by finding LAST dot pos

93e45db

A test is added to cover that.
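The two fixes above (a protected prefix, and searching for the last dot) can be illustrated with a small sketch. This is a hypothetical reimplementation for explanation only, not the function from the PR, and it assumes the freshly written number sits at the very end of the buffer.

```rust
/// Trims trailing zeros (and a then-dangling '.') from the number at the
/// end of `buf`, never touching the first `protected_len` bytes.
fn trim_trailing_zeros(buf: &mut String, protected_len: usize) {
    // Look for the LAST dot, and only inside the unprotected tail.
    let Some(dot_offset) = buf.get(protected_len..).and_then(|seg| seg.rfind('.')) else {
        return; // no decimal point in the new segment: nothing to trim
    };
    let dot_pos = protected_len + dot_offset;
    // Number of fractional digits left after dropping trailing zeros.
    let kept_digits = buf[dot_pos + 1..].trim_end_matches('0').len();
    let new_len = if kept_digits == 0 {
        dot_pos // drop the dot as well, e.g. "3.000" becomes "3"
    } else {
        dot_pos + 1 + kept_digits
    };
    buf.truncate(new_len);
}
```

The protected prefix matters when the new segment has no decimal point: in a buffer like `1.50 | 100`, an unprotected search would find the dot in the earlier `1.50` and eat into later text.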

Make precision optional

a4ce4d8

If precision of a field is not provided, display all available digits.

Bowen951209 added 7 commits February 10, 2026 09:42

Make PowerUnit strum parsing case-insensitive

5194878

Update temperature units in README with actual values

557e2bf

Merge branch 'mem-unit-PolpOnline#117' into intel

a111a55

Add drm module and implement scan_drm_devices

c40e965

Sort the drm devices returned in scan_drm_devices()

5fd38d5

Sort the drm devices based on their child named `cardX`.

Inline the sort_by_dri_index() function

47098b1

Simplify instance creating and implement DrmDevice::get_model_name

6bc5659

- Remove Instance struct. Now we directly call get_handle().
- Nvml and AmdGpuHandle contexts are now created and held inside NvidiaGpuStatus and AmdGpuStatus.
- Print GPU device name in main().

Bowen951209 added 10 commits February 11, 2026 17:32

Fix CI build by installing libudev-dev and pkg-config

d64085d

chore: rearrange function

16938e4

Introduce a safer PciId type

cf7746c

Create drm::client module.

c6c7443

- Add libc dependency.
- Move code in old drm.rs to drm/device.rs.
- Some infrastructure for drm clients.

Clean up spaghetti in ClientManager::update() by extracting functions

beb6b15

Implement fd render engine utilization tracking

f0d02ee

Move read_id() to free function

e7f415f

Add Intel GPU support

7f6ef0a

- Introduce IntelGpuStatus and integrate into main flow.
- Refactor ClientManager, EngineStats, EngineSample; improve FD scanning and manage device names.
- Move NVIDIA process check.

Remove endless test

b04ca18

Add video engine

d1fbf9b

- Take max of render and video engine utilization as GPU utilization.
- Add U8Field::RenderUtilization and U8Field::VideoUtilization.
- Remove GetFieldError::NotReady because it's never used.
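
Putting the PR's formula (engine busy time within the window divided by the window length) together with the max rule, a minimal sketch could look like this. It assumes each engine exposes a cumulative busy-time counter sampled twice; the function and parameter names are hypothetical and do not mirror `EngineStats` or `EngineSample`.

```rust
use std::time::Duration;

/// Busy fraction of one engine over a sampling window, as a percentage.
/// `busy_prev` and `busy_now` are cumulative engine-busy times taken at the
/// start and end of the window (assumed inputs).
fn engine_utilization(busy_prev: Duration, busy_now: Duration, delta: Duration) -> u8 {
    if delta.is_zero() {
        return 0; // avoid dividing by zero on a degenerate window
    }
    // Saturate so a counter that goes backwards does not panic.
    let used = busy_now.saturating_sub(busy_prev);
    let pct = used.as_secs_f64() / delta.as_secs_f64() * 100.0;
    pct.clamp(0.0, 100.0).round() as u8
}

/// Overall GPU utilization: the busier of the two engines.
fn gpu_utilization(render: u8, video: u8) -> u8 {
    render.max(video)
}
```

Taking the maximum rather than the sum keeps the result within 0 to 100 even when both engines are busy at once.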

Bowen951209 changed the title ~~[WIP] Support Intel GPU~~ [WIP] Support Intel GPU (DRM) Feb 14, 2026

Bowen951209 marked this pull request as ready for review February 15, 2026 00:29

coderabbitai bot reviewed Feb 15, 2026

Bowen951209 added 7 commits February 16, 2026 08:17

Error handling in ClientManager::update()

f97e552

Replace unwrap file_name with is_some_and

d0ba582

Unwrap outside DrmDevice::new()

3fb628e

DrmDevice::new() now returns NotPciDeviceError if PciId::from_device returns None.

Fix typo

69b2a98

Fix the hard-coded index 0 in nvidia

2eb9aaf

- NvidiaGpuStatus::new() now takes a bus_id instead of getting it from the NVML device at index 0.
- nvidia::has_running_processes now checks the devnames, which are usually `card*` and `renderD*`, instead of `nvidia0`.
- main::print_gpu() now prints the bus ID.

eprintln and sleep before continuing when all_processes fails

41e0c6e

Fix the hard-coded index 0 in NvidiaGpuStatus::device()

47cc7c0

Use Nvml::device_by_pci_bus_id() instead of Nvml::device_by_index().

coderabbitai bot reviewed Feb 16, 2026

Bowen951209 wants to merge 66 commits into PolpOnline:master from Bowen951209:intel
