You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Pods logs:
WARNING: failed to read int from file: open /sys/devices/system/cpu/cpu0/online: no such file or directory
I0425 11:40:08.915607 1 exporter.go:103] Kepler running on version: v0.7.12-dirty
I0425 11:40:08.915887 1 config.go:293] using gCgroup ID in the BPF program: true
I0425 11:40:08.915915 1 config.go:295] kernel version: 6.9
I0425 11:40:08.916275 1 rapl_msr_util.go:136] input/output error
I0425 11:40:08.916404 1 power.go:78] Unable to obtain power, use estimate method
I0425 11:40:08.916479 1 redfish.go:169] failed to get redfish credential file path
I0425 11:40:08.917741 1 acpi.go:71] Could not find any ACPI power meter path. Is it a VM?
I0425 11:40:08.917775 1 power.go:79] using none to obtain power
E0425 11:40:08.917816 1 accelerator.go:154] [DUMMY] doesn't contain GPU
E0425 11:40:08.917872 1 exporter.go:154] failed to init GPU accelerators: no devices found
WARNING: failed to read int from file: open /sys/devices/system/cpu/cpu0/online: no such file or directory
I0425 11:40:08.923544 1 exporter.go:84] Number of CPUs: 24
panic: runtime error: invalid memory address or nil pointer dereference
[signal SIGSEGV: segmentation violation code=0x1 addr=0x0 pc=0x87b273]
reacted with thumbs up emoji reacted with thumbs down emoji reacted with laugh emoji reacted with hooray emoji reacted with confused emoji reacted with heart emoji reacted with rocket emoji reacted with eyes emoji
-
Hi all
I have deployed prometheus and kepler on clean kubernetes cluster like described here https://sustainable-computing.io/installation/kepler-helm/
All kepler pods are crashing in loop
Versions:
Pods logs:
WARNING: failed to read int from file: open /sys/devices/system/cpu/cpu0/online: no such file or directory
I0425 11:40:08.915607 1 exporter.go:103] Kepler running on version: v0.7.12-dirty
I0425 11:40:08.915887 1 config.go:293] using gCgroup ID in the BPF program: true
I0425 11:40:08.915915 1 config.go:295] kernel version: 6.9
I0425 11:40:08.916275 1 rapl_msr_util.go:136] input/output error
I0425 11:40:08.916404 1 power.go:78] Unable to obtain power, use estimate method
I0425 11:40:08.916479 1 redfish.go:169] failed to get redfish credential file path
I0425 11:40:08.917741 1 acpi.go:71] Could not find any ACPI power meter path. Is it a VM?
I0425 11:40:08.917775 1 power.go:79] using none to obtain power
E0425 11:40:08.917816 1 accelerator.go:154] [DUMMY] doesn't contain GPU
E0425 11:40:08.917872 1 exporter.go:154] failed to init GPU accelerators: no devices found
WARNING: failed to read int from file: open /sys/devices/system/cpu/cpu0/online: no such file or directory
I0425 11:40:08.923544 1 exporter.go:84] Number of CPUs: 24
panic: runtime error: invalid memory address or nil pointer dereference
[signal SIGSEGV: segmentation violation code=0x1 addr=0x0 pc=0x87b273]
goroutine 1 [running]:
github.com/sustainable-computing-io/kepler/pkg/bpf.(*hardwarePerfEvents).close(0x0)
/workspace/pkg/bpf/exporter.go:274 +0x13
github.com/sustainable-computing-io/kepler/pkg/bpf.(*exporter).Detach(0xc0000006c0)
/workspace/pkg/bpf/exporter.go:195 +0x15a
github.com/sustainable-computing-io/kepler/pkg/bpf.NewExporter()
/workspace/pkg/bpf/exporter.go:58 +0x13e
main.main()
/workspace/cmd/exporter/exporter.go:159 +0x86b
Thanks for help
Yann
Beta Was this translation helpful? Give feedback.
All reactions