Skip to content

Collect pod resource usage and cgroup metrics in must-gather#2493

Closed
evanwang9x wants to merge 4 commits into
skupperproject:mainfrom
evanwang9x:Evan/Improve-Must-Gather
Closed

Collect pod resource usage and cgroup metrics in must-gather#2493
evanwang9x wants to merge 4 commits into
skupperproject:mainfrom
evanwang9x:Evan/Improve-Must-Gather

Conversation

@evanwang9x

@evanwang9x evanwang9x commented Jun 10, 2026

Copy link
Copy Markdown
Contributor

Must gather script can now collect memory and CPU usage via container exec and also captures cgroup limits, requests, and throttling stats.
Outputs the info to "must-gather/namespaces/"namespace"/pods/"pod-name"/"container-name"/"container-name"/resources/". Creates files to track what each does

  1. memory.info: Tracks current memory usage and shows the memory limit.
  2. cpu.stat: Provides CPU stats on total usage time, user/syxstem breakdown, and nr_throttled which shows performance bottlenecks.
Screenshot 2026-06-10 at 1 54 43 PM 3. cpu.max: maximum amount of CPU processing power allotted.
  1. last.terminated: Provides an error message on why the previous container crashed if applicable. Helpful for identifying errors like Out of memory.

  2. memory.pressure / cpu.pressure: Tracks how much time the pod spends waiting for CPU or memory to become available. This helps identify if the pod is slowing down because the node is too busy.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant