Home / BeaverDeck / Docs / Insights Guide / GPU Insights
GPU Insights
GPU discovery, allocation pressure, placement, quota, fragmentation, and expensive-capacity usage.
Data Evaluated
Node nvidia.com/gpu allocatable capacity, active pod GPU requests, scheduling state, selected namespaces, and ResourceQuotas.
Checks
| Check | When it reports | Alert severity |
|---|---|---|
GPU Capacity Discoverygpu-capacity-discovery |
Selected namespaces contain active GPU requests, but no Node advertises allocatable nvidia.com/gpu capacity. |
Critical |
GPU Allocation Pressuregpu-allocation-pressure |
Active pod GPU requests reach at least 80% of allocatable GPUs on a GPU node or across the selected namespaces. The severity becomes critical at 95%. | Warning at 80%; critical at 95% |
GPU Node Schedulinggpu-node-cordoned |
A Node advertises GPU capacity and is marked unschedulable. | Warning |
GPU Idle Allocationgpu-node-idle-allocation |
A Node advertises GPU capacity, but no active pod in the selected namespaces requests a GPU on that node. | Warning |
GPU Node Workload Mixnon-gpu-pods-on-gpu-node |
An active, non-DaemonSet Pod without a GPU request is scheduled on a GPU node. | Warning |
GPU Fragmentationgpu-fragmentation |
A GPU Pod has been Pending for at least 5 minutes, total free GPU capacity across schedulable GPU nodes is sufficient, but no single node has enough free GPUs for that Pod. | Warning |
Namespace GPU Usagegpu-namespace-usage |
A selected namespace has one or more active GPU workload requests. BeaverDeck reports the total requested GPU count and whether a GPU quota exists. | Informational passing check |
GPU Quotagpu-quota |
A selected namespace has active GPU requests but no ResourceQuota hard limit for requests.nvidia.com/gpu, limits.nvidia.com/gpu, or nvidia.com/gpu. |
Warning |
GPU Pod Pendinggpu-pod-pending |
An active Pod requests GPUs and remains Pending for at least 5 minutes. | Warning |
GPU Pod Readinessgpu-pod-unready |
A GPU-requesting Pod is assigned to a node but remains not Ready for at least 10 minutes from its Ready-condition transition or creation time. | Warning |
GPU Pod Requestsgpu-pod-requests |
An active GPU-requesting Pod has an init or application container without a CPU request or memory request. | Warning |
Open an individual check for risk context, recommended response, and limitations. Passing checks are
visible when Show all checks is enabled in BeaverDeck.