Home / BeaverDeck / Docs / Insights Guide / GPU Insights / GPU Allocation Pressure
GPU Allocation Pressure
BeaverDeck uses this check to identify a specific gpu condition that may need operator review.
| Check type | gpu-allocation-pressure |
|---|---|
| Insights section | GPU Insights |
| Alert severity | Warning at 80%; critical at 95% |
When It Reports A Finding
Active pod GPU requests reach at least 80% of allocatable GPUs on a GPU node or across the selected namespaces. The severity becomes critical at 95%.
Why This Is A Problem
Little unallocated capacity remains for new workloads, failover, rolling updates, or autoscaling. Pending GPU pods become more likely as allocation approaches capacity.
Recommended Response
- Review requested versus allocatable GPUs at node and selected-namespace scope.
- Remove stale workloads and right-size GPU requests where the workload can use smaller allocations.
- Redistribute workloads or add compatible GPU nodes before planned demand or maintenance.
Scope And Limitations
This is scheduling allocation pressure, not actual GPU utilization. A requested GPU can be idle, while a busy GPU still counts as one allocation.
After remediation: refresh GPU Insights and verify the underlying
resource or metric. Suppress the finding only when the condition is intentional and its risk is accepted.