Configuring GPU Idle Reclaim

Describes how to configure the GPU idle reclaim, view pod details, and view GPU usage.

You can view frameworks, the number of vGPUs assigned, framework status, priority level, and the idle time threshold in the GPU Control Panel screen. You can also view the pod details and the GPU utilization chart.

To navigate to the GPU Control Panel screen,
  1. Sign in to HPE Ezmeral Unified Analytics Software as Administrator.
  2. In the left navigation bar, click AdministrationResource Management.
You are now in the GPU Control Panel screen.

In this screen, you can configure the policy settings, view the pod details and GPU usage as follows:

Configuring the Policy Settings

To set the policy settings (priority level and idle time threshold) for your framework and workload, click the Actions menu.

In the Policy Settings screen, set the following boxes:
Priority Level
Set the priority level in the range of 8000-10000 where 8000 is the lowest priority and 10000 is the highest priority. For example, a pod with the 8000 priority level will have a low priority compared to the pod with the 10000 priority level.
  • Default priority level: 8000
WARNING
Do not modify framework priority when a framework is in the pending state. Modifying a framework in the pending state causes the correlating framework pods to fail.
Idle Time Threshold
Set the maximum amount of time a vGPU on a workload can be idle before that workload can be preempted (deallocated) automatically by a pending workload.
  • Minimum idle time threshold: 60 seconds
  • Default idle time threshold: 300 seconds

The new policy settings will not be applied to the pods that are currently in the Running or Idle status. These new policy settings will be applied to the new workloads.

Viewing the Pod Details

To view the pod details, click frameworks that are in the Idle or Running status. This will open a pod detail screen. Here, you can see a list of pods, vGPU assigned, status, age of pods, and the GPU utilization chart.

Viewing the GPU Usage

To view the GPU usage, click the GPU utilization chart icon under Actions. In the GPU utilization screen, you can view the GPU usage for the selected period.