Quality of Service
If you have ever wondered why the performance of a launched instance does not match the CPUs requested in its instance type settings, you may want to learn how Kubernetes manages CPU workloads, since this can affect the performance you get.
Basically speaking, if performance is a concern, we can configure the instance type to meet the following conditions so that its containers get exclusive CPUs:
- cpu-manager-policy: static (If the environment is set up by InfuseAI, it is static by default.)
- QoS class of the Pod: Guaranteed; the conditions for a Pod to be Guaranteed are listed below.
For a Pod to be given a QoS class of Guaranteed:
1. Every Container in the Pod must have a memory limit and a memory request, and they must be the same.
2. Every Container in the Pod must have a CPU limit and a CPU request, and they must be the same.
According to the criteria above, we can set the instance type as follows:
- An instance type must have a memory limit and a memory request, and they must be the same.
- An instance type must have a CPU limit and a CPU request, and they must be the same.
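
The two criteria can be sketched as a small check. This is only an illustration that applies the criteria literally as stated above, not Kubernetes code: the function name `is_guaranteed` and the dictionary keys (`cpu_request`, `cpu_limit`, `memory_request`, `memory_limit`) are hypothetical, and values are compared as plain strings rather than parsed quantities.

```python
def is_guaranteed(containers):
    """Return True if every container sets equal requests and limits
    for both CPU and memory (hypothetical, simplified check)."""
    if not containers:
        return False
    for container in containers:
        for resource in ("cpu", "memory"):
            request = container.get(f"{resource}_request")
            limit = container.get(f"{resource}_limit")
            # Both values must be present and identical.
            if request is None or limit is None or request != limit:
                return False
    return True


# Example: an instance type with 2 CPUs and 4Gi of memory,
# where requests and limits match.
instance = {
    "cpu_request": "2", "cpu_limit": "2",
    "memory_request": "4Gi", "memory_limit": "4Gi",
}
print(is_guaranteed([instance]))
```

Because the comparison is on strings, this sketch treats `1` and `1000m` as different values, so it is stricter than a comparison of parsed quantities would be.
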
Only containers that are both part of a Guaranteed pod and have integer CPU requests are assigned exclusive CPUs.
According to the rule above, we can set the instance type as follows:
- An instance type must have an integer CPU request and limit.
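
The integer-CPU rule can be sketched in the same way. The helper names `cpu_millicores` and `gets_exclusive_cpus` are hypothetical, and the sketch assumes that the Pod is already Guaranteed (memory request and limit match as well) and that cpu-manager-policy is static. It only handles plain decimal values such as `2` or `0.5` and the `m` (millicore) suffix.

```python
from decimal import Decimal


def cpu_millicores(quantity):
    """Convert a CPU quantity such as "2", "0.5", or "1500m" to millicores."""
    if quantity.endswith("m"):
        return int(quantity[:-1])
    return int(Decimal(quantity) * 1000)


def gets_exclusive_cpus(cpu_request, cpu_limit):
    """Return True if request equals limit and is a whole number of CPUs."""
    request = cpu_millicores(cpu_request)
    limit = cpu_millicores(cpu_limit)
    return request == limit and request >= 1000 and request % 1000 == 0


print(gets_exclusive_cpus("2", "2"))          # whole CPUs
print(gets_exclusive_cpus("1500m", "1500m"))  # fractional CPUs
```

Under these assumptions, an instance type with 2 CPUs qualifies, while one with 1.5 CPUs stays Guaranteed but is not eligible for exclusive CPUs.
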