Configure Model Deployment
Applicable to Enterprise Edition
Introduction
This page describes advanced configuration options for model deployment.
Increase the Timeout of Model Deployment Endpoint
If your endpoints need more time to handle each request, you can adjust the following timeout settings.
Increase client-body-timeout
See the definition of client-body-timeout in the official documentation.
This setting applies globally to the whole system, not only to model deployment.
Here are the steps to modify this setting:
- Check the namespace where the ingress pod is running:
    kubectl get ns
  The default namespace is ingress-nginx.
- Check the name of the pod:
    kubectl get pods -n ${YOUR_NAMESPACE}
  The name is similar to nginx-ingress-controller-79cfc6dcc5-m2rhw.
- Check the name of the configmap:
    kubectl get pod -n ${YOUR_NAMESPACE} ${YOUR_POD_NAME} -o yaml | grep configmap
  The result is similar to --configmap=${YOUR_NAMESPACE}/nginx-ingress-controller, and the name is nginx-ingress-controller in this case.
- Edit the config:
    kubectl edit cm -n ${YOUR_NAMESPACE} ${YOUR_CONFIGMAP_NAME}
  Add or modify client-body-timeout under the data section:
apiVersion: v1
data:
  client-header-buffer-size: 16k
  enable-vts-status: "true"
  client-body-timeout: "120" # the timeout is 120 seconds
kind: ConfigMap
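
The steps above can also be scripted. The following is a minimal sketch, not part of the official procedure: the three function names are hypothetical helpers, and using kubectl patch with a merge patch in place of the interactive kubectl edit is an assumption that should be checked against your cluster before relying on it. The block only defines functions and changes nothing until one of them is called.

```shell
#!/bin/sh
# Sketch only: the helper names below are hypothetical, not part of any product CLI.

# Extract the configmap name from a controller argument such as
# "--configmap=ingress-nginx/nginx-ingress-controller".
configmap_name_from_arg() {
  # Strip everything up to and including the last "/".
  printf '%s\n' "${1##*/}"
}

# Print the first --configmap=... argument found in the pod spec.
# $1 = namespace, $2 = pod name
find_configmap_arg() {
  kubectl get pod -n "$1" "$2" -o yaml \
    | grep -o -e '--configmap=[^[:space:]"]*' \
    | head -n 1
}

# Set client-body-timeout (in seconds) with a merge patch
# instead of an interactive edit.
# $1 = namespace, $2 = configmap name, $3 = timeout in seconds
set_client_body_timeout() {
  kubectl patch configmap "$2" -n "$1" --type merge \
    -p "{\"data\":{\"client-body-timeout\":\"$3\"}}"
}

# Example usage (commented out so that sourcing this file changes nothing):
#   arg=$(find_configmap_arg "${YOUR_NAMESPACE}" "${YOUR_POD_NAME}")
#   set_client_body_timeout "${YOUR_NAMESPACE}" "$(configmap_name_from_arg "$arg")" 120
```

As with the manual edit, the patched value applies to the whole ingress controller, so it affects every service behind it and not only model deployment endpoints.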