Configure Model Deployment

Introduction

This page describes advanced configuration options for model deployment.

Increase the Timeout of Model Deployment Endpoint

If your endpoints need more time to handle each request, you can adjust the following timeout settings.

Increase client-body-timeout

See the official documentation for the definition of client-body-timeout.

This setting applies globally to the whole system, not only to model deployment.

To modify this setting:

1. Find the namespace where the ingress pod is running with kubectl get ns. The default namespace is ingress-nginx.
2. Find the name of the pod with kubectl get pods -n ${YOUR_NAMESPACE}. The name looks similar to nginx-ingress-controller-79cfc6dcc5-m2rhw.
3. Find the name of the ConfigMap with kubectl get pod -n ${YOUR_NAMESPACE} ${YOUR_POD_NAME} -o yaml | grep configmap. The output looks similar to --configmap=${YOUR_NAMESPACE}/nginx-ingress-controller; in this case the name is nginx-ingress-controller.
4. Edit the ConfigMap with kubectl edit cm -n ${YOUR_NAMESPACE} ${YOUR_CONFIGMAP_NAME}. Add or modify client-body-timeout under the data section:

apiVersion: v1
data:
  client-header-buffer-size: 16k
  enable-vts-status: "true"
  client-body-timeout: "120" # timeout in seconds (here, 120 seconds)
kind: ConfigMap
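
If you prefer not to edit the ConfigMap interactively, the same change can be sketched as a non-interactive patch. This is a minimal sketch, not part of the official procedure: the helper function names are hypothetical, it assumes kubectl is already configured for the cluster, and it assumes the default ingress-nginx namespace unless YOUR_NAMESPACE is set.

```shell
#!/bin/sh
# Sketch only: helper names below are hypothetical, not part of PrimeHub.

# Namespace of the ingress controller; ingress-nginx is the default named above.
NAMESPACE="${YOUR_NAMESPACE:-ingress-nginx}"

# Extract the ConfigMap name from a controller argument such as
# --configmap=ingress-nginx/nginx-ingress-controller
configmap_name_from_arg() {
  printf '%s\n' "${1##*/}"
}

# Build a JSON merge patch that sets client-body-timeout.
# The value is in seconds and must be a string in ConfigMap data.
build_timeout_patch() {
  printf '{"data":{"client-body-timeout":"%s"}}' "$1"
}

# Apply the patch to the ConfigMap (requires access to the cluster).
# Usage: set_client_body_timeout <configmap-name> <seconds>
set_client_body_timeout() {
  kubectl patch configmap "$1" -n "$NAMESPACE" --type merge \
    -p "$(build_timeout_patch "$2")"
}
```

For example, after sourcing the script, set_client_body_timeout nginx-ingress-controller 120 would set the timeout to 120 seconds. A merge patch only touches the client-body-timeout key, so other entries under data are left unchanged.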
