# Package a Model Image for Python
This doc shows how to package a model into a format-valid Docker image for the PrimeHub model deployment feature.
The PrimeHub model deployment feature is based on Seldon. This doc draws on the official Seldon documentation and other resources, which are listed in the Reference section.
## Prerequisites

- Docker installed on your local environment
## Prepare the Model and Code (Python)
- Create a `requirements.txt` file and write down all required packages.

  ```
  seldon-core
  keras
  tensorflow
  numpy
  ...
  ```
- Create a `Dockerfile` with the following content.

  ```dockerfile
  FROM python:3.7-slim
  COPY . /app
  WORKDIR /app
  RUN pip install -r requirements.txt
  EXPOSE 9000

  # Define environment variables
  ENV MODEL_NAME MyModel
  ENV SERVICE_TYPE MODEL
  ENV PERSISTENCE 0

  CMD exec seldon-core-microservice $MODEL_NAME --service-type $SERVICE_TYPE --persistence $PERSISTENCE --access-log
  ```
- Create a `MyModel.py` file with the following example template.

  ```python
  class MyModel(object):
      """
      Model template. You can load your model parameters in __init__ from
      a location accessible at runtime.
      """

      def __init__(self):
          """
          Add any initialization parameters. These will be passed at runtime
          from the graph definition parameters defined in your seldondeployment
          kubernetes resource manifest.
          """
          print("Initializing")

      def predict(self, X, features_names=None):
          """
          Return a prediction.

          Parameters
          ----------
          X : array-like
          features_names : array of feature names (optional)
          """
          print("Predict called - will run identity function")
          return X
  ```

  - The file and class name `MyModel` should be the same as `MODEL_NAME` in the `Dockerfile`.
  - Load or initialize your model in the `__init__` function.
  - The `predict` method takes a numpy array `X` and an optional list of strings `features_names`, and returns an array of predictions (the returned array should be at least 2-dimensional).
  - For more details on how to write the Python file for model deployment with different frameworks, refer to the section *(Optional) Example Codes for Different Frameworks*. Before building the image, you can sanity-check the class locally, as sketched below.
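A minimal local smoke test, assuming the template above is saved as `MyModel.py` (the file name `test_model.py` and the sample input are hypothetical):

```python
# test_model.py - hypothetical smoke test for the MyModel template above
import numpy as np

from MyModel import MyModel

model = MyModel()
X = np.array([[5.964, 4.006, 2.081, 1.031]])  # 2-D input, as the wrapper expects
print(model.predict(X))  # the identity template returns X unchanged
```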
## Build the Image
- Make sure you are in the folder that contains `requirements.txt`, the `Dockerfile`, the Python file for model deployment, and the model file.
- Execute the following command to install the environment and package the model file into the target image `my-model-image`.

  ```bash
  docker build . -t my-model-image
  ```
- Then check the image with `docker images`.

  ```
  REPOSITORY       TAG        IMAGE ID       CREATED         SIZE
  my-model-image   latest     f373fdcc10c5   3 minutes ago   2.46GB
  python           3.7-slim   ea12296513d7   2 weeks ago     112MB
  ```
## Test the Image
- In order to make sure your model image is well packaged, you can run your model as a Docker container locally.

  ```bash
  docker run -p 9000:9000 --rm my-model-image
  ```
- Then send a request with curl (replace the `ndarray` content in the example according to your application).

  ```bash
  curl -X POST localhost:9000/api/v1.0/predictions \
    -H 'Content-Type: application/json' \
    -d '{ "data": { "ndarray": [[5.964,4.006,2.081,1.031]]}}'
  ```
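You can send the same request from Python as well. A minimal sketch using the `requests` package; the response shape in the comment is typical of the Seldon Python wrapper, but verify it against your own output:

```python
import requests

payload = {"data": {"ndarray": [[5.964, 4.006, 2.081, 1.031]]}}
resp = requests.post(
    "http://localhost:9000/api/v1.0/predictions",
    json=payload,
    timeout=10,
)
resp.raise_for_status()
# Typically something like {"data": {"names": [...], "ndarray": [[...]]}, "meta": {}}
print(resp.json())
```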
You have successfully built the Docker image for the PrimeHub model deployment.
## Push the Image
- Next, push the image to Docker Hub (or another Docker registry) and follow the PrimeHub tutorial to serve the model on PrimeHub. You may need to run `docker login` first.

  - Tag your Docker image.

    ```bash
    docker tag my-model-image test-repo/my-model-image
    ```

  - Then push it to the Docker registry.

    ```bash
    docker push test-repo/my-model-image
    ```
## (Optional) Example Codes for Different Frameworks
Here are some Python snippets showing how to export a model file, then load it and run a prediction in another file. By following the Python wrapper format, PrimeHub can serve models built with various popular ML frameworks. Remember that the Python file and class name must match `MODEL_NAME` in the `Dockerfile`, so adjust the names in the examples below accordingly.
### TensorFlow 1
- Output a model file to `model/deep_mnist_model`.

  ```python
  saver = tf.train.Saver()
  saver.save(sess, "model/deep_mnist_model")
  ```
- `MyModel.py`: load the model and run a prediction.

  ```python
  import os

  import numpy as np
  import tensorflow as tf

  class DeepMnist(object):
      def __init__(self):
          self.loaded = False

      def load(self):
          # Lazily restore the checkpoint and look up the input/output tensors.
          print("Loading model", os.getpid())
          self.sess = tf.Session()
          saver = tf.train.import_meta_graph("model/deep_mnist_model.meta")
          saver.restore(self.sess, tf.train.latest_checkpoint("./model/"))
          graph = tf.get_default_graph()
          self.x = graph.get_tensor_by_name("x:0")
          self.y = graph.get_tensor_by_name("y:0")
          self.loaded = True
          print("Loaded model")

      def predict(self, X, feature_names=None):
          if not self.loaded:
              self.load()
          predictions = self.sess.run(self.y, feed_dict={self.x: X})
          return predictions.astype(np.float64)
  ```
### TensorFlow 2
- Output a SavedModel directory named `1` (a training sketch follows this section).

  ```python
  model.save("1")
  ```
- `MyModel.py`: load the model and run a prediction.

  ```python
  import tensorflow as tf

  class MNISTModel:
      def __init__(self):
          self.loaded = False

      def load(self):
          self._model = tf.keras.models.load_model('1')
          self.loaded = True

      def predict(self, X, feature_names=None, meta=None):
          if not self.loaded:
              self.load()
          output = self._model.predict(X)
          probability = output[0]
          predicted_number = tf.math.argmax(probability)
          return {"predicted_number": predicted_number.numpy().tolist(),
                  "probability": probability.tolist()}
  ```
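For completeness, here is a sketch of how the SavedModel directory `1` might be produced; the architecture and training setup are assumptions for illustration, not part of the original example:

```python
# train_mnist.py - hypothetical training script producing the SavedModel folder "1"
import tensorflow as tf

(x_train, y_train), _ = tf.keras.datasets.mnist.load_data()
x_train = x_train / 255.0

model = tf.keras.Sequential([
    tf.keras.layers.Flatten(input_shape=(28, 28)),
    tf.keras.layers.Dense(128, activation="relu"),
    tf.keras.layers.Dense(10, activation="softmax"),
])
model.compile(optimizer="adam",
              loss="sparse_categorical_crossentropy",
              metrics=["accuracy"])
model.fit(x_train, y_train, epochs=1)

model.save("1")  # creates the SavedModel directory "1" loaded by MNISTModel above
```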
### Keras
- Output a model file `keras-mnist.h5`.

  ```python
  model.save('keras-mnist.h5')
  ```
- `MyModel.py`: load the model and run a prediction. Note that this example expects `X` to be raw image bytes; a client-side sketch follows this section.

  ```python
  from io import BytesIO

  import numpy as np
  from keras.models import load_model
  from PIL import Image

  class MyModel(object):
      def __init__(self):
          self.loaded = False

      def load(self):
          self.model = load_model('keras-mnist.h5')
          self.loaded = True

      def predict(self, X, features_names=None):
          if not self.loaded:
              self.load()
          # X is raw image bytes; decode, resize to 28x28, and convert to grayscale.
          imageStream = BytesIO(X)
          image = Image.open(imageStream).resize((28, 28)).convert('L')
          data = np.asarray(image)
          data = np.expand_dims(data, axis=0)   # batch dimension
          data = np.expand_dims(data, axis=-1)  # channel dimension
          return self.model.predict(data)
  ```
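Since this `predict` expects raw image bytes, a client can send them through Seldon's `binData` field. A minimal sketch, assuming the container is running locally on port 9000 (the file name `digit.png` is hypothetical):

```python
import base64

import requests

# Seldon's REST API accepts base64-encoded bytes in the "binData" field;
# the Python wrapper decodes them and passes raw bytes to predict() as X.
with open("digit.png", "rb") as f:
    payload = {"binData": base64.b64encode(f.read()).decode()}

resp = requests.post("http://localhost:9000/api/v1.0/predictions",
                     json=payload, timeout=10)
print(resp.json())
```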
### Scikit-learn
- Output a model file `IrisClassifier.sav` (a training sketch follows this section).

  ```python
  joblib.dump(p, "IrisClassifier.sav")
  ```
- `MyModel.py`: load the model and run a prediction.

  ```python
  import joblib  # sklearn.externals.joblib was removed in newer scikit-learn releases

  class IrisClassifier(object):
      def __init__(self):
          self.model = joblib.load('IrisClassifier.sav')

      def predict(self, X, features_names=None):
          return self.model.predict_proba(X)
  ```
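The save snippet above assumes an already-fitted estimator `p`. A minimal training sketch that would produce `IrisClassifier.sav`; the pipeline definition is an assumption for illustration:

```python
# train_iris.py - hypothetical script producing IrisClassifier.sav
import joblib
from sklearn.datasets import load_iris
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import Pipeline

X, y = load_iris(return_X_y=True)
p = Pipeline([("clf", LogisticRegression(max_iter=200))])
p.fit(X, y)
joblib.dump(p, "IrisClassifier.sav")
```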
### PyTorch
- Output a model file `mnist_cnn.pt`.

  ```python
  torch.save(model.state_dict(), "mnist_cnn.pt")
  ```
- `MyModel.py`: load the model and run a prediction.

  ```python
  import torch
  import torch.nn as nn
  import torch.nn.functional as F

  class Net(nn.Module):
      def __init__(self):
          super(Net, self).__init__()
          self.conv1 = nn.Conv2d(1, 32, 3, 1)
          self.conv2 = nn.Conv2d(32, 64, 3, 1)
          self.dropout1 = nn.Dropout2d(0.25)
          self.dropout2 = nn.Dropout2d(0.5)
          self.fc1 = nn.Linear(9216, 128)
          self.fc2 = nn.Linear(128, 10)

      def forward(self, x):
          x = self.conv1(x)
          x = F.relu(x)
          x = self.conv2(x)
          x = F.relu(x)
          x = F.max_pool2d(x, 2)
          x = self.dropout1(x)
          x = torch.flatten(x, 1)
          x = self.fc1(x)
          x = F.relu(x)
          x = self.dropout2(x)
          x = self.fc2(x)
          output = F.softmax(x, dim=1)
          return output

  class MNISTModel:
      def __init__(self):
          self._model = Net()
          self._model.load_state_dict(torch.load("mnist_cnn.pt"))
          self._model.eval()

      def predict(self, x, names=None):
          # Disable gradient tracking for inference.
          with torch.no_grad():
              output = self._model(torch.from_numpy(x).float())
          return {"probability": output.tolist()}
  ```
### XGBoost
- Output a model file `xgboost.model` (a training sketch follows this section).

  ```python
  bst = xgb.train(...)
  bst.save_model('xgboost.model')
  ```
- `MyModel.py`: load the model and run a prediction.

  ```python
  import xgboost as xgb

  class MyModel(object):
      def __init__(self):
          self.bst = xgb.Booster({'nthread': 4})
          self.bst.load_model("xgboost.model")

      def predict(self, X, features_names=None):
          dtest = xgb.DMatrix(X)
          return self.bst.predict(dtest)
  ```
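A minimal sketch of a training script that would produce `xgboost.model`; the dataset and parameters are assumptions for illustration:

```python
# train_xgb.py - hypothetical script producing xgboost.model
import xgboost as xgb
from sklearn.datasets import load_iris

X, y = load_iris(return_X_y=True)
dtrain = xgb.DMatrix(X, label=y)
params = {"objective": "multi:softprob", "num_class": 3, "nthread": 4}
bst = xgb.train(params, dtrain, num_boost_round=10)
bst.save_model("xgboost.model")
```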
### MXNet
- Output model checkpoint files prefixed with `mx-model`.

  ```python
  model_prefix = 'mx-model'
  checkpoint = mx.callback.do_checkpoint(model_prefix)
  mod.fit(..., epoch_end_callback=checkpoint)
  ```
- `MyModel.py`: load the model and run a prediction.

  ```python
  from io import BytesIO

  import mxnet as mx
  import numpy as np
  from PIL import Image

  class MyModel(object):
      def __init__(self):
          model_prefix = 'mx-model'
          epoch_num = 2
          self.model = mx.mod.Module.load(model_prefix, epoch_num)
          data_shape = [("data", (1, 28, 28, 1))]
          label_shape = [("softmax_label", (1,))]
          self.model.bind(data_shape, label_shape)

      def predict(self, X, features_names=None):
          # X is raw image bytes; decode, resize to 28x28, and convert to grayscale.
          imageStream = BytesIO(X)
          image = Image.open(imageStream).resize((28, 28)).convert('L')
          data = np.asarray(image)
          data = np.expand_dims(data, axis=0)
          data = np.expand_dims(data, axis=-1)
          return self.model.predict(data).asnumpy()
  ```
### LightGBM
- Output a model file `model.pkl` (a training sketch follows this section).

  ```python
  gbm = lgb.train(...)
  with open('model.pkl', 'wb') as fout:
      pickle.dump(gbm, fout)
  ```
- `MyModel.py`: load the model and run a prediction.

  ```python
  import pickle

  class MyModel(object):
      def __init__(self):
          with open('model.pkl', 'rb') as fin:
              self.pkl_bst = pickle.load(fin)

      def predict(self, X, features_names=None):
          return self.pkl_bst.predict(X)
  ```
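Similarly, a minimal sketch of producing `model.pkl`; the dataset and parameters are assumptions for illustration:

```python
# train_lgb.py - hypothetical script producing model.pkl
import pickle

import lightgbm as lgb
from sklearn.datasets import load_iris

X, y = load_iris(return_X_y=True)
train_data = lgb.Dataset(X, label=y)
params = {"objective": "multiclass", "num_class": 3, "verbosity": -1}
gbm = lgb.train(params, train_data, num_boost_round=10)

with open("model.pkl", "wb") as fout:
    pickle.dump(gbm, fout)
```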
## Reference
- https://docs.seldon.io/projects/seldon-core/en/latest/python/python_wrapping_docker.html
- https://github.com/SeldonIO/seldon-core/tree/master/examples
- https://docs.seldon.io/projects/seldon-core/en/latest/wrappers/language_wrappers.html
- https://docs.seldon.io/projects/seldon-core/en/latest/python/python_component.html
- https://docs.seldon.io/projects/seldon-core/en/latest/workflow/serving.html
