Serve a MobileNet image classifier on Kubernetes#
Note: The Python files for the Ray Serve application and its client are in the repository ray-project/serve_config_examples.
Step 1: Create a Kubernetes cluster with Kind#
kind create cluster --image=kindest/node:v1.26.0
Step 2: Install KubeRay operator#
Follow this document to install the latest stable KubeRay operator from the Helm repository.
Note that the YAML file in this example uses serveConfigV2. You need KubeRay version v0.6.0 or later to use this feature.
Step 3: Install a RayService#
# Create a RayService
kubectl apply -f https://raw.githubusercontent.com/ray-project/kuberay/v1.3.0/ray-operator/config/samples/ray-service.mobilenet.yaml
The mobilenet.py file needs
tensorflowas a dependency. Hence, the YAML file usesrayproject/ray-mlimage instead ofrayproject/rayimage.The request parsing function
starlette.requests.form()needspython-multipart, so the YAML file includespython-multipartin the runtime environment.
Step 4: Forward the port for Ray Serve#
# Wait for the RayService to be ready to serve requests
kubectl describe rayservice/rayservice-mobilenet
# Conditions:
# Last Transition Time: 2025-02-13T02:29:26Z
# Message: Number of serve endpoints is greater than 0
# Observed Generation: 1
# Reason: NonZeroServeEndpoints
# Status: True
# Type: Ready
# Forward the port for Ray Serve service
kubectl port-forward svc/rayservice-mobilenet-serve-svc 8000
Step 5: Send a request to the ImageClassifier#
Step 5.1: Prepare an image file.
Step 5.2: Update
image_pathin mobilenet_req.pyStep 5.3: Send a request to the
ImageClassifier.python mobilenet_req.py # sample output: {"prediction":["n02099601","golden_retriever",0.17944198846817017]}