In this video, we’ll walk through building a machine learning pipeline with Kubeflow and deploying the trained model to KServe with an InferenceService.
Prerequisites:
- Docker Desktop (on Mac & Windows) or Docker Engine (on Linux)
- Kubectl
- Minikube
- Python Virtual Environment (venv) - optional
- kfp
- AWS IAM User (with S3 privileges)
- AWS S3 Bucket
- Custom Domain Name (DNS)
- Helm
EC2 instance: t3.2xlarge (8 vCPUs, 32 GiB memory, 80 GiB EBS storage) -> ~$0.33 per hour
chmod 600 <keypair>
ssh -i <keypair> ubuntu@<PublicIP>

Open these inbound ports in the instance's security group:
- Port 22: SSH
- Port 80, 443: http & https
- Port 8443: Kubernetes API
- Port 8080: Kubeflow Dashboard
- Port 30000-32767: Kubernetes NodePort Service
- Port 5000, 8081, 9000: KServe Model Serving
- Port 31390: KServe Inference
- Port 31380: Kubeflow Ingress Gateway
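Instead of clicking through the console, the inbound rules above can be scripted. A minimal sketch that only *prints* the AWS CLI commands to run (the security group ID is a placeholder; substitute your instance's actual group ID, and tighten the CIDR if you don't want the ports world-open):

```python
# Sketch: generate the AWS CLI commands to open the inbound ports listed above.
# SG_ID is a placeholder -- substitute your instance's security group ID.
SG_ID = "sg-0123456789abcdef0"

# Single ports and the NodePort range from the list above
PORTS = ["22", "80", "443", "8443", "8080", "5000", "8081", "9000",
         "31390", "31380", "30000-32767"]

def ingress_cmd(port: str) -> str:
    # Builds one `aws ec2 authorize-security-group-ingress` invocation
    return (f"aws ec2 authorize-security-group-ingress --group-id {SG_ID} "
            f"--protocol tcp --port {port} --cidr 0.0.0.0/0")

for p in PORTS:
    print(ingress_cmd(p))
```

Pipe the output into a shell (or paste the lines) once you've verified the group ID and CIDR.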
sudo apt update && sudo apt upgrade -y
sudo apt-get install docker.io -y
sudo groupadd docker
sudo usermod -aG docker $USER
newgrp docker
sudo snap install kubectl --classic
kubectl version --client # Verify installation
python3 --version
pip --version
sudo apt install python3-pip -y
sudo apt install python3.12-venv -y
python3 -m venv path/to/venv
path/to/venv/bin/python --version
path/to/venv/bin/pip --version
source path/to/venv/bin/activate
curl -LO https://github.com/kubernetes/minikube/releases/latest/download/minikube-linux-amd64
sudo install minikube-linux-amd64 /usr/local/bin/minikube && rm minikube-linux-amd64
minikube version
minikube start --cpus=4 --memory=10240 --driver=docker
kubectl get nodes
kubectl cluster-info
which kfp
path/to/venv/bin/pip install kfp
export PIPELINE_VERSION=2.4.0
kubectl apply -k "github.com/kubeflow/pipelines/manifests/kustomize/cluster-scoped-resources?ref=$PIPELINE_VERSION"
kubectl wait --for condition=established --timeout=60s crd/applications.app.k8s.io
kubectl apply -k "github.com/kubeflow/pipelines/manifests/kustomize/env/platform-agnostic?ref=$PIPELINE_VERSION"
kubectl get all -n kubeflow
kubectl port-forward -n kubeflow svc/ml-pipeline-ui 8080:80
kubectl port-forward -n kubeflow svc/ml-pipeline-ui 8080:80 & # or run it in the background
cd Downloads
ssh -i <keypair> -L 8080:localhost:8080 -N ubuntu@<Public_ip>
Open localhost:8080 in a browser.
source path/to/venv/bin/activate
touch pipeline.py
path/to/venv/bin/python pipeline.py
path/to/venv/bin/kfp pipeline create -p IrisProject pipeline.yaml

Create an AWS IAM user:
- IAM -> Create User -> Attach Policy directly (AmazonS3FullAccess) -> Create User
- Click on Newly Created User -> Create Access Keys -> CLI -> Create Access Keys -> Download .csv file -> Done
- Configure the AWS user on your local terminal with the Access Key ID & Secret Access Key using "aws configure"
Create an S3 bucket on AWS (or use an existing one): S3 -> Create bucket -> General purpose -> Name: kubeflow-bucket-iquant01 -> Keep all default selections -> Create bucket
Click on the pipeline name -> Create run -> Provide all the details, including:
- Access key ID, Secret access key, S3 bucket name, S3 key
Start the pipeline.
Inspect the bucket for the trained ML model (.joblib file)
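After downloading the .joblib file from the bucket, you can sanity-check it locally. A sketch assuming a scikit-learn model trained on the iris dataset (trained inline here so the snippet runs standalone, with no S3 access):

```python
# Sanity-check a .joblib iris model the way you would after downloading it from S3.
# The model is trained inline here so the sketch runs without AWS credentials.
import joblib
from sklearn.datasets import load_iris
from sklearn.linear_model import LogisticRegression

X, y = load_iris(return_X_y=True)
joblib.dump(LogisticRegression(max_iter=500).fit(X, y), "model.joblib")

model = joblib.load("model.joblib")  # with S3, this would be the downloaded file
preds = model.predict([[6.8, 2.8, 4.8, 1.4], [6.0, 3.4, 4.5, 1.6]])
print(preds)  # one iris class index (0, 1, or 2) per input row
```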
sudo snap install helm --classic
curl -s "https://raw.githubusercontent.com/kserve/kserve/release-0.14/hack/quick_install.sh" | bash
kubectl get pods -n kserve
kubectl get pods -n istio-system
kubectl get pods -n knative-serving
minikube addons enable ingress
kubectl get pods -n ingress-nginx
minikube tunnel
minikube ip
kubectl get svc istio-ingressgateway -n istio-system
export INGRESS_HOST=$(kubectl -n istio-system get service istio-ingressgateway -o jsonpath='{.status.loadBalancer.ingress[0].ip}')
export INGRESS_PORT=$(kubectl -n istio-system get service istio-ingressgateway -o jsonpath='{.spec.ports[?(@.name=="http2")].port}')
kubectl create namespace kserve-test
kubectl apply -n kserve-test -f - <<EOF
apiVersion: "serving.kserve.io/v1beta1"
kind: "InferenceService"
metadata:
  name: "sklearn-iris"
spec:
  predictor:
    model:
      modelFormat:
        name: sklearn
      storageUri: "gs://kfserving-examples/models/sklearn/1.0/model"
EOF
kubectl get inferenceservices sklearn-iris -n kserve-test
kubectl get svc istio-ingressgateway -n istio-system
kubectl get isvc sklearn-iris -n kserve-test
cat <<EOF > "./iris-input.json"
{
  "instances": [
    [6.8, 2.8, 4.8, 1.4],
    [6.0, 3.4, 4.5, 1.6]
  ]
}
EOF
SERVICE_HOSTNAME=$(kubectl get inferenceservice sklearn-iris -n kserve-test -o jsonpath='{.status.url}' | cut -d "/" -f 3)
curl -v -H "Host: ${SERVICE_HOSTNAME}" -H "Content-Type: application/json" "http://${INGRESS_HOST}:${INGRESS_PORT}/v1/models/sklearn-iris:predict" -d @./iris-input.json

Ctrl + C -> to exit minikube tunnel
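The same predict call can be made from Python instead of curl. A sketch using only the standard library, assuming the INGRESS_HOST/INGRESS_PORT and service hostname resolved in the steps above (the request itself needs the cluster and minikube tunnel running, so it's wrapped in a function):

```python
# Sketch of the :predict call from Python, mirroring the curl command above.
# Calling predict() requires the cluster and `minikube tunnel` to be running.
import json
import urllib.request

# Same body as iris-input.json
payload = {"instances": [[6.8, 2.8, 4.8, 1.4], [6.0, 3.4, 4.5, 1.6]]}

def predict(ingress_host: str, ingress_port: str, service_hostname: str) -> dict:
    req = urllib.request.Request(
        f"http://{ingress_host}:{ingress_port}/v1/models/sklearn-iris:predict",
        data=json.dumps(payload).encode(),
        # The Host header routes the request through the Istio gateway,
        # exactly like `-H "Host: ..."` in the curl command.
        headers={"Host": service_hostname, "Content-Type": "application/json"},
    )
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)  # KServe v1 protocol returns {"predictions": [...]}
```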
kubectl delete isvc sklearn-iris -n kserve-test
minikube stop
minikube delete --all