Scaling [--solutionname--] With Kubernetes

Generated On: --datetime-- UTC

You can scale your solution with Kubernetes. To do so, will will need to apply the following YAML files to your Kubernetes cluster.

Tip

Refer to TML documentation for more information on scaling with Kubernetes.

You can also run the YAML files locally in a 1 node kubernetes cluster called minikube. Refer to Installing minikube

Important

Below assumes you have a Kubernetes cluster and kubectl installed in your Linux environment.

Attention

Make sure to STOP the TSS Container and other containers before running Kubernetes/Minikube.

If you get the following WARNING from Kubernetes:

Warning FailedScheduling default-scheduler 0/1 nodes are available: 1 Insufficient nvidia.com/gpu. preemption: 0/1 nodes are available: 1 No preemption victims found for incoming pod. Make sure no other application is using the GPU. You can check by executing in terminal the command: nvidia-smi

Oherwise, Issue the commands below:

sudo apt update && sudo apt install -y nvidia-docker2

sudo nvidia-ctk runtime configure --runtime=docker

sudo systemctl restart docker

Based on your TML solution [--solutionname--] - if you want to scale your application with Kubernetes - you will need to apply the following YAML files.

YML File	Description
--solutionnamefile--	This is your main solution YAML file. It MUST be applied to your Kubernetes cluster.
mysql-storage.yml	This is storage allocation for MySQL DB. It MUST be applied to your Kubernetes cluster.
mysql-db-deployment.yml	This is the MySQL deployment YAML. It MUST be applied to your Kubernetes cluster.
privategpt.yml	This is the privateGPT deployment YAML. This is OPTIONAL. However, it must be applied if using Step 9 DAG.
qdrant.yml	This is the Qdrant deployment YAML. This is OPTIONAL. However, it must be applied if using Step 9 DAG.

kubectl Create command

Important

To apply the YAML files below to your Kubernetes cluster simply run this command:

--kubectl--

--solutionnamefile--

Important

Copy and Paste this YAML file: --solutionnamefile-- - and save it locally.

Attention

MAKE SURE to update any tokens and passwords in these fields:

name: GITPASSWORD (MANDATORY)
value: '<ENTER GITHUB PASSWORD>'

name: READTHEDOCS (MANDATORY)
value: '<ENTER READTHEDOCS TOKEN>'

name: KAFKACLOUDPASSWORD (OPTIONAL)
value: '<Enter API secret>'

name: MQTTPASSWORD (OPTIONAL)
value: '<ENTER MQTT PASSWORD>'

################# --solutionnamefile--
--solutionnamecode--

Tip

In the solution YAML file above, you can adjust the replicas field. Currently, replicas: 3 for demonstration purposes.

mysql-storage.yml

Important

Copy and Paste this YAML file: mysql-storage.yml - and save it locally.

################# mysql-storage.yml
apiVersion: v1
kind: PersistentVolume
metadata:
  name: mysql-pv-volume
  labels:
    type: local
spec:
  storageClassName: manual
  capacity:
    storage: 20Gi
  accessModes:
    - ReadWriteOnce
  hostPath:
    path: "/mnt/data"
---
apiVersion: v1
kind: PersistentVolumeClaim
metadata:
  name: mysql-pv-claim
spec:
  storageClassName: manual
  accessModes:
    - ReadWriteOnce
  resources:
    requests:
      storage: 20Gi

mysql-db-deployment.yml

Important

Copy and Paste this YAML file: mysql-db-deployment.yml - and save it locally.

################# mysql-db-deployment.yml
apiVersion: apps/v1
kind: Deployment
metadata:
  name: mysql
spec:
  selector:
    matchLabels:
      app: mysql
  strategy:
    type: Recreate
  template:
    metadata:
      labels:
        app: mysql
    spec:
      containers:
      - image: maadsdocker/mysql:latest
        name: mysql
        env:
        - name: MYSQL_ROOT_PASSWORD
          value: "raspberry"
        - name: MYSQLDB
          value: "tmlids"
        - name: MYSQLDRIVERNAME
          value: "mysql"
        - name: MYSQLHOSTNAME
          value: "mysql:3306"
        - name: MYSQLMAXCONN
          value: "4"
        - name: MYSQLMAXIDLE
          value: "10"
        - name: MYSQLPASS
          value: "raspberry"
        - name: MYSQLUSER
          value: "root"
        ports:
        - containerPort: 3306
          name: mysql
        volumeMounts:
        - name: mysql-persistent-storage
          mountPath: /var/lib/mysql
      volumes:
      - name: mysql-persistent-storage
        persistentVolumeClaim:
          claimName: mysql-pv-claim

---
apiVersion: v1
kind: Service
metadata:
  name: mysql-service
spec:
  ports:
  - port: 3306
  selector:
    app: mysql

privategpt.yml

Note

This YAML is Optional - Use Only If Step 9 Dag is used

Important

Copy and Paste this YAML file: privategpt.yml - and save it locally.

Note

By default this assumes you have a Nvidia GPU in your machine and so it using the Nvidia privateGPT container:

image: maadsdocker/tml-privategpt-with-gpu-nvidia-amd64

if you DO NOT have a Nvidia GPU installed then change image to:

image: maadsdocker/tml-privategpt-no-gpu-amd64

################# privategpt.yml
apiVersion: apps/v1
kind: Deployment
metadata:
  name: privategpt
spec:
  selector:
    matchLabels:
      app: privategpt
  replicas: 1 # tells deployment to run 1 pods matching the template
  template:
    metadata:
      labels:
        app: privategpt
    spec:
      containers:
      - name: privategpt
        image: maadsdocker/tml-privategpt-with-gpu-nvidia-amd64 # IF you DO NOT have NVIDIA GPU use: maadsdocker/tml-privategpt-no-gpu-amd64
        env:
        - name: NVIDIA_VISIBLE_DEVICES
          value: all
        - name: DP_DISABLE_HEALTHCHECKS
          value: xids
        - name: WEB_CONCURRENCY
          value: "3"
        - name: GPU
          value: "1"
        - name: COLLECTION
          value: "tml"
        - name: PORT
          value: "8001"
        - name: CUDA_VISIBLE_DEVICES
          value: "0"
        - name: TSS
          value: "0"
        - name: KUBE
          value: "1"
        resources:             # REMOVE or COMMENT OUT: IF you DO NOT have NVIDIA GPU
          limits:              # REMOVE or COMMENT OUT: IF you DO NOT have NVIDIA GPU
            nvidia.com/gpu: 1  # REMOVE or COMMENT OUT: IF you DO NOT have NVIDIA GPU
        ports:
        - containerPort: 8001
      tolerations:             # REMOVE or COMMENT OUT: IF you DO NOT have NVIDIA GPU
      - key: nvidia.com/gpu    # REMOVE or COMMENT OUT: IF you DO NOT have NVIDIA GPU
        operator: Exists       # REMOVE or COMMENT OUT: IF you DO NOT have NVIDIA GPU
        effect: NoSchedule     # REMOVE or COMMENT OUT: IF you DO NOT have NVIDIA GPU
---
apiVersion: v1
kind: Service
metadata:
  name: privategpt-service
  labels:
    app: privategpt-service
spec:
  type: NodePort #Exposes the service as a node ports
  ports:
  - port: 8001
    name: p1
    protocol: TCP
    targetPort: 8001
  selector:
    app: privategpt

qdrant.yml

Note

This YAML is Optional - Use Only If Step 9 Dag is used

Important

Copy and Paste this YAML file: qdrant.yml - and save it locally.

################# qdrant.yml
apiVersion: apps/v1
kind: Deployment
metadata:
  name: qdrant
spec:
  selector:
    matchLabels:
      app: qdrant
  replicas: 1
  template:
    metadata:
      labels:
        app: qdrant
    spec:
      #hostNetwork: true
      containers:
      - name: qdrant
        image: qdrant/qdrant
        ports:
        - containerPort: 6333
        volumeMounts:
        - mountPath: /qdrant/storage
          name: qdata
      volumes:
      - name: qdata
        hostPath:
          path: /qdrant_storage
---
apiVersion: v1
kind: Service
metadata:
  name: qdrant-service
  labels:
    app: qdrant-service
spec:
  type: NodePort #Exposes the service as a node ports
  ports:
  - port: 6333
    name: p1
    protocol: TCP
    targetPort: 6333
  selector:
    app: qdrant

Tip

The number of replicas can be changed in the cybersecuritywithprivategpt-3f10.yml file: look for replicas. You can increase or decrease the number of replicas based on the amout of real-time data you are processing.

Kubernetes Dashboard Visualization

To visualize the dashboard you need to forward ports to your solution deployment in Kubernetes. For this solution, the port forward command would be:

--kube-portforward--

After you forward the ports then copy/paste the viusalization URL below and run your dashboard.

--visualizationurl--

Kubernetes Pod Access Commands

To go inside the pods, you can type command:

kubectl exec -it <pod name> -- bash

Note: replace <pod name> with actual pod name..use this command to get the pod name

kubectl get pods -A

To list service pods type:

kubectl get svc -A

To list deployment pods type:

kubectl get deployments -A

To Horizontally AUTO-SCALE Deployments type:

kubectl autoscale deployment  <deployment name> --cpu-percent=50 --min=1 --max=100

Important

The above command instructs Kubernetes to scale pods based on 50% CPU utilization to a minimum number of pods of 1 (small workload) to a maximum of 100 pods for large world loads. Of course, you can easily change these min and max numbers.

This auto-scaling is very important to scale up and down your solution, while efficiently managing cloud computing costs.

To list deployments being auto-scaled type:

kubectl get hpa -A

To delete the pods:

kubectl delete all --all --all-namespaces

To get information on a pod type:

kubectl describe pod <pod name>

Start minikube with NVIDIA GPU Access:

minikube start --driver docker --container-runtime docker --gpus all --cni calico --memory 8192

Note

Note you may need to type: ./minikube

Start minikube with NO GPU:

minikube start --driver docker --container-runtime docker --cni calico --memory 8192

DELETE minikube:

minikube delete

Tip

Adjust the --memory 8192 as needed.