diff --git a/helm-charts/codegen-openshift-rhoai/Chart.yaml b/helm-charts/codegen-openshift-rhoai/Chart.yaml new file mode 100644 index 000000000..5f85bcc99 --- /dev/null +++ b/helm-charts/codegen-openshift-rhoai/Chart.yaml @@ -0,0 +1,15 @@ +# Copyright (C) 2024 Intel Corporation +# SPDX-License-Identifier: Apache-2.0 + +apiVersion: v2 +name: codegen +description: A Helm chart for deploying codegen on Red Hat OpenShift with Red Hat OpenShift AI +dependencies: + - name: llm-uservice + version: 1.0.0 + repository: "file://llm-uservice" + - name: react-ui + version: 0.1.0 + repository: "file://react-ui" +type: application +version: 1.0.0 diff --git a/helm-charts/codegen-openshift-rhoai/README.md b/helm-charts/codegen-openshift-rhoai/README.md new file mode 100644 index 000000000..90394f2ff --- /dev/null +++ b/helm-charts/codegen-openshift-rhoai/README.md @@ -0,0 +1,124 @@ +# CodeGen + +Helm chart for deploying CodeGen service on Red Hat OpenShift with Red Hat OpenShift AI. + +Serving runtime template in this example uses model _ise-uiuc/Magicoder-S-DS-6.7B_ for Xeon and _meta-llama/CodeLlama-7b-hf_ for Gaudi. + +## Prerequisites + +1. **Red Hat OpenShift Cluster** with dynamic _StorageClass_ to provision _PersistentVolumes_ e.g. **OpenShift Data Foundation**) and installed Operators: **Red Hat - Authorino (Technical Preview)**, **Red Hat OpenShift Service Mesh**, **Red Hat OpenShift Serverless** and **Red Hat Openshift AI**. +2. Image registry to push there docker images (https://docs.openshift.com/container-platform/4.16/registry/securing-exposing-registry.html). +3. Access to S3-compatible object storage bucket (e.g. **OpenShift Data Foundation**, **AWS S3**) and values of access and secret access keys and S3 endpoint (https://docs.redhat.com/en/documentation/red_hat_openshift_data_foundation/4.16/html/managing_hybrid_and_multicloud_resources/accessing-the-multicloud-object-gateway-with-your-applications_rhodf#accessing-the-multicloud-object-gateway-with-your-applications_rhodf). +4. Account on https://huggingface.co/, access to model _ise-uiuc/Magicoder-S-DS-6.7B_ (for Xeon) or _meta-llama/CodeLlama-7b-hf_ (for Gaudi) and token with Read permissions. + +## Deploy model in Red Hat Openshift AI + +1. Login to OpenShift CLI and run following commands to create new serving runtime and _hf-token_ secret. + +``` +cd GenAIInfra/helm-charts/codegen-openshift-rhoai/ +export HFTOKEN="insert-your-huggingface-token-here" + +On Xeon: +helm install servingruntime tgi --set global.huggingfacehubApiToken=${HFTOKEN} + +On Gaudi: +helm install servingruntime tgi --set global.huggingfacehubApiToken=${HFTOKEN} --values tgi/gaudi-values.yaml +``` + +Verify if template has been created with `oc get template -n redhat-ods-applications` command. + +2. Find the route for **Red Hat OpenShift AI** dashboard with below command and open it in the browser: + +``` +oc get routes -A | grep rhods-dashboard +``` + +3. Go to **Data Science Project** and click **Create data science project**. Fill the **Name** and click **Create**. +4. Go to **Workbenches** tab and click **Create workbench**. Fill the **Name**, under **Notebook image** choose _Standard Data Science_, under **Cluster storage** choose _Create new persistent storage_ and change **Persistent storage size** to 40 GB. Click **Create workbench**. +5. Open newly created Jupiter notebook and run following commands to download the model and upload it on s3: + +``` +%env S3_ENDPOINT= +%env S3_ACCESS_KEY= +%env S3_SECRET_KEY= +%env HF_TOKEN= +``` + +``` +!pip install huggingface-hub +``` + +``` +import os +import boto3 +import botocore +import glob +from huggingface_hub import snapshot_download +bucket_name = 'first.bucket' +s3_endpoint = os.environ.get('S3_ENDPOINT') +s3_accesskey = os.environ.get('S3_ACCESS_KEY') +s3_secretkey = os.environ.get('S3_SECRET_KEY') +path = 'models' +hf_token = os.environ.get('HF_TOKEN') +session = boto3.session.Session() +s3_resource = session.resource('s3', + endpoint_url=s3_endpoint, + verify=False, + aws_access_key_id=s3_accesskey, + aws_secret_access_key=s3_secretkey) +bucket = s3_resource.Bucket(bucket_name) +``` + +For Xeon download _ise-uiuc/Magicoder-S-DS-6.7B_: + +``` +snapshot_download("ise-uiuc/Magicoder-S-DS-6.7B", cache_dir=f'./models', token=hf_token) +``` + +For Gaudi download _meta-llama/CodeLlama-7b-hf_: + +``` +snapshot_download("meta-llama/CodeLlama-7b-hf", cache_dir=f'./models', token=hf_token) +``` + +Upload the downloaded model to S3: + +``` +files = (file for file in glob.glob(f'{path}/**/*', recursive=True) if os.path.isfile(file) and "snapshots" in file) +for filename in files: + s3_name = filename.replace(path, '') + print(f'Uploading: {filename} to {path}{s3_name}') + bucket.upload_file(filename, f'{path}{s3_name}') +``` + +6. Go to your project in **Red Hat OpenShift AI** dashboard, then "Models" tab and click **Deploy model** under _Single-model serving platform_. Fill the **Name**, choose newly created **Serving runtime**: _Text Generation Inference Magicoder-S-DS-6.7B on CPU_ (for Xeon) or _Text Generation Inference CodeLlama-7b-hf on Gaudi_ (for Gaudi), **Model framework**: _llm_ and change **Model server size** to _Custom_: 16 CPUs and 64 Gi memory. For deployment with Gaudi select proper **Accelerator**. Click the checkbox to create external route in **Model route** section and uncheck the **Token authentication**. Under **Model location** choose _New data connection_ and fill all required fields for s3 access, **Bucket** _first.bucket_ and **Path**: _models_. Click **Deploy**. It takes about 10 minutes to get _Loaded_ status.\ + If it's not going to _Loaded_ status and revision changed status to "ProgressDeadlineExceeded" (`oc get revision`), scale model deployment to 0 and than to 1 with command `oc scale deployment.apps/ --replicas=1` and wait about 10 minutes for deployment. + +## Install the Chart + +To install the chart, back to OpenShift CLI, go to your project and run the following: + +```console +cd GenAIInfra/helm-charts/ + +export NAMESPACE="insert-your-namespace-here" +export CLUSTERDOMAIN="$(oc get Ingress.config.openshift.io/cluster -o jsonpath='{.spec.domain}' | sed 's/^apps.//')" +export MODELNAME="insert-name-of-deployed-model-here" (it refers to the *Name* from step 6 in **Deploy model in Red Hat Openshift AI**) +export PROJECT="insert-project-name-where-model-is-deployed" + +sed -i "s/insert-your-namespace-here/${NAMESPACE}/g" codegen-openshift-rhoai/llm-uservice/values.yaml + +./update_dependency.sh +helm dependency update codegen-openshift-rhoai + +helm install codegen codegen-openshift-rhoai --set image.repository=image-registry.openshift-image-registry.svc:5000/${NAMESPACE}/codegen --set llm-uservice.image.repository=image-registry.openshift-image-registry.svc:5000/${NAMESPACE}/llm-tgi --set react-ui.image.repository=image-registry.openshift-image-registry.svc:5000/${NAMESPACE}/react-ui --set global.clusterDomain=${CLUSTERDOMAIN} --set global.huggingfacehubApiToken=${HFTOKEN} --set llm-uservice.servingRuntime.name=${MODELNAME} --set llm-uservice.servingRuntime.namespace=${PROJECT} +``` + +## Verify + +To verify the installation, run the command `oc get pods` to make sure all pods are running. Wait about 5 minutes for building images. When 4 pods achieve _Completed_ status, the rest with services should go to _Running_. + +## Launch the UI + +To access the frontend, find the route for _react-ui_ with command `oc get routes` and open it in the browser. diff --git a/helm-charts/codegen-openshift-rhoai/llm-uservice/Chart.yaml b/helm-charts/codegen-openshift-rhoai/llm-uservice/Chart.yaml new file mode 100644 index 000000000..10178e8ce --- /dev/null +++ b/helm-charts/codegen-openshift-rhoai/llm-uservice/Chart.yaml @@ -0,0 +1,8 @@ +# Copyright (C) 2024 Intel Corporation +# SPDX-License-Identifier: Apache-2.0 + +apiVersion: v2 +name: llm-uservice +description: A Helm chart for deploying llm-uservice on Red Hat OpenShift with Red Hat OpenShift AI +type: application +version: 1.0.0 diff --git a/helm-charts/codegen-openshift-rhoai/llm-uservice/templates/_helpers.tpl b/helm-charts/codegen-openshift-rhoai/llm-uservice/templates/_helpers.tpl new file mode 100644 index 000000000..726fca204 --- /dev/null +++ b/helm-charts/codegen-openshift-rhoai/llm-uservice/templates/_helpers.tpl @@ -0,0 +1,51 @@ +{{/* +Expand the name of the chart. +*/}} +{{- define "llm-uservice.name" -}} +{{- default .Chart.Name .Values.nameOverride | trunc 63 | trimSuffix "-" }} +{{- end }} + +{{/* +Create a default fully qualified app name. +We truncate at 63 chars because some Kubernetes name fields are limited to this (by the DNS naming spec). +If release name contains chart name it will be used as a full name. +*/}} +{{- define "llm-uservice.fullname" -}} +{{- if .Values.fullnameOverride }} +{{- .Values.fullnameOverride | trunc 63 | trimSuffix "-" }} +{{- else }} +{{- $name := default .Chart.Name .Values.nameOverride }} +{{- if contains $name .Release.Name }} +{{- .Release.Name | trunc 63 | trimSuffix "-" }} +{{- else }} +{{- printf "%s-%s" .Release.Name $name | trunc 63 | trimSuffix "-" }} +{{- end }} +{{- end }} +{{- end }} + +{{/* +Create chart name and version as used by the chart label. +*/}} +{{- define "llm-uservice.chart" -}} +{{- printf "%s-%s" .Chart.Name .Chart.Version | replace "+" "_" | trunc 63 | trimSuffix "-" }} +{{- end }} + +{{/* +Common labels +*/}} +{{- define "llm-uservice.labels" -}} +helm.sh/chart: {{ include "llm-uservice.chart" . }} +{{ include "llm-uservice.selectorLabels" . }} +{{- if .Chart.AppVersion }} +app.kubernetes.io/version: {{ .Chart.AppVersion | quote }} +{{- end }} +app.kubernetes.io/managed-by: {{ .Release.Service }} +{{- end }} + +{{/* +Selector labels +*/}} +{{- define "llm-uservice.selectorLabels" -}} +app.kubernetes.io/name: {{ include "llm-uservice.name" . }} +app.kubernetes.io/instance: {{ .Release.Name }} +{{- end }} diff --git a/helm-charts/codegen-openshift-rhoai/llm-uservice/templates/buildconfig.yaml b/helm-charts/codegen-openshift-rhoai/llm-uservice/templates/buildconfig.yaml new file mode 100644 index 000000000..9b3d0631d --- /dev/null +++ b/helm-charts/codegen-openshift-rhoai/llm-uservice/templates/buildconfig.yaml @@ -0,0 +1,30 @@ +# Copyright (C) 2024 Intel Corporation +# SPDX-License-Identifier: Apache-2.0 + +kind: BuildConfig +apiVersion: build.openshift.io/v1 +metadata: + name: {{ include "llm-uservice.fullname" . }} + namespace: {{ .Release.Namespace }} +spec: + output: + to: + kind: "ImageStreamTag" + name: "llm-tgi:latest" + failedBuildsHistoryLimit: 5 + successfulBuildsHistoryLimit: 5 + nodeSelector: null + postCommit: {} + resources: {} + runPolicy: SerialLatestOnly + source: + git: + ref: {{ .Values.source.gitRef }} + uri: {{ .Values.source.gitUri }} + type: {{ .Values.source.type }} + strategy: + type: {{ .Values.strategy.type }} + dockerStrategy: + dockerfilePath: {{ .Values.strategy.dockerfilePath }} + triggers: + - type: ConfigChange diff --git a/helm-charts/codegen-openshift-rhoai/llm-uservice/templates/configmap.yaml b/helm-charts/codegen-openshift-rhoai/llm-uservice/templates/configmap.yaml new file mode 100644 index 000000000..b8097cf47 --- /dev/null +++ b/helm-charts/codegen-openshift-rhoai/llm-uservice/templates/configmap.yaml @@ -0,0 +1,17 @@ +# Copyright (C) 2024 Intel Corporation +# SPDX-License-Identifier: Apache-2.0 + +apiVersion: v1 +kind: ConfigMap +metadata: + name: create-rhoai-istio-rootca-cert-secret +data: + create-rhoai-istio-rootca-cert-secret.sh: | + #!/bin/bash + EXISTS=$(oc get secret --ignore-not-found rhoai-ca-bundle) + + if [[ -z "${EXISTS}" ]]; then + oc create secret generic -n {{ .Release.Namespace }} rhoai-ca-bundle --from-literal=tls.crt="$(oc extract secret/knative-serving-cert -n istio-system --to=- --keys=tls.crt)" + else + echo "oc get secret --ignore-not-found rhoai-ca-bundle returned non-empty string, not creating a secret" + fi diff --git a/helm-charts/codegen-openshift-rhoai/llm-uservice/templates/deployment.yaml b/helm-charts/codegen-openshift-rhoai/llm-uservice/templates/deployment.yaml new file mode 100644 index 000000000..17fb4e465 --- /dev/null +++ b/helm-charts/codegen-openshift-rhoai/llm-uservice/templates/deployment.yaml @@ -0,0 +1,66 @@ +# Copyright (C) 2024 Intel Corporation +# SPDX-License-Identifier: Apache-2.0 + +apiVersion: apps/v1 +kind: Deployment +metadata: + name: {{ include "llm-uservice.fullname" . }} + labels: + {{- include "llm-uservice.labels" . | nindent 4 }} + annotations: + image.openshift.io/triggers: '[{"from":{"kind":"ImageStreamTag","name":"llm-tgi:latest"},"fieldPath":"spec.template.spec.containers[?(@.name==\"codegen\")].image"}]' +spec: + replicas: 1 + selector: + matchLabels: + {{- include "llm-uservice.selectorLabels" . | nindent 6 }} + template: + metadata: + labels: + {{- include "llm-uservice.selectorLabels" . | nindent 8 }} + spec: + securityContext: {} + containers: + - name: {{ .Release.Name }} + command: + - /bin/bash + - -c + - | + cp /usr/lib/ssl/cert.pem /tmp/bundle.crt && \ + cat /rhoai-ca/tls.crt | tee -a '/tmp/bundle.crt' && \ + bash ./entrypoint.sh + env: + - name: TGI_LLM_ENDPOINT + value: "https://{{ .Values.servingRuntime.name }}-{{ .Values.servingRuntime.namespace }}.apps.{{ .Values.global.clusterDomain }}" + - name: HUGGINGFACEHUB_API_TOKEN + valueFrom: + secretKeyRef: + key: HUGGING_FACE_HUB_TOKEN + name: hf-token + - name: PYTHONPATH + value: {{ .Values.PYTHONPATH | quote }} + - name: HOME + value: {{ .Values.HOME | quote }} + - name: SSL_CERT_FILE + value: /tmp/bundle.crt + securityContext: {} + image: "{{ .Values.image.repository }}:{{ .Values.image.tag }}" + imagePullPolicy: {{ .Values.image.pullPolicy }} + ports: + - name: llm-uservice + containerPort: 9000 + protocol: TCP + volumeMounts: + - mountPath: /tmp/home + name: local-dir + - mountPath: /rhoai-ca + name: odh-ca-bundle + resources: {} + volumes: + - emptyDir: + sizeLimit: 5Gi + name: local-dir + - name: odh-ca-bundle + secret: + defaultMode: 420 + secretName: rhoai-ca-bundle diff --git a/helm-charts/codegen-openshift-rhoai/llm-uservice/templates/imagestream.yaml b/helm-charts/codegen-openshift-rhoai/llm-uservice/templates/imagestream.yaml new file mode 100644 index 000000000..781e85ef7 --- /dev/null +++ b/helm-charts/codegen-openshift-rhoai/llm-uservice/templates/imagestream.yaml @@ -0,0 +1,12 @@ +# Copyright (C) 2024 Intel Corporation +# SPDX-License-Identifier: Apache-2.0 + +--- +apiVersion: image.openshift.io/v1 +kind: ImageStream +metadata: + name: llm-tgi + namespace: {{ .Release.Namespace }} +spec: + lookupPolicy: + local: true diff --git a/helm-charts/codegen-openshift-rhoai/llm-uservice/templates/job.yaml b/helm-charts/codegen-openshift-rhoai/llm-uservice/templates/job.yaml new file mode 100644 index 000000000..1effabdd6 --- /dev/null +++ b/helm-charts/codegen-openshift-rhoai/llm-uservice/templates/job.yaml @@ -0,0 +1,34 @@ +# Copyright (C) 2024 Intel Corporation +# SPDX-License-Identifier: Apache-2.0 + +apiVersion: batch/v1 +kind: Job +metadata: + name: create-rhoai-istio-rootca-cert-secret +spec: + template: + spec: + containers: + - image: {{ .Values.job.image }} + command: + - /bin/bash + - -c + - | + oc wait --for=condition=ReconcileComplete=True dsc/rhods-datasciencecluster --timeout=-1s + oc wait --for condition=Ready=True knativeserving -n knative-serving knative-serving --timeout=-1s + '/tmp/create-rhoai-istio-rootca-cert-secret.sh' + name: create-rhoai-istio-rootca-cert-secret + volumeMounts: + - mountPath: /tmp/create-rhoai-istio-rootca-cert-secret.sh + name: create-rhoai-istio-rootca-cert-secret + subPath: create-rhoai-istio-rootca-cert-secret.sh + volumes: + - name: create-rhoai-istio-rootca-cert-secret + configMap: + name: create-rhoai-istio-rootca-cert-secret + defaultMode: 0755 + dnsPolicy: ClusterFirst + restartPolicy: Never + serviceAccount: {{ .Values.serviceAccountName }} + serviceAccountName: {{ .Values.serviceAccountName }} + terminationGracePeriodSeconds: 400 diff --git a/helm-charts/codegen-openshift-rhoai/llm-uservice/templates/rbac/crb-rhoai.yaml b/helm-charts/codegen-openshift-rhoai/llm-uservice/templates/rbac/crb-rhoai.yaml new file mode 100644 index 000000000..5bebaacbc --- /dev/null +++ b/helm-charts/codegen-openshift-rhoai/llm-uservice/templates/rbac/crb-rhoai.yaml @@ -0,0 +1,15 @@ +# Copyright (C) 2024 Intel Corporation +# SPDX-License-Identifier: Apache-2.0 + +apiVersion: rbac.authorization.k8s.io/v1 +kind: ClusterRoleBinding +metadata: + name: kubeadmin-rhoai-cluster-admin-rolebinding +roleRef: + apiGroup: rbac.authorization.k8s.io + kind: ClusterRole + name: cluster-admin +subjects: +- kind: User + apiGroup: rbac.authorization.k8s.io + name: 'kube:admin' diff --git a/helm-charts/codegen-openshift-rhoai/llm-uservice/templates/rbac/role.yaml b/helm-charts/codegen-openshift-rhoai/llm-uservice/templates/rbac/role.yaml new file mode 100644 index 000000000..f02d4ac8f --- /dev/null +++ b/helm-charts/codegen-openshift-rhoai/llm-uservice/templates/rbac/role.yaml @@ -0,0 +1,31 @@ +# Copyright (C) 2024 Intel Corporation +# SPDX-License-Identifier: Apache-2.0 + +{{- range $key, $value := .Values.rbac.roles }} +{{- if $value.createRole }} +--- +apiVersion: rbac.authorization.k8s.io/v1 +{{- if $value.scope.cluster }} +kind: ClusterRole +metadata: + name: {{ .name }} +{{- else }} +kind: Role +metadata: + name: {{ $value.name }} +{{- end }} +rules: + - apiGroups: +{{- range $value.apiGroups }} + - {{ . }} +{{- end }} + resources: +{{- range $value.resources }} + - {{ . }} +{{- end }} + verbs: +{{- range $value.verbs }} + - {{ . }} +{{- end }} +{{- end }} +{{- end }} diff --git a/helm-charts/codegen-openshift-rhoai/llm-uservice/templates/rbac/rolebinding.yaml b/helm-charts/codegen-openshift-rhoai/llm-uservice/templates/rbac/rolebinding.yaml new file mode 100644 index 000000000..4a973dff4 --- /dev/null +++ b/helm-charts/codegen-openshift-rhoai/llm-uservice/templates/rbac/rolebinding.yaml @@ -0,0 +1,28 @@ +# Copyright (C) 2024 Intel Corporation +# SPDX-License-Identifier: Apache-2.0 + +{{- range $key, $value := .Values.rbac.roleBindings }} +{{- if $value.createBinding }} +apiVersion: rbac.authorization.k8s.io/v1 +{{- if $value.scope.cluster }} +kind: ClusterRoleBinding +{{- else }} +kind: RoleBinding +{{- end }} +metadata: + name: {{ .name }} +{{- if eq $value.scope.cluster false }} + namespace: {{ $value.scope.namespace }} +{{- end }} +subjects: +- kind: {{ $value.subjects.kind | default "ServiceAccount" }} + name: {{ $value.subjects.name }} + namespace: {{ $value.subjects.namespace }} + apiGroup: "" +roleRef: + kind: {{ $value.roleRef.kind }} + name: {{ $value.roleRef.name }} + apiGroup: rbac.authorization.k8s.io +--- +{{- end }} +{{- end }} diff --git a/helm-charts/codegen-openshift-rhoai/llm-uservice/templates/rbac/serviceAccount.yaml b/helm-charts/codegen-openshift-rhoai/llm-uservice/templates/rbac/serviceAccount.yaml new file mode 100644 index 000000000..2d0088e91 --- /dev/null +++ b/helm-charts/codegen-openshift-rhoai/llm-uservice/templates/rbac/serviceAccount.yaml @@ -0,0 +1,7 @@ +# Copyright (C) 2024 Intel Corporation +# SPDX-License-Identifier: Apache-2.0 + +apiVersion: v1 +kind: ServiceAccount +metadata: + name: {{ .Values.serviceAccountName }} diff --git a/helm-charts/codegen-openshift-rhoai/llm-uservice/templates/service.yaml b/helm-charts/codegen-openshift-rhoai/llm-uservice/templates/service.yaml new file mode 100644 index 000000000..06e13b8c6 --- /dev/null +++ b/helm-charts/codegen-openshift-rhoai/llm-uservice/templates/service.yaml @@ -0,0 +1,18 @@ +# Copyright (C) 2024 Intel Corporation +# SPDX-License-Identifier: Apache-2.0 + +apiVersion: v1 +kind: Service +metadata: + name: {{ include "llm-uservice.fullname" . }} + labels: + {{- include "llm-uservice.labels" . | nindent 4 }} +spec: + type: {{ .Values.service.type }} + ports: + - port: {{ .Values.service.port }} + targetPort: 9000 + protocol: TCP + name: llm-uservice + selector: + {{- include "llm-uservice.selectorLabels" . | nindent 4 }} diff --git a/helm-charts/codegen-openshift-rhoai/llm-uservice/values.yaml b/helm-charts/codegen-openshift-rhoai/llm-uservice/values.yaml new file mode 100644 index 000000000..77c851cde --- /dev/null +++ b/helm-charts/codegen-openshift-rhoai/llm-uservice/values.yaml @@ -0,0 +1,86 @@ +# Copyright (C) 2024 Intel Corporation +# SPDX-License-Identifier: Apache-2.0 + +image: + repository: "insert-your-image-here" # image-registry.openshift-image-registry.svc:5000//llm-tgi + tag: "latest" + pullPolicy: IfNotPresent + +source: + gitRef: main + gitUri: https://github.com/opea-project/GenAIComps.git + type: Git + +strategy: + type: Docker + dockerfilePath: comps/llms/text-generation/tgi/Dockerfile + +replicaCount: 1 + +PYTHONPATH: /home/user/.local/lib/python3.11/site-packages:/home/user # don't edit unless you have a good reason +HOME: /tmp/home + +service: + type: ClusterIP + port: 9000 + +servingRuntime: + name: "insert-name-of-deployed-model-here" + namespace: "insert-project-name-where-model-is-deployed" + +global: + clusterDomain: "insert-your-cluster-domain-here" + +job: + image: image-registry.openshift-image-registry.svc:5000/openshift/cli:latest + +serviceAccountName: codegen-sa + +rbac: + roles: + - name: codegen-rhoai-role + createRole: true + apiGroups: + - '""' + - '"apps"' + - '"batch"' + - '"machineconfiguration.openshift.io"' + - '"rbac.authorization.k8s.io"' + - '"security.openshift.io"' + - '"datasciencecluster.opendatahub.io"' + - '"operator.knative.dev"' + - '"build.openshift.io"' + scope: + cluster: true + resources: + - namespaces + - secrets + - datascienceclusters + - clusterrolebindings + - knativeservings + - jobs + - buildconfigs + - builds + - buildconfigs/instantiate + verbs: + - "create" + - "get" + - "list" + - "patch" + - "update" + - "use" + - "watch" + roleBindings: + - name: codegen-rhoai-rolebind + createBinding: true + scope: + cluster: true + namespace: "" + subjects: + kind: ServiceAccount + name: codegen-sa + namespace: insert-your-namespace-here + apiGroup: "" + roleRef: + kind: ClusterRole + name: codegen-rhoai-role diff --git a/helm-charts/codegen-openshift-rhoai/react-ui/.helmignore b/helm-charts/codegen-openshift-rhoai/react-ui/.helmignore new file mode 100644 index 000000000..0e8a0eb36 --- /dev/null +++ b/helm-charts/codegen-openshift-rhoai/react-ui/.helmignore @@ -0,0 +1,23 @@ +# Patterns to ignore when building packages. +# This supports shell glob matching, relative path matching, and +# negation (prefixed with !). Only one pattern per line. +.DS_Store +# Common VCS dirs +.git/ +.gitignore +.bzr/ +.bzrignore +.hg/ +.hgignore +.svn/ +# Common backup files +*.swp +*.bak +*.tmp +*.orig +*~ +# Various IDEs +.project +.idea/ +*.tmproj +.vscode/ diff --git a/helm-charts/codegen-openshift-rhoai/react-ui/Chart.yaml b/helm-charts/codegen-openshift-rhoai/react-ui/Chart.yaml new file mode 100644 index 000000000..bd518d6c0 --- /dev/null +++ b/helm-charts/codegen-openshift-rhoai/react-ui/Chart.yaml @@ -0,0 +1,27 @@ +# Copyright (C) 2024 Intel Corporation +# SPDX-License-Identifier: Apache-2.0 + +apiVersion: v2 +name: react-ui +description: A Helm chart deploying OPEA CodeGen React UI + +# A chart can be either an 'application' or a 'library' chart. +# +# Application charts are a collection of templates that can be packaged into versioned archives +# to be deployed. +# +# Library charts provide useful utilities or functions for the chart developer. They're included as +# a dependency of application charts to inject those utilities and functions into the rendering +# pipeline. Library charts do not define any templates and therefore cannot be deployed. +type: application + +# This is the chart version. This version number should be incremented each time you make changes +# to the chart and its templates, including the app version. +# Versions are expected to follow Semantic Versioning (https://semver.org/) +version: 0.1.0 + +# This is the version number of the application being deployed. This version number should be +# incremented each time you make changes to the application. Versions are not expected to +# follow Semantic Versioning. They should reflect the version the application is using. +# It is recommended to use it with quotes. +appVersion: "1.0" diff --git a/helm-charts/codegen-openshift-rhoai/react-ui/templates/NOTES.txt b/helm-charts/codegen-openshift-rhoai/react-ui/templates/NOTES.txt new file mode 100644 index 000000000..027640017 --- /dev/null +++ b/helm-charts/codegen-openshift-rhoai/react-ui/templates/NOTES.txt @@ -0,0 +1,2 @@ +1. Get the application URL by running this command or opening OpenShift Console apps. +oc get route -n {{ .Release.Namespace }} {{ include "codegen-react-ui.fullname" . }} diff --git a/helm-charts/codegen-openshift-rhoai/react-ui/templates/_helpers.tpl b/helm-charts/codegen-openshift-rhoai/react-ui/templates/_helpers.tpl new file mode 100644 index 000000000..ca8230902 --- /dev/null +++ b/helm-charts/codegen-openshift-rhoai/react-ui/templates/_helpers.tpl @@ -0,0 +1,51 @@ +{{/* +Expand the name of the chart. +*/}} +{{- define "codegen-react-ui.name" -}} +{{- default .Chart.Name .Values.nameOverride | trunc 63 | trimSuffix "-" }} +{{- end }} + +{{/* +Create a default fully qualified app name. +We truncate at 63 chars because some Kubernetes name fields are limited to this (by the DNS naming spec). +If release name contains chart name it will be used as a full name. +*/}} +{{- define "codegen-react-ui.fullname" -}} +{{- if .Values.fullnameOverride }} +{{- .Values.fullnameOverride | trunc 63 | trimSuffix "-" }} +{{- else }} +{{- $name := default .Chart.Name .Values.nameOverride }} +{{- if contains $name .Release.Name }} +{{- .Release.Name | trunc 63 | trimSuffix "-" }} +{{- else }} +{{- printf "%s-%s" .Release.Name $name | trunc 63 | trimSuffix "-" }} +{{- end }} +{{- end }} +{{- end }} + +{{/* +Create chart name and version as used by the chart label. +*/}} +{{- define "codegen-react-ui.chart" -}} +{{- printf "%s-%s" .Chart.Name .Chart.Version | replace "+" "_" | trunc 63 | trimSuffix "-" }} +{{- end }} + +{{/* +Common labels +*/}} +{{- define "codegen-react-ui.labels" -}} +helm.sh/chart: {{ include "codegen-react-ui.chart" . }} +{{ include "codegen-react-ui.selectorLabels" . }} +{{- if .Chart.AppVersion }} +app.kubernetes.io/version: {{ .Chart.AppVersion | quote }} +{{- end }} +app.kubernetes.io/managed-by: {{ .Release.Service }} +{{- end }} + +{{/* +Selector labels +*/}} +{{- define "codegen-react-ui.selectorLabels" -}} +app.kubernetes.io/name: {{ include "codegen-react-ui.name" . }} +app.kubernetes.io/instance: {{ .Release.Name }} +{{- end }} diff --git a/helm-charts/codegen-openshift-rhoai/react-ui/templates/codegen-react-ui-bc.yaml b/helm-charts/codegen-openshift-rhoai/react-ui/templates/codegen-react-ui-bc.yaml new file mode 100644 index 000000000..dc487b8be --- /dev/null +++ b/helm-charts/codegen-openshift-rhoai/react-ui/templates/codegen-react-ui-bc.yaml @@ -0,0 +1,42 @@ +# Copyright (C) 2024 Intel Corporation +# SPDX-License-Identifier: Apache-2.0 + +--- +apiVersion: image.openshift.io/v1 +kind: ImageStream +metadata: + name: react-ui +spec: + lookupPolicy: + local: true +--- +kind: BuildConfig +apiVersion: build.openshift.io/v1 +metadata: + name: react-ui +spec: + output: + to: + kind: "ImageStreamTag" + name: "react-ui:{{ .Values.image.tag }}" + failedBuildsHistoryLimit: 5 + successfulBuildsHistoryLimit: 5 + nodeSelector: null + postCommit: {} + resources: {} + runPolicy: SerialLatestOnly + source: + contextDir: {{ .Values.source.contextDir }} + git: + ref: {{ .Values.source.gitRef }} + uri: {{ .Values.source.gitUri }} + type: {{ .Values.source.type }} + strategy: + type: {{ .Values.strategy.type }} + dockerStrategy: + env: + - name: VITE_CODE_GEN_URL + value: https://codegen-react-ui-{{ .Release.Namespace }}.apps.{{ .Values.global.clusterDomain }}/v1/codegen + dockerfilePath: {{ .Values.strategy.dockerfilePath }} + triggers: + - type: ConfigChange diff --git a/helm-charts/codegen-openshift-rhoai/react-ui/templates/codegen-react-ui-cms.yaml b/helm-charts/codegen-openshift-rhoai/react-ui/templates/codegen-react-ui-cms.yaml new file mode 100644 index 000000000..fa68f3586 --- /dev/null +++ b/helm-charts/codegen-openshift-rhoai/react-ui/templates/codegen-react-ui-cms.yaml @@ -0,0 +1,55 @@ +# Copyright (C) 2024 Intel Corporation +# SPDX-License-Identifier: Apache-2.0 + +--- +apiVersion: v1 +data: + default.conf: | + server { + listen 8080; + gzip on; + gzip_proxied any; + gzip_comp_level 6; + gzip_buffers 16 8k; + gzip_http_version 1.1; + gzip_types font/woff2 text/css application/javascript application/json application/font-woff application/font-tff image/gif image/png image/svg+xml application/octet-stream; + location / { + root /usr/share/nginx/html; + index index.html index.htm; + try_files $uri $uri/ /index.html =404; + location ~* \.(gif|jpe?g|png|webp|ico|svg|css|js|mp4|woff2)$ { + expires 1d; + } + } + } +kind: ConfigMap +metadata: + creationTimestamp: null + name: {{ include "codegen-react-ui.fullname" . }}-default-conf +--- +apiVersion: v1 +data: + nginx.conf: | + user nginx; + worker_processes auto; + error_log /var/log/nginx/error.log notice; + pid /tmp/nginx.pid; + events { + worker_connections 1024; + } + http { + include /etc/nginx/mime.types; + default_type application/octet-stream; + log_format main '$remote_addr - $remote_user [$time_local] "$request" ' + '$status $body_bytes_sent "$http_referer" ' + '"$http_user_agent" "$http_x_forwarded_for"'; + access_log /var/log/nginx/access.log main; + sendfile on; + #tcp_nopush on; + keepalive_timeout 65; + #gzip on; + include /etc/nginx/conf.d/*.conf; + } +kind: ConfigMap +metadata: + name: {{ include "codegen-react-ui.fullname" . }}-nginx-conf diff --git a/helm-charts/codegen-openshift-rhoai/react-ui/templates/codegen-react-ui-consolelink.yaml b/helm-charts/codegen-openshift-rhoai/react-ui/templates/codegen-react-ui-consolelink.yaml new file mode 100644 index 000000000..d01a0925a --- /dev/null +++ b/helm-charts/codegen-openshift-rhoai/react-ui/templates/codegen-react-ui-consolelink.yaml @@ -0,0 +1,15 @@ +# Copyright (C) 2024 Intel Corporation +# SPDX-License-Identifier: Apache-2.0 + +apiVersion: console.openshift.io/v1 +kind: ConsoleLink +metadata: + name: {{ include "codegen-react-ui.fullname" . }}-{{ randAlphaNum 12 | lower }} +spec: + href: https://codegen-react-ui-{{ .Release.Namespace }}.apps.{{ .Values.global.clusterDomain }} + location: ApplicationMenu + # This text will appear in a box called "Launcher" under "namespace" or "project" in the web console + text: OPEA CodeGen UI + applicationMenu: + section: Open Platform For Enterprise AI + imageURL:  diff --git a/helm-charts/codegen-openshift-rhoai/react-ui/templates/codegen-react-ui-deployment.yaml b/helm-charts/codegen-openshift-rhoai/react-ui/templates/codegen-react-ui-deployment.yaml new file mode 100644 index 000000000..8d428a3b5 --- /dev/null +++ b/helm-charts/codegen-openshift-rhoai/react-ui/templates/codegen-react-ui-deployment.yaml @@ -0,0 +1,85 @@ +# Copyright (C) 2024 Intel Corporation +# SPDX-License-Identifier: Apache-2.0 + +--- +apiVersion: apps/v1 +kind: Deployment +metadata: + name: {{ include "codegen-react-ui.fullname" . }} + labels: + {{- include "codegen-react-ui.labels" . | nindent 4 }} + annotations: + image.openshift.io/triggers: '[{"from":{"kind":"ImageStreamTag","name":"react-ui:{{ .Values.image.tag }}"},"fieldPath":"spec.template.spec.containers[?(@.name==\"{{ .Chart.Name }}\")].image"}]' +spec: + selector: + matchLabels: + {{- include "codegen-react-ui.selectorLabels" . | nindent 8 }} + template: + metadata: + {{- with .Values.podAnnotations }} + annotations: + {{- toYaml . | nindent 8 }} + {{- end }} + labels: + {{- include "codegen-react-ui.selectorLabels" . | nindent 8 }} + spec: + {{- with .Values.imagePullSecrets }} + imagePullSecrets: + {{- toYaml . | nindent 8 }} + {{- end }} + securityContext: {{- toYaml .Values.podSecurityContext | nindent 8 }} + containers: + - command: ["/bin/sh", "-c"] + args: + - | + cp -v /etc/nginx/nginx.conf /tmp/nginx.conf && \ + sed -i "s/pid .*;/pid \/tmp\/nginx.pid;/" /tmp/nginx.conf && \ + nginx -c /tmp/nginx.conf -g "daemon off;" + image: {{ .Values.image.repository }}:{{ .Values.image.tag }} + name: {{ .Chart.Name }} + ports: + - name: http + containerPort: {{ .Values.service.port }} + protocol: TCP + livenessProbe: + httpGet: + path: / + port: http + readinessProbe: + httpGet: + path: / + port: http + resources: + {{- toYaml .Values.resources | nindent 10 }} + securityContext: + {{- toYaml .Values.securityContext | nindent 10 }} + volumeMounts: + - mountPath: /var/cache/nginx/ + name: nginx-cache-dir + - mountPath: /etc/nginx/conf.d/default.conf + name: default-conf + subPath: default.conf + - mountPath: /etc/nginx/nginx.conf + name: nginx-conf + subPath: nginx.conf + volumes: + - name: nginx-cache-dir + emptyDir: {} + - name: default-conf + configMap: + name: {{ include "codegen-react-ui.fullname" . }}-default-conf + - name: nginx-conf + configMap: + name: {{ include "codegen-react-ui.fullname" . }}-nginx-conf + {{- with .Values.nodeSelector }} + nodeSelector: + {{- toYaml . | nindent 8 }} + {{- end }} + {{- with .Values.affinity }} + affinity: + {{- toYaml . | nindent 8 }} + {{- end }} + {{- with .Values.tolerations }} + tolerations: + {{- toYaml . | nindent 8 }} + {{- end }} diff --git a/helm-charts/codegen-openshift-rhoai/react-ui/templates/codegen-react-ui-https-route.yaml b/helm-charts/codegen-openshift-rhoai/react-ui/templates/codegen-react-ui-https-route.yaml new file mode 100644 index 000000000..09ff5e8f4 --- /dev/null +++ b/helm-charts/codegen-openshift-rhoai/react-ui/templates/codegen-react-ui-https-route.yaml @@ -0,0 +1,17 @@ +# Copyright (C) 2024 Intel Corporation +# SPDX-License-Identifier: Apache-2.0 + +--- +apiVersion: route.openshift.io/v1 +kind: Route +metadata: + labels: + {{- include "codegen-react-ui.selectorLabels" . | nindent 8 }} + name: {{ include "codegen-react-ui.fullname" . }} +spec: + host: codegen-react-ui-{{ .Release.Namespace }}.apps.{{ .Values.global.clusterDomain }} + tls: + termination: edge + to: + name: {{ include "codegen-react-ui.fullname" . }} +status: {} diff --git a/helm-charts/codegen-openshift-rhoai/react-ui/templates/codegen-react-ui-service.yaml b/helm-charts/codegen-openshift-rhoai/react-ui/templates/codegen-react-ui-service.yaml new file mode 100644 index 000000000..c8741bf95 --- /dev/null +++ b/helm-charts/codegen-openshift-rhoai/react-ui/templates/codegen-react-ui-service.yaml @@ -0,0 +1,19 @@ +# Copyright (C) 2024 Intel Corporation +# SPDX-License-Identifier: Apache-2.0 + +--- +apiVersion: v1 +kind: Service +metadata: + name: {{ include "codegen-react-ui.fullname" . }} + labels: + {{- include "codegen-react-ui.labels" . | nindent 4 }} +spec: + type: {{ .Values.service.type }} + ports: + - port: {{ .Values.service.port }} + targetPort: http + protocol: TCP + name: http + selector: + {{- include "codegen-react-ui.selectorLabels" . | nindent 4 }} diff --git a/helm-charts/codegen-openshift-rhoai/react-ui/values.yaml b/helm-charts/codegen-openshift-rhoai/react-ui/values.yaml new file mode 100644 index 000000000..a267d3691 --- /dev/null +++ b/helm-charts/codegen-openshift-rhoai/react-ui/values.yaml @@ -0,0 +1,67 @@ +# Copyright (C) 2024 Intel Corporation +# SPDX-License-Identifier: Apache-2.0 + +# Default values for codegen-react-ui. +# This is a YAML-formatted file. +# Declare variables to be passed into your templates. + +global: + clusterDomain: "insert-your-cluster-domain-here" + +image: + repository: "insert-your-image-here" # image-registry.openshift-image-registry.svc:5000//react-ui + pullPolicy: IfNotPresent + # Overrides the image tag whose default is the chart appVersion. + tag: "latest" + +source: + contextDir: CodeGen/ui + gitRef: main + gitUri: "https://github.com/opea-project/GenAIExamples.git" + type: Git + +strategy: + type: Docker + dockerfilePath: docker/Dockerfile.react + +imagePullSecrets: [] +nameOverride: "" +fullnameOverride: "" + +podAnnotations: {} + +podSecurityContext: {} + # fsGroup: 2000 + +securityContext: {} + # capabilities: + # drop: + # - ALL + # readOnlyRootFilesystem: true + # runAsNonRoot: true + # runAsUser: 1000 + +service: + type: ClusterIP + port: 8080 + +resources: {} + # limits: + # cpu: 100m + # memory: 128Mi + # requests: + # cpu: 100m + # memory: 128Mi + +autoscaling: + enabled: false + minReplicas: 1 + maxReplicas: 100 + targetCPUUtilizationPercentage: 80 + # targetMemoryUtilizationPercentage: 80 + +nodeSelector: {} + +tolerations: [] + +affinity: {} diff --git a/helm-charts/codegen-openshift-rhoai/templates/_helpers.tpl b/helm-charts/codegen-openshift-rhoai/templates/_helpers.tpl new file mode 100644 index 000000000..7cffcef8d --- /dev/null +++ b/helm-charts/codegen-openshift-rhoai/templates/_helpers.tpl @@ -0,0 +1,51 @@ +{{/* +Expand the name of the chart. +*/}} +{{- define "codegen.name" -}} +{{- default .Chart.Name .Values.nameOverride | trunc 63 | trimSuffix "-" }} +{{- end }} + +{{/* +Create a default fully qualified app name. +We truncate at 63 chars because some Kubernetes name fields are limited to this (by the DNS naming spec). +If release name contains chart name it will be used as a full name. +*/}} +{{- define "codegen.fullname" -}} +{{- if .Values.fullnameOverride }} +{{- .Values.fullnameOverride | trunc 63 | trimSuffix "-" }} +{{- else }} +{{- $name := default .Chart.Name .Values.nameOverride }} +{{- if contains $name .Release.Name }} +{{- .Release.Name | trunc 63 | trimSuffix "-" }} +{{- else }} +{{- printf "%s-%s" .Release.Name $name | trunc 63 | trimSuffix "-" }} +{{- end }} +{{- end }} +{{- end }} + +{{/* +Create chart name and version as used by the chart label. +*/}} +{{- define "codegen.chart" -}} +{{- printf "%s-%s" .Chart.Name .Chart.Version | replace "+" "_" | trunc 63 | trimSuffix "-" }} +{{- end }} + +{{/* +Common labels +*/}} +{{- define "codegen.labels" -}} +helm.sh/chart: {{ include "codegen.chart" . }} +{{ include "codegen.selectorLabels" . }} +{{- if .Chart.AppVersion }} +app.kubernetes.io/version: {{ .Chart.AppVersion | quote }} +{{- end }} +app.kubernetes.io/managed-by: {{ .Release.Service }} +{{- end }} + +{{/* +Selector labels +*/}} +{{- define "codegen.selectorLabels" -}} +app.kubernetes.io/name: {{ include "codegen.name" . }} +app.kubernetes.io/instance: {{ .Release.Name }} +{{- end }} diff --git a/helm-charts/codegen-openshift-rhoai/templates/buildconfig.yaml b/helm-charts/codegen-openshift-rhoai/templates/buildconfig.yaml new file mode 100644 index 000000000..dc9790cd3 --- /dev/null +++ b/helm-charts/codegen-openshift-rhoai/templates/buildconfig.yaml @@ -0,0 +1,31 @@ +# Copyright (C) 2024 Intel Corporation +# SPDX-License-Identifier: Apache-2.0 + +kind: BuildConfig +apiVersion: build.openshift.io/v1 +metadata: + name: {{ include "codegen.fullname" . }} + namespace: {{ .Release.Namespace }} +spec: + output: + to: + kind: "ImageStreamTag" + name: "codegen:latest" + failedBuildsHistoryLimit: 5 + successfulBuildsHistoryLimit: 5 + nodeSelector: null + postCommit: {} + resources: {} + runPolicy: SerialLatestOnly + source: + contextDir: {{ .Values.source.contextDir }} + git: + ref: {{ .Values.source.gitRef }} + uri: {{ .Values.source.gitUri }} + type: {{ .Values.source.type }} + strategy: + type: {{ .Values.strategy.type }} + dockerStrategy: + dockerfilePath: {{ .Values.strategy.dockerfilePath }} + triggers: + - type: ConfigChange diff --git a/helm-charts/codegen-openshift-rhoai/templates/deployment.yaml b/helm-charts/codegen-openshift-rhoai/templates/deployment.yaml new file mode 100644 index 000000000..88c90c8bb --- /dev/null +++ b/helm-charts/codegen-openshift-rhoai/templates/deployment.yaml @@ -0,0 +1,31 @@ +# Copyright (C) 2024 Intel Corporation +# SPDX-License-Identifier: Apache-2.0 + +apiVersion: apps/v1 +kind: Deployment +metadata: + name: {{ include "codegen.fullname" . }} + labels: + {{- include "codegen.labels" . | nindent 4 }} +spec: + replicas: {{ .Values.replicaCount }} + selector: + matchLabels: + {{- include "codegen.selectorLabels" . | nindent 6 }} + template: + metadata: + labels: + {{- include "codegen.selectorLabels" . | nindent 8 }} + spec: + containers: + - name: {{ .Release.Name }} + env: + - name: LLM_SERVICE_HOST_IP + value: {{ .Release.Name }}-llm-uservice + securityContext: null + image: "{{ .Values.image.repository }}:{{ .Values.image.tag }}" + imagePullPolicy: {{ .Values.image.pullPolicy }} + ports: + - name: codegen + containerPort: {{ .Values.port }} + protocol: TCP diff --git a/helm-charts/codegen-openshift-rhoai/templates/imagestream.yaml b/helm-charts/codegen-openshift-rhoai/templates/imagestream.yaml new file mode 100644 index 000000000..75b7907bb --- /dev/null +++ b/helm-charts/codegen-openshift-rhoai/templates/imagestream.yaml @@ -0,0 +1,12 @@ +# Copyright (C) 2024 Intel Corporation +# SPDX-License-Identifier: Apache-2.0 + +--- +apiVersion: image.openshift.io/v1 +kind: ImageStream +metadata: + name: codegen + namespace: {{ .Release.Namespace }} +spec: + lookupPolicy: + local: true diff --git a/helm-charts/codegen-openshift-rhoai/templates/route.yaml b/helm-charts/codegen-openshift-rhoai/templates/route.yaml new file mode 100644 index 000000000..6791c88c7 --- /dev/null +++ b/helm-charts/codegen-openshift-rhoai/templates/route.yaml @@ -0,0 +1,21 @@ +# Copyright (C) 2024 Intel Corporation +# SPDX-License-Identifier: Apache-2.0 + +apiVersion: route.openshift.io/v1 +kind: Route +metadata: + labels: + {{- include "codegen.labels" . | nindent 4 }} + name: {{ .Release.Name }}-tls +spec: + host: codegen-react-ui-{{ .Release.Namespace }}.apps.{{ .Values.global.clusterDomain }} + path: /v1/codegen + port: + targetPort: codegen + tls: + termination: edge + to: + kind: Service + name: codegen + weight: 100 + wildcardPolicy: None diff --git a/helm-charts/codegen-openshift-rhoai/templates/service.yaml b/helm-charts/codegen-openshift-rhoai/templates/service.yaml new file mode 100644 index 000000000..aec13682f --- /dev/null +++ b/helm-charts/codegen-openshift-rhoai/templates/service.yaml @@ -0,0 +1,18 @@ +# Copyright (C) 2024 Intel Corporation +# SPDX-License-Identifier: Apache-2.0 + +apiVersion: v1 +kind: Service +metadata: + name: {{ include "codegen.fullname" . }} + labels: + {{- include "codegen.labels" . | nindent 4 }} +spec: + type: {{ .Values.service.type }} + ports: + - port: {{ .Values.service.port }} + targetPort: {{ .Values.port }} + protocol: TCP + name: codegen + selector: + {{- include "codegen.selectorLabels" . | nindent 4 }} diff --git a/helm-charts/codegen-openshift-rhoai/tgi/Chart.yaml b/helm-charts/codegen-openshift-rhoai/tgi/Chart.yaml new file mode 100644 index 000000000..b510ea0e1 --- /dev/null +++ b/helm-charts/codegen-openshift-rhoai/tgi/Chart.yaml @@ -0,0 +1,8 @@ +# Copyright (C) 2024 Intel Corporation +# SPDX-License-Identifier: Apache-2.0 + +apiVersion: v2 +name: tgi +description: A Helm chart for deploying servingruntime for ise-uiuc/Magicoder-S-DS-6.7B model on Red Hat OpenShift with Red Hat OpenShift AI +type: application +version: 1.0.0 diff --git a/helm-charts/codegen-openshift-rhoai/tgi/gaudi-values.yaml b/helm-charts/codegen-openshift-rhoai/tgi/gaudi-values.yaml new file mode 100644 index 000000000..651ca99c6 --- /dev/null +++ b/helm-charts/codegen-openshift-rhoai/tgi/gaudi-values.yaml @@ -0,0 +1,44 @@ +# Copyright (C) 2024 Intel Corporation +# SPDX-License-Identifier: Apache-2.0 + +global: + huggingfacehubApiToken: "insert-your-huggingface-token-here" + +image: + repository: ghcr.io/huggingface/tgi-gaudi + tag: 2.0.4 + +servingRuntimeName: tgi-codellama-7b-hf-gaudi +servingRuntimeDisplayName: Text Generation Inference CodeLlama-7b-hf on Gaudi + +args: + - --model-id + - /mnt/models/--meta-llama--CodeLlama-7b-hf/snapshots/b462c3c99b077d341db691ec780a33156f3c1472 + - --port=8080 + - --json-output + - --max-input-length + - "1024" + - --max-total-tokens + - "2048" + +accelerators: '''["habana.ai/gaudi"]''' + +resources: + limits: + habana.ai/gaudi: 1 + requests: + habana.ai/gaudi: 1 + +volumeMounts: + - mountPath: /data + name: model-volume + - mountPath: /var/log/habana_logs + name: logs-volume + +volumes: + - emptyDir: + sizeLimit: 300Gi + name: model-volume + - emptyDir: + sizeLimit: 500Mi + name: logs-volume diff --git a/helm-charts/codegen-openshift-rhoai/tgi/templates/hf_token_secret.yaml b/helm-charts/codegen-openshift-rhoai/tgi/templates/hf_token_secret.yaml new file mode 100644 index 000000000..7ce2d2021 --- /dev/null +++ b/helm-charts/codegen-openshift-rhoai/tgi/templates/hf_token_secret.yaml @@ -0,0 +1,10 @@ +# Copyright (C) 2024 Intel Corporation +# SPDX-License-Identifier: Apache-2.0 + +apiVersion: v1 +kind: Secret +metadata: + name: hf-token +type: Opaque +stringData: + HUGGING_FACE_HUB_TOKEN: {{ .Values.global.huggingfacehubApiToken }} diff --git a/helm-charts/codegen-openshift-rhoai/tgi/templates/servingruntime-template.yaml b/helm-charts/codegen-openshift-rhoai/tgi/templates/servingruntime-template.yaml new file mode 100644 index 000000000..eda75fc28 --- /dev/null +++ b/helm-charts/codegen-openshift-rhoai/tgi/templates/servingruntime-template.yaml @@ -0,0 +1,52 @@ +# Copyright (C) 2024 Intel Corporation +# SPDX-License-Identifier: Apache-2.0 + +--- +apiVersion: template.openshift.io/v1 +kind: Template +metadata: + annotations: + opendatahub.io/apiProtocol: REST + opendatahub.io/modelServingSupport: '["single"]' + labels: + opendatahub.io/dashboard: "true" + name: {{ .Values.servingRuntimeName }} + namespace: redhat-ods-applications +objects: +- apiVersion: serving.kserve.io/v1alpha1 + kind: ServingRuntime + labels: + opendatahub.io/dashboard: "true" + metadata: + annotations: + openshift.io/display-name: {{ .Values.servingRuntimeDisplayName }} + opendatahub.io/recommended-accelerators: {{ .Values.accelerators }} + name: {{ .Values.servingRuntimeName }} + spec: + containers: + - args: {{- toYaml .Values.args | nindent 10 }} + env: + - name: NUMBA_CACHE_DIR + value: /tmp/hf_home + - name: HF_HOME + value: /tmp/hf_home + - name: HF_HUB_CACHE + value: /mnt/models + - name: HUGGING_FACE_HUB_TOKEN + valueFrom: + secretKeyRef: + key: HUGGING_FACE_HUB_TOKEN + name: hf-token + image: "{{ .Values.image.repository }}:{{ .Values.image.tag }}" + name: kserve-container + ports: + - containerPort: 8080 + protocol: TCP + resources: + {{- toYaml .Values.resources | nindent 10 }} + volumeMounts: {{- toYaml .Values.volumeMounts | nindent 10 }} + multiModel: false + supportedModelFormats: + - autoSelect: true + name: llm + volumes: {{- toYaml .Values.volumes | nindent 6 }} diff --git a/helm-charts/codegen-openshift-rhoai/tgi/values.yaml b/helm-charts/codegen-openshift-rhoai/tgi/values.yaml new file mode 100644 index 000000000..15dd6d50a --- /dev/null +++ b/helm-charts/codegen-openshift-rhoai/tgi/values.yaml @@ -0,0 +1,35 @@ +# Copyright (C) 2024 Intel Corporation +# SPDX-License-Identifier: Apache-2.0 + +global: + huggingfacehubApiToken: "insert-your-huggingface-token-here" + +image: + repository: ghcr.io/huggingface/text-generation-inference + tag: 2.1.0 + +servingRuntimeName: tgi-magicoder-s-ds-6.7b-cpu +servingRuntimeDisplayName: Text Generation Inference Magicoder-S-DS-6.7B on CPU + +args: + - --model-id + - /mnt/models/--ise-uiuc--Magicoder-S-DS-6.7B/snapshots/b3ed7cb1578a3643ceaf2ebf996a3d8e85f75d8f + - --port=8080 + - --json-output + - --max-input-length + - "1024" + - --max-batch-prefill-tokens + - "32256" + +accelerators: {} + +resources: {} + +volumes: + - emptyDir: + sizeLimit: 300Gi + name: model-volume + +volumeMounts: + - mountPath: /data + name: model-volume diff --git a/helm-charts/codegen-openshift-rhoai/values.yaml b/helm-charts/codegen-openshift-rhoai/values.yaml new file mode 100644 index 000000000..cec3bb2ee --- /dev/null +++ b/helm-charts/codegen-openshift-rhoai/values.yaml @@ -0,0 +1,28 @@ +# Copyright (C) 2024 Intel Corporation +# SPDX-License-Identifier: Apache-2.0 + +global: + clusterDomain: "insert-your-cluster-domain-here" + +image: + repository: "insert-your-image-here" # image-registry.openshift-image-registry.svc:5000/${NAMESPACE}/codegen + tag: "latest" + pullPolicy: IfNotPresent + +source: + contextDir: CodeGen + gitRef: main + gitUri: "https://github.com/opea-project/GenAIExamples.git" + type: Git + +strategy: + type: Docker + dockerfilePath: Dockerfile + +replicaCount: 1 + +port: 7778 + +service: + type: ClusterIP + port: 7778 diff --git a/helm-charts/codegen-openshift/Chart.yaml b/helm-charts/codegen-openshift/Chart.yaml new file mode 100644 index 000000000..8f015e2b4 --- /dev/null +++ b/helm-charts/codegen-openshift/Chart.yaml @@ -0,0 +1,18 @@ +# Copyright (C) 2024 Intel Corporation +# SPDX-License-Identifier: Apache-2.0 + +apiVersion: v2 +name: codegen +description: A Helm chart for deploying codegen on Red Hat OpenShift +dependencies: + - name: tgi + version: 1.0.0 + repository: "file://tgi" + - name: llm-uservice + version: 1.0.0 + repository: "file://llm-uservice" + - name: react-ui + version: 0.1.0 + repository: "file://react-ui" +type: application +version: 1.0.0 diff --git a/helm-charts/codegen-openshift/README.md b/helm-charts/codegen-openshift/README.md new file mode 100644 index 000000000..46030ab05 --- /dev/null +++ b/helm-charts/codegen-openshift/README.md @@ -0,0 +1,37 @@ +# CodeGen + +Helm chart for deploying CodeGen service on Red Hat OpenShift. + +# Prerequisites + +1. Red Hat OpenShift Cluster with dynamic StorageClass to provision PersistentVolumes e.g. OpenShift Data Foundation) +2. Image registry to push there docker images (https://docs.openshift.com/container-platform/4.16/registry/securing-exposing-registry.html). +3. Account on https://huggingface.co/, access to model _ise-uiuc/Magicoder-S-DS-6.7B_ (for Xeon) or _meta-llama/CodeLlama-7b-hf_ (for Gaudi) and token with Read permissions. + +## Installing the Chart + +To install the chart, login to OpenShift CLI, go to your project and run the following: + +```console +cd GenAIInfra/helm-charts/ +./update_dependency.sh +helm dependency update codegen-openshift + +export NAMESPACE="insert-your-namespace-here" +export CLUSTERDOMAIN="$(oc get Ingress.config.openshift.io/cluster -o jsonpath='{.spec.domain}' | sed 's/^apps.//')" +export HFTOKEN="insert-your-huggingface-token-here" + +# To run on Xeon +helm install codegen codegen-openshift --set image.repository=image-registry.openshift-image-registry.svc:5000/${NAMESPACE}/codegen --set llm-uservice.image.repository=image-registry.openshift-image-registry.svc:5000/${NAMESPACE}/llm-tgi --set react-ui.image.repository=image-registry.openshift-image-registry.svc:5000/${NAMESPACE}/react-ui --set global.clusterDomain=${CLUSTERDOMAIN} --set global.huggingfacehubApiToken=${HFTOKEN} + +# To run on Gaudi +helm install codegen codegen-openshift --set image.repository=image-registry.openshift-image-registry.svc:5000/${NAMESPACE}/codegen --set llm-uservice.image.repository=image-registry.openshift-image-registry.svc:5000/${NAMESPACE}/llm-tgi --set react-ui.image.repository=image-registry.openshift-image-registry.svc:5000/${NAMESPACE}/react-ui --set global.clusterDomain=${CLUSTERDOMAIN} --set global.huggingfacehubApiToken=${HFTOKEN} --values codegen-openshift/gaudi-values.yaml +``` + +## Verify + +To verify the installation, run the command `oc get pods` to make sure all pods are running. Wait about 5 minutes for building images. When 3 pods achieve _Completed_ status, the rest with services should go to _Running_. + +## Launch the UI + +To access the frontend, find the route for _react-ui_ with command `oc get routes` and open it in the browser. diff --git a/helm-charts/codegen-openshift/gaudi-values.yaml b/helm-charts/codegen-openshift/gaudi-values.yaml new file mode 100644 index 000000000..a9b5811d0 --- /dev/null +++ b/helm-charts/codegen-openshift/gaudi-values.yaml @@ -0,0 +1,62 @@ +# Copyright (C) 2024 Intel Corporation +# SPDX-License-Identifier: Apache-2.0 + +global: + clusterDomain: "insert-your-cluster-domain-here" + huggingfacehubApiToken: "insert-your-huggingface-token-here" + +image: + repository: "insert-your-image-here" + tag: "latest" + pullPolicy: IfNotPresent + +source: + contextDir: CodeGen + gitRef: main + gitUri: "https://github.com/opea-project/GenAIExamples.git" + type: Git + +strategy: + type: Docker + dockerfilePath: Dockerfile + +replicaCount: 1 + +port: 7778 + +service: + type: ClusterIP + port: 7778 + +tgi: + model: + id: meta-llama/CodeLlama-7b-hf + replicaCount: 1 + port: 8080 + image: + repository: ghcr.io/huggingface/tgi-gaudi + tag: 2.0.4 + pullPolicy: IfNotPresent + service: + type: ClusterIP + resources: + limits: + habana.ai/gaudi: 1 + requests: + habana.ai/gaudi: 1 + args: + - "--max-input-length" + - "1024" + - "--max-total-tokens" + - "2048" + volumeMounts: + - mountPath: /data + name: model-volume + - mountPath: /var/log/habana_logs + name: logs-volume + volumes: + - emptyDir: + sizeLimit: 50Gi + name: model-volume + - emptyDir: + name: logs-volume diff --git a/helm-charts/codegen-openshift/llm-uservice/Chart.yaml b/helm-charts/codegen-openshift/llm-uservice/Chart.yaml new file mode 100644 index 000000000..ecf281498 --- /dev/null +++ b/helm-charts/codegen-openshift/llm-uservice/Chart.yaml @@ -0,0 +1,8 @@ +# Copyright (C) 2024 Intel Corporation +# SPDX-License-Identifier: Apache-2.0 + +apiVersion: v2 +name: llm-uservice +description: A Helm chart for deploying llm-uservice on Red Hat OpenShift +type: application +version: 1.0.0 diff --git a/helm-charts/codegen-openshift/llm-uservice/templates/_helpers.tpl b/helm-charts/codegen-openshift/llm-uservice/templates/_helpers.tpl new file mode 100644 index 000000000..726fca204 --- /dev/null +++ b/helm-charts/codegen-openshift/llm-uservice/templates/_helpers.tpl @@ -0,0 +1,51 @@ +{{/* +Expand the name of the chart. +*/}} +{{- define "llm-uservice.name" -}} +{{- default .Chart.Name .Values.nameOverride | trunc 63 | trimSuffix "-" }} +{{- end }} + +{{/* +Create a default fully qualified app name. +We truncate at 63 chars because some Kubernetes name fields are limited to this (by the DNS naming spec). +If release name contains chart name it will be used as a full name. +*/}} +{{- define "llm-uservice.fullname" -}} +{{- if .Values.fullnameOverride }} +{{- .Values.fullnameOverride | trunc 63 | trimSuffix "-" }} +{{- else }} +{{- $name := default .Chart.Name .Values.nameOverride }} +{{- if contains $name .Release.Name }} +{{- .Release.Name | trunc 63 | trimSuffix "-" }} +{{- else }} +{{- printf "%s-%s" .Release.Name $name | trunc 63 | trimSuffix "-" }} +{{- end }} +{{- end }} +{{- end }} + +{{/* +Create chart name and version as used by the chart label. +*/}} +{{- define "llm-uservice.chart" -}} +{{- printf "%s-%s" .Chart.Name .Chart.Version | replace "+" "_" | trunc 63 | trimSuffix "-" }} +{{- end }} + +{{/* +Common labels +*/}} +{{- define "llm-uservice.labels" -}} +helm.sh/chart: {{ include "llm-uservice.chart" . }} +{{ include "llm-uservice.selectorLabels" . }} +{{- if .Chart.AppVersion }} +app.kubernetes.io/version: {{ .Chart.AppVersion | quote }} +{{- end }} +app.kubernetes.io/managed-by: {{ .Release.Service }} +{{- end }} + +{{/* +Selector labels +*/}} +{{- define "llm-uservice.selectorLabels" -}} +app.kubernetes.io/name: {{ include "llm-uservice.name" . }} +app.kubernetes.io/instance: {{ .Release.Name }} +{{- end }} diff --git a/helm-charts/codegen-openshift/llm-uservice/templates/buildconfig.yaml b/helm-charts/codegen-openshift/llm-uservice/templates/buildconfig.yaml new file mode 100644 index 000000000..9b3d0631d --- /dev/null +++ b/helm-charts/codegen-openshift/llm-uservice/templates/buildconfig.yaml @@ -0,0 +1,30 @@ +# Copyright (C) 2024 Intel Corporation +# SPDX-License-Identifier: Apache-2.0 + +kind: BuildConfig +apiVersion: build.openshift.io/v1 +metadata: + name: {{ include "llm-uservice.fullname" . }} + namespace: {{ .Release.Namespace }} +spec: + output: + to: + kind: "ImageStreamTag" + name: "llm-tgi:latest" + failedBuildsHistoryLimit: 5 + successfulBuildsHistoryLimit: 5 + nodeSelector: null + postCommit: {} + resources: {} + runPolicy: SerialLatestOnly + source: + git: + ref: {{ .Values.source.gitRef }} + uri: {{ .Values.source.gitUri }} + type: {{ .Values.source.type }} + strategy: + type: {{ .Values.strategy.type }} + dockerStrategy: + dockerfilePath: {{ .Values.strategy.dockerfilePath }} + triggers: + - type: ConfigChange diff --git a/helm-charts/codegen-openshift/llm-uservice/templates/deployment.yaml b/helm-charts/codegen-openshift/llm-uservice/templates/deployment.yaml new file mode 100644 index 000000000..523cf9343 --- /dev/null +++ b/helm-charts/codegen-openshift/llm-uservice/templates/deployment.yaml @@ -0,0 +1,53 @@ +# Copyright (C) 2024 Intel Corporation +# SPDX-License-Identifier: Apache-2.0 + +apiVersion: apps/v1 +kind: Deployment +metadata: + name: {{ include "llm-uservice.fullname" . }} + labels: + {{- include "llm-uservice.labels" . | nindent 4 }} + annotations: + image.openshift.io/triggers: '[{"from":{"kind":"ImageStreamTag","name":"llm-tgi:latest"},"fieldPath":"spec.template.spec.containers[?(@.name==\"llm-tgi-server\")].image"}]' +spec: + replicas: {{ .Values.replicaCount }} + selector: + matchLabels: + {{- include "llm-uservice.selectorLabels" . | nindent 6 }} + template: + metadata: + labels: + {{- include "llm-uservice.selectorLabels" . | nindent 8 }} + spec: + securityContext: {} + containers: + - name: {{ .Release.Name }} + env: + - name: TGI_LLM_ENDPOINT + value: {{ .Values.TGI_LLM_ENDPOINT | quote}} + - name: HUGGINGFACEHUB_API_TOKEN + valueFrom: + secretKeyRef: + key: HUGGING_FACE_HUB_TOKEN + name: hf-token + - name: PYTHONPATH + value: {{ .Values.PYTHONPATH | quote }} + - name: HOME + value: {{ .Values.HOME | quote }} + securityContext: {} + image: "{{ .Values.image.repository }}:{{ .Values.image.tag }}" + imagePullPolicy: {{ .Values.image.pullPolicy }} + ports: + - name: llm-uservice + containerPort: 9000 + protocol: TCP + startupProbe: + {{- toYaml .Values.startupProbe | nindent 12 }} + volumeMounts: + - mountPath: /tmp/home + name: local-dir + resources: {} + volumes: + - emptyDir: + sizeLimit: 5Gi + name: local-dir diff --git a/helm-charts/codegen-openshift/llm-uservice/templates/imagestream.yaml b/helm-charts/codegen-openshift/llm-uservice/templates/imagestream.yaml new file mode 100644 index 000000000..31dc5155d --- /dev/null +++ b/helm-charts/codegen-openshift/llm-uservice/templates/imagestream.yaml @@ -0,0 +1,12 @@ +# Copyright (C) 2024 Intel Corporation +# SPDX-License-Identifier: Apache-2.0 + +--- +apiVersion: image.openshift.io/v1 +kind: ImageStream +metadata: + name: llm-tgi + namespace: {{ .Release.namespace }} +spec: + lookupPolicy: + local: true diff --git a/helm-charts/codegen-openshift/llm-uservice/templates/service.yaml b/helm-charts/codegen-openshift/llm-uservice/templates/service.yaml new file mode 100644 index 000000000..06e13b8c6 --- /dev/null +++ b/helm-charts/codegen-openshift/llm-uservice/templates/service.yaml @@ -0,0 +1,18 @@ +# Copyright (C) 2024 Intel Corporation +# SPDX-License-Identifier: Apache-2.0 + +apiVersion: v1 +kind: Service +metadata: + name: {{ include "llm-uservice.fullname" . }} + labels: + {{- include "llm-uservice.labels" . | nindent 4 }} +spec: + type: {{ .Values.service.type }} + ports: + - port: {{ .Values.service.port }} + targetPort: 9000 + protocol: TCP + name: llm-uservice + selector: + {{- include "llm-uservice.selectorLabels" . | nindent 4 }} diff --git a/helm-charts/codegen-openshift/llm-uservice/values.yaml b/helm-charts/codegen-openshift/llm-uservice/values.yaml new file mode 100644 index 000000000..bbcd92c4b --- /dev/null +++ b/helm-charts/codegen-openshift/llm-uservice/values.yaml @@ -0,0 +1,36 @@ +# Copyright (C) 2024 Intel Corporation +# SPDX-License-Identifier: Apache-2.0 + +image: + repository: "insert-your-image-here" + tag: "latest" + pullPolicy: IfNotPresent + +source: + gitRef: main + gitUri: https://github.com/opea-project/GenAIComps.git + type: Git + +strategy: + type: Docker + dockerfilePath: comps/llms/text-generation/tgi/Dockerfile + +replicaCount: 1 + +TGI_LLM_ENDPOINT: "http://codegen-tgi:8080" # if you change it, update also startupProbe@line 26 +PYTHONPATH: /home/user/.local/lib/python3.11/site-packages:/home/user # don't edit unless you have a good reason +HOME: /tmp/home + +startupProbe: + exec: + command: + - python + - -c + - 'import requests; req = requests.get("http://codegen-tgi:8080/info"); print(req)' + initialDelaySeconds: 5 + periodSeconds: 5 + failureThreshold: 120 + +service: + type: ClusterIP + port: 9000 diff --git a/helm-charts/codegen-openshift/react-ui/.helmignore b/helm-charts/codegen-openshift/react-ui/.helmignore new file mode 100644 index 000000000..0e8a0eb36 --- /dev/null +++ b/helm-charts/codegen-openshift/react-ui/.helmignore @@ -0,0 +1,23 @@ +# Patterns to ignore when building packages. +# This supports shell glob matching, relative path matching, and +# negation (prefixed with !). Only one pattern per line. +.DS_Store +# Common VCS dirs +.git/ +.gitignore +.bzr/ +.bzrignore +.hg/ +.hgignore +.svn/ +# Common backup files +*.swp +*.bak +*.tmp +*.orig +*~ +# Various IDEs +.project +.idea/ +*.tmproj +.vscode/ diff --git a/helm-charts/codegen-openshift/react-ui/Chart.yaml b/helm-charts/codegen-openshift/react-ui/Chart.yaml new file mode 100644 index 000000000..bd518d6c0 --- /dev/null +++ b/helm-charts/codegen-openshift/react-ui/Chart.yaml @@ -0,0 +1,27 @@ +# Copyright (C) 2024 Intel Corporation +# SPDX-License-Identifier: Apache-2.0 + +apiVersion: v2 +name: react-ui +description: A Helm chart deploying OPEA CodeGen React UI + +# A chart can be either an 'application' or a 'library' chart. +# +# Application charts are a collection of templates that can be packaged into versioned archives +# to be deployed. +# +# Library charts provide useful utilities or functions for the chart developer. They're included as +# a dependency of application charts to inject those utilities and functions into the rendering +# pipeline. Library charts do not define any templates and therefore cannot be deployed. +type: application + +# This is the chart version. This version number should be incremented each time you make changes +# to the chart and its templates, including the app version. +# Versions are expected to follow Semantic Versioning (https://semver.org/) +version: 0.1.0 + +# This is the version number of the application being deployed. This version number should be +# incremented each time you make changes to the application. Versions are not expected to +# follow Semantic Versioning. They should reflect the version the application is using. +# It is recommended to use it with quotes. +appVersion: "1.0" diff --git a/helm-charts/codegen-openshift/react-ui/templates/NOTES.txt b/helm-charts/codegen-openshift/react-ui/templates/NOTES.txt new file mode 100644 index 000000000..027640017 --- /dev/null +++ b/helm-charts/codegen-openshift/react-ui/templates/NOTES.txt @@ -0,0 +1,2 @@ +1. Get the application URL by running this command or opening OpenShift Console apps. +oc get route -n {{ .Release.Namespace }} {{ include "codegen-react-ui.fullname" . }} diff --git a/helm-charts/codegen-openshift/react-ui/templates/_helpers.tpl b/helm-charts/codegen-openshift/react-ui/templates/_helpers.tpl new file mode 100644 index 000000000..ca8230902 --- /dev/null +++ b/helm-charts/codegen-openshift/react-ui/templates/_helpers.tpl @@ -0,0 +1,51 @@ +{{/* +Expand the name of the chart. +*/}} +{{- define "codegen-react-ui.name" -}} +{{- default .Chart.Name .Values.nameOverride | trunc 63 | trimSuffix "-" }} +{{- end }} + +{{/* +Create a default fully qualified app name. +We truncate at 63 chars because some Kubernetes name fields are limited to this (by the DNS naming spec). +If release name contains chart name it will be used as a full name. +*/}} +{{- define "codegen-react-ui.fullname" -}} +{{- if .Values.fullnameOverride }} +{{- .Values.fullnameOverride | trunc 63 | trimSuffix "-" }} +{{- else }} +{{- $name := default .Chart.Name .Values.nameOverride }} +{{- if contains $name .Release.Name }} +{{- .Release.Name | trunc 63 | trimSuffix "-" }} +{{- else }} +{{- printf "%s-%s" .Release.Name $name | trunc 63 | trimSuffix "-" }} +{{- end }} +{{- end }} +{{- end }} + +{{/* +Create chart name and version as used by the chart label. +*/}} +{{- define "codegen-react-ui.chart" -}} +{{- printf "%s-%s" .Chart.Name .Chart.Version | replace "+" "_" | trunc 63 | trimSuffix "-" }} +{{- end }} + +{{/* +Common labels +*/}} +{{- define "codegen-react-ui.labels" -}} +helm.sh/chart: {{ include "codegen-react-ui.chart" . }} +{{ include "codegen-react-ui.selectorLabels" . }} +{{- if .Chart.AppVersion }} +app.kubernetes.io/version: {{ .Chart.AppVersion | quote }} +{{- end }} +app.kubernetes.io/managed-by: {{ .Release.Service }} +{{- end }} + +{{/* +Selector labels +*/}} +{{- define "codegen-react-ui.selectorLabels" -}} +app.kubernetes.io/name: {{ include "codegen-react-ui.name" . }} +app.kubernetes.io/instance: {{ .Release.Name }} +{{- end }} diff --git a/helm-charts/codegen-openshift/react-ui/templates/codegen-react-ui-bc.yaml b/helm-charts/codegen-openshift/react-ui/templates/codegen-react-ui-bc.yaml new file mode 100644 index 000000000..dc487b8be --- /dev/null +++ b/helm-charts/codegen-openshift/react-ui/templates/codegen-react-ui-bc.yaml @@ -0,0 +1,42 @@ +# Copyright (C) 2024 Intel Corporation +# SPDX-License-Identifier: Apache-2.0 + +--- +apiVersion: image.openshift.io/v1 +kind: ImageStream +metadata: + name: react-ui +spec: + lookupPolicy: + local: true +--- +kind: BuildConfig +apiVersion: build.openshift.io/v1 +metadata: + name: react-ui +spec: + output: + to: + kind: "ImageStreamTag" + name: "react-ui:{{ .Values.image.tag }}" + failedBuildsHistoryLimit: 5 + successfulBuildsHistoryLimit: 5 + nodeSelector: null + postCommit: {} + resources: {} + runPolicy: SerialLatestOnly + source: + contextDir: {{ .Values.source.contextDir }} + git: + ref: {{ .Values.source.gitRef }} + uri: {{ .Values.source.gitUri }} + type: {{ .Values.source.type }} + strategy: + type: {{ .Values.strategy.type }} + dockerStrategy: + env: + - name: VITE_CODE_GEN_URL + value: https://codegen-react-ui-{{ .Release.Namespace }}.apps.{{ .Values.global.clusterDomain }}/v1/codegen + dockerfilePath: {{ .Values.strategy.dockerfilePath }} + triggers: + - type: ConfigChange diff --git a/helm-charts/codegen-openshift/react-ui/templates/codegen-react-ui-cms.yaml b/helm-charts/codegen-openshift/react-ui/templates/codegen-react-ui-cms.yaml new file mode 100644 index 000000000..fa68f3586 --- /dev/null +++ b/helm-charts/codegen-openshift/react-ui/templates/codegen-react-ui-cms.yaml @@ -0,0 +1,55 @@ +# Copyright (C) 2024 Intel Corporation +# SPDX-License-Identifier: Apache-2.0 + +--- +apiVersion: v1 +data: + default.conf: | + server { + listen 8080; + gzip on; + gzip_proxied any; + gzip_comp_level 6; + gzip_buffers 16 8k; + gzip_http_version 1.1; + gzip_types font/woff2 text/css application/javascript application/json application/font-woff application/font-tff image/gif image/png image/svg+xml application/octet-stream; + location / { + root /usr/share/nginx/html; + index index.html index.htm; + try_files $uri $uri/ /index.html =404; + location ~* \.(gif|jpe?g|png|webp|ico|svg|css|js|mp4|woff2)$ { + expires 1d; + } + } + } +kind: ConfigMap +metadata: + creationTimestamp: null + name: {{ include "codegen-react-ui.fullname" . }}-default-conf +--- +apiVersion: v1 +data: + nginx.conf: | + user nginx; + worker_processes auto; + error_log /var/log/nginx/error.log notice; + pid /tmp/nginx.pid; + events { + worker_connections 1024; + } + http { + include /etc/nginx/mime.types; + default_type application/octet-stream; + log_format main '$remote_addr - $remote_user [$time_local] "$request" ' + '$status $body_bytes_sent "$http_referer" ' + '"$http_user_agent" "$http_x_forwarded_for"'; + access_log /var/log/nginx/access.log main; + sendfile on; + #tcp_nopush on; + keepalive_timeout 65; + #gzip on; + include /etc/nginx/conf.d/*.conf; + } +kind: ConfigMap +metadata: + name: {{ include "codegen-react-ui.fullname" . }}-nginx-conf diff --git a/helm-charts/codegen-openshift/react-ui/templates/codegen-react-ui-consolelink.yaml b/helm-charts/codegen-openshift/react-ui/templates/codegen-react-ui-consolelink.yaml new file mode 100644 index 000000000..eab5320b3 --- /dev/null +++ b/helm-charts/codegen-openshift/react-ui/templates/codegen-react-ui-consolelink.yaml @@ -0,0 +1,15 @@ +# Copyright (C) 2024 Intel Corporation +# SPDX-License-Identifier: Apache-2.0 + +apiVersion: console.openshift.io/v1 +kind: ConsoleLink +metadata: + name: {{ include "codegen-react-ui.fullname" . }} +spec: + href: https://codegen-react-ui-{{ .Release.Namespace }}.apps.{{ .Values.global.clusterDomain }} + location: ApplicationMenu + # This text will appear in a box called "Launcher" under "namespace" or "project" in the web console + text: OPEA CodeGen UI + applicationMenu: + section: Open Platform For Enterprise AI + imageURL:  diff --git a/helm-charts/codegen-openshift/react-ui/templates/codegen-react-ui-deployment.yaml b/helm-charts/codegen-openshift/react-ui/templates/codegen-react-ui-deployment.yaml new file mode 100644 index 000000000..8d428a3b5 --- /dev/null +++ b/helm-charts/codegen-openshift/react-ui/templates/codegen-react-ui-deployment.yaml @@ -0,0 +1,85 @@ +# Copyright (C) 2024 Intel Corporation +# SPDX-License-Identifier: Apache-2.0 + +--- +apiVersion: apps/v1 +kind: Deployment +metadata: + name: {{ include "codegen-react-ui.fullname" . }} + labels: + {{- include "codegen-react-ui.labels" . | nindent 4 }} + annotations: + image.openshift.io/triggers: '[{"from":{"kind":"ImageStreamTag","name":"react-ui:{{ .Values.image.tag }}"},"fieldPath":"spec.template.spec.containers[?(@.name==\"{{ .Chart.Name }}\")].image"}]' +spec: + selector: + matchLabels: + {{- include "codegen-react-ui.selectorLabels" . | nindent 8 }} + template: + metadata: + {{- with .Values.podAnnotations }} + annotations: + {{- toYaml . | nindent 8 }} + {{- end }} + labels: + {{- include "codegen-react-ui.selectorLabels" . | nindent 8 }} + spec: + {{- with .Values.imagePullSecrets }} + imagePullSecrets: + {{- toYaml . | nindent 8 }} + {{- end }} + securityContext: {{- toYaml .Values.podSecurityContext | nindent 8 }} + containers: + - command: ["/bin/sh", "-c"] + args: + - | + cp -v /etc/nginx/nginx.conf /tmp/nginx.conf && \ + sed -i "s/pid .*;/pid \/tmp\/nginx.pid;/" /tmp/nginx.conf && \ + nginx -c /tmp/nginx.conf -g "daemon off;" + image: {{ .Values.image.repository }}:{{ .Values.image.tag }} + name: {{ .Chart.Name }} + ports: + - name: http + containerPort: {{ .Values.service.port }} + protocol: TCP + livenessProbe: + httpGet: + path: / + port: http + readinessProbe: + httpGet: + path: / + port: http + resources: + {{- toYaml .Values.resources | nindent 10 }} + securityContext: + {{- toYaml .Values.securityContext | nindent 10 }} + volumeMounts: + - mountPath: /var/cache/nginx/ + name: nginx-cache-dir + - mountPath: /etc/nginx/conf.d/default.conf + name: default-conf + subPath: default.conf + - mountPath: /etc/nginx/nginx.conf + name: nginx-conf + subPath: nginx.conf + volumes: + - name: nginx-cache-dir + emptyDir: {} + - name: default-conf + configMap: + name: {{ include "codegen-react-ui.fullname" . }}-default-conf + - name: nginx-conf + configMap: + name: {{ include "codegen-react-ui.fullname" . }}-nginx-conf + {{- with .Values.nodeSelector }} + nodeSelector: + {{- toYaml . | nindent 8 }} + {{- end }} + {{- with .Values.affinity }} + affinity: + {{- toYaml . | nindent 8 }} + {{- end }} + {{- with .Values.tolerations }} + tolerations: + {{- toYaml . | nindent 8 }} + {{- end }} diff --git a/helm-charts/codegen-openshift/react-ui/templates/codegen-react-ui-https-route.yaml b/helm-charts/codegen-openshift/react-ui/templates/codegen-react-ui-https-route.yaml new file mode 100644 index 000000000..09ff5e8f4 --- /dev/null +++ b/helm-charts/codegen-openshift/react-ui/templates/codegen-react-ui-https-route.yaml @@ -0,0 +1,17 @@ +# Copyright (C) 2024 Intel Corporation +# SPDX-License-Identifier: Apache-2.0 + +--- +apiVersion: route.openshift.io/v1 +kind: Route +metadata: + labels: + {{- include "codegen-react-ui.selectorLabels" . | nindent 8 }} + name: {{ include "codegen-react-ui.fullname" . }} +spec: + host: codegen-react-ui-{{ .Release.Namespace }}.apps.{{ .Values.global.clusterDomain }} + tls: + termination: edge + to: + name: {{ include "codegen-react-ui.fullname" . }} +status: {} diff --git a/helm-charts/codegen-openshift/react-ui/templates/codegen-react-ui-service.yaml b/helm-charts/codegen-openshift/react-ui/templates/codegen-react-ui-service.yaml new file mode 100644 index 000000000..c8741bf95 --- /dev/null +++ b/helm-charts/codegen-openshift/react-ui/templates/codegen-react-ui-service.yaml @@ -0,0 +1,19 @@ +# Copyright (C) 2024 Intel Corporation +# SPDX-License-Identifier: Apache-2.0 + +--- +apiVersion: v1 +kind: Service +metadata: + name: {{ include "codegen-react-ui.fullname" . }} + labels: + {{- include "codegen-react-ui.labels" . | nindent 4 }} +spec: + type: {{ .Values.service.type }} + ports: + - port: {{ .Values.service.port }} + targetPort: http + protocol: TCP + name: http + selector: + {{- include "codegen-react-ui.selectorLabels" . | nindent 4 }} diff --git a/helm-charts/codegen-openshift/react-ui/values.yaml b/helm-charts/codegen-openshift/react-ui/values.yaml new file mode 100644 index 000000000..a36413906 --- /dev/null +++ b/helm-charts/codegen-openshift/react-ui/values.yaml @@ -0,0 +1,67 @@ +# Copyright (C) 2024 Intel Corporation +# SPDX-License-Identifier: Apache-2.0 + +# Default values for codegen-react-ui. +# This is a YAML-formatted file. +# Declare variables to be passed into your templates. + +global: + clusterDomain: "insert-your-cluster-domain-here" + +image: + repository: "insert-your-image-here" + pullPolicy: IfNotPresent + # Overrides the image tag whose default is the chart appVersion. + tag: "latest" + +source: + contextDir: CodeGen/ui + gitRef: main + gitUri: "https://github.com/opea-project/GenAIExamples.git" + type: Git + +strategy: + type: Docker + dockerfilePath: docker/Dockerfile.react + +imagePullSecrets: [] +nameOverride: "" +fullnameOverride: "" + +podAnnotations: {} + +podSecurityContext: {} + # fsGroup: 2000 + +securityContext: {} + # capabilities: + # drop: + # - ALL + # readOnlyRootFilesystem: true + # runAsNonRoot: true + # runAsUser: 1000 + +service: + type: ClusterIP + port: 8080 + +resources: {} + # limits: + # cpu: 100m + # memory: 128Mi + # requests: + # cpu: 100m + # memory: 128Mi + +autoscaling: + enabled: false + minReplicas: 1 + maxReplicas: 100 + targetCPUUtilizationPercentage: 80 + # targetMemoryUtilizationPercentage: 80 + +nodeSelector: {} + +tolerations: [] + +affinity: {} diff --git a/helm-charts/codegen-openshift/templates/_helpers.tpl b/helm-charts/codegen-openshift/templates/_helpers.tpl new file mode 100644 index 000000000..7cffcef8d --- /dev/null +++ b/helm-charts/codegen-openshift/templates/_helpers.tpl @@ -0,0 +1,51 @@ +{{/* +Expand the name of the chart. +*/}} +{{- define "codegen.name" -}} +{{- default .Chart.Name .Values.nameOverride | trunc 63 | trimSuffix "-" }} +{{- end }} + +{{/* +Create a default fully qualified app name. +We truncate at 63 chars because some Kubernetes name fields are limited to this (by the DNS naming spec). +If release name contains chart name it will be used as a full name. +*/}} +{{- define "codegen.fullname" -}} +{{- if .Values.fullnameOverride }} +{{- .Values.fullnameOverride | trunc 63 | trimSuffix "-" }} +{{- else }} +{{- $name := default .Chart.Name .Values.nameOverride }} +{{- if contains $name .Release.Name }} +{{- .Release.Name | trunc 63 | trimSuffix "-" }} +{{- else }} +{{- printf "%s-%s" .Release.Name $name | trunc 63 | trimSuffix "-" }} +{{- end }} +{{- end }} +{{- end }} + +{{/* +Create chart name and version as used by the chart label. +*/}} +{{- define "codegen.chart" -}} +{{- printf "%s-%s" .Chart.Name .Chart.Version | replace "+" "_" | trunc 63 | trimSuffix "-" }} +{{- end }} + +{{/* +Common labels +*/}} +{{- define "codegen.labels" -}} +helm.sh/chart: {{ include "codegen.chart" . }} +{{ include "codegen.selectorLabels" . }} +{{- if .Chart.AppVersion }} +app.kubernetes.io/version: {{ .Chart.AppVersion | quote }} +{{- end }} +app.kubernetes.io/managed-by: {{ .Release.Service }} +{{- end }} + +{{/* +Selector labels +*/}} +{{- define "codegen.selectorLabels" -}} +app.kubernetes.io/name: {{ include "codegen.name" . }} +app.kubernetes.io/instance: {{ .Release.Name }} +{{- end }} diff --git a/helm-charts/codegen-openshift/templates/buildconfig.yaml b/helm-charts/codegen-openshift/templates/buildconfig.yaml new file mode 100644 index 000000000..dc9790cd3 --- /dev/null +++ b/helm-charts/codegen-openshift/templates/buildconfig.yaml @@ -0,0 +1,31 @@ +# Copyright (C) 2024 Intel Corporation +# SPDX-License-Identifier: Apache-2.0 + +kind: BuildConfig +apiVersion: build.openshift.io/v1 +metadata: + name: {{ include "codegen.fullname" . }} + namespace: {{ .Release.Namespace }} +spec: + output: + to: + kind: "ImageStreamTag" + name: "codegen:latest" + failedBuildsHistoryLimit: 5 + successfulBuildsHistoryLimit: 5 + nodeSelector: null + postCommit: {} + resources: {} + runPolicy: SerialLatestOnly + source: + contextDir: {{ .Values.source.contextDir }} + git: + ref: {{ .Values.source.gitRef }} + uri: {{ .Values.source.gitUri }} + type: {{ .Values.source.type }} + strategy: + type: {{ .Values.strategy.type }} + dockerStrategy: + dockerfilePath: {{ .Values.strategy.dockerfilePath }} + triggers: + - type: ConfigChange diff --git a/helm-charts/codegen-openshift/templates/deployment.yaml b/helm-charts/codegen-openshift/templates/deployment.yaml new file mode 100644 index 000000000..e4e44ad9c --- /dev/null +++ b/helm-charts/codegen-openshift/templates/deployment.yaml @@ -0,0 +1,33 @@ +# Copyright (C) 2024 Intel Corporation +# SPDX-License-Identifier: Apache-2.0 + +apiVersion: apps/v1 +kind: Deployment +metadata: + name: {{ include "codegen.fullname" . }} + labels: + {{- include "codegen.labels" . | nindent 4 }} + app: {{ include "codegen.fullname" . }} +spec: + replicas: {{ .Values.replicaCount }} + selector: + matchLabels: + {{- include "codegen.selectorLabels" . | nindent 6 }} + app: {{ include "codegen.fullname" . }} + template: + metadata: + labels: + {{- include "codegen.selectorLabels" . | nindent 8 }} + app: {{ include "codegen.fullname" . }} + spec: + containers: + - name: {{ .Release.Name }} + env: + - name: LLM_SERVICE_HOST_IP + value: {{ .Release.Name }}-llm-uservice + image: "{{ .Values.image.repository }}:{{ .Values.image.tag }}" + imagePullPolicy: {{ .Values.image.pullPolicy }} + ports: + - name: codegen + containerPort: {{ .Values.port }} + protocol: TCP diff --git a/helm-charts/codegen-openshift/templates/hf_token_secret.yaml b/helm-charts/codegen-openshift/templates/hf_token_secret.yaml new file mode 100644 index 000000000..7ce2d2021 --- /dev/null +++ b/helm-charts/codegen-openshift/templates/hf_token_secret.yaml @@ -0,0 +1,10 @@ +# Copyright (C) 2024 Intel Corporation +# SPDX-License-Identifier: Apache-2.0 + +apiVersion: v1 +kind: Secret +metadata: + name: hf-token +type: Opaque +stringData: + HUGGING_FACE_HUB_TOKEN: {{ .Values.global.huggingfacehubApiToken }} diff --git a/helm-charts/codegen-openshift/templates/imagestream.yaml b/helm-charts/codegen-openshift/templates/imagestream.yaml new file mode 100644 index 000000000..14f1f6b0d --- /dev/null +++ b/helm-charts/codegen-openshift/templates/imagestream.yaml @@ -0,0 +1,12 @@ +# Copyright (C) 2024 Intel Corporation +# SPDX-License-Identifier: Apache-2.0 + +--- +apiVersion: image.openshift.io/v1 +kind: ImageStream +metadata: + name: {{ include "codegen.fullname" . }} + namespace: {{ .Release.Namespace }} +spec: + lookupPolicy: + local: true diff --git a/helm-charts/codegen-openshift/templates/route.yaml b/helm-charts/codegen-openshift/templates/route.yaml new file mode 100644 index 000000000..7e8fe0fb6 --- /dev/null +++ b/helm-charts/codegen-openshift/templates/route.yaml @@ -0,0 +1,22 @@ +# Copyright (C) 2024 Intel Corporation +# SPDX-License-Identifier: Apache-2.0 + +apiVersion: route.openshift.io/v1 +kind: Route +metadata: + labels: + {{- include "codegen.labels" . | nindent 4 }} + app: {{ include "codegen.fullname" . }} + name: {{ .Release.Name }}-tls +spec: + host: codegen-react-ui-{{ .Release.Namespace }}.apps.{{ .Values.global.clusterDomain }} + path: /v1/codegen + port: + targetPort: codegen + tls: + termination: edge + to: + kind: Service + name: codegen + weight: 100 + wildcardPolicy: None diff --git a/helm-charts/codegen-openshift/templates/service.yaml b/helm-charts/codegen-openshift/templates/service.yaml new file mode 100644 index 000000000..104e00b95 --- /dev/null +++ b/helm-charts/codegen-openshift/templates/service.yaml @@ -0,0 +1,19 @@ +# Copyright (C) 2024 Intel Corporation +# SPDX-License-Identifier: Apache-2.0 + +apiVersion: v1 +kind: Service +metadata: + name: {{ include "codegen.fullname" . }} + labels: + {{- include "codegen.labels" . | nindent 4 }} +spec: + type: {{ .Values.service.type }} + ports: + - port: {{ .Values.service.port }} + targetPort: {{ .Values.port }} + protocol: TCP + name: codegen + selector: + {{- include "codegen.selectorLabels" . | nindent 4 }} + app: {{ include "codegen.fullname" . }} diff --git a/helm-charts/codegen-openshift/tgi/Chart.yaml b/helm-charts/codegen-openshift/tgi/Chart.yaml new file mode 100644 index 000000000..f3856fbec --- /dev/null +++ b/helm-charts/codegen-openshift/tgi/Chart.yaml @@ -0,0 +1,8 @@ +# Copyright (C) 2024 Intel Corporation +# SPDX-License-Identifier: Apache-2.0 + +apiVersion: v2 +name: tgi +description: A Helm chart for deploying tgi on Red Hat OpenShift +type: application +version: 1.0.0 diff --git a/helm-charts/codegen-openshift/tgi/gaudi-values.yaml b/helm-charts/codegen-openshift/tgi/gaudi-values.yaml new file mode 100644 index 000000000..97d3c6a6e --- /dev/null +++ b/helm-charts/codegen-openshift/tgi/gaudi-values.yaml @@ -0,0 +1,42 @@ +# Copyright (C) 2024 Intel Corporation +# SPDX-License-Identifier: Apache-2.0 + +model: + id: meta-llama/CodeLlama-7b-hf + +replicaCount: 1 + +port: 8080 + +image: + repository: ghcr.io/huggingface/tgi-gaudi + tag: 2.0.4 + pullPolicy: IfNotPresent + +service: + type: ClusterIP + +resources: + limits: + habana.ai/gaudi: 1 + requests: + habana.ai/gaudi: 1 + +args: +- "--max-input-length" +- "1024" +- "--max-total-tokens" +- "2048" + +volumeMounts: + - mountPath: /data + name: model-volume + - mountPath: /var/log/habana_logs + name: logs-volume + +volumes: +- emptyDir: + sizeLimit: 50Gi + name: model-volume +- emptyDir: + name: logs-volume diff --git a/helm-charts/codegen-openshift/tgi/templates/_helpers.tpl b/helm-charts/codegen-openshift/tgi/templates/_helpers.tpl new file mode 100644 index 000000000..b672e8309 --- /dev/null +++ b/helm-charts/codegen-openshift/tgi/templates/_helpers.tpl @@ -0,0 +1,69 @@ +{{/* +Expand the name of the chart. +*/}} +{{- define "tgi.name" -}} +{{- default .Chart.Name .Values.nameOverride | trunc 63 | trimSuffix "-" }} +{{- end }} + +{{/* +Create a default fully qualified app name. +We truncate at 63 chars because some Kubernetes name fields are limited to this (by the DNS naming spec). +If release name contains chart name it will be used as a full name. +*/}} +{{- define "tgi.fullname" -}} +{{- if .Values.fullnameOverride }} +{{- .Values.fullnameOverride | trunc 63 | trimSuffix "-" }} +{{- else }} +{{- $name := default .Chart.Name .Values.nameOverride }} +{{- if contains $name .Release.Name }} +{{- .Release.Name | trunc 63 | trimSuffix "-" }} +{{- else }} +{{- printf "%s-%s" .Release.Name $name | trunc 63 | trimSuffix "-" }} +{{- end }} +{{- end }} +{{- end }} + +{{/* +Create chart name and version as used by the chart label. +*/}} +{{- define "tgi.chart" -}} +{{- printf "%s-%s" .Chart.Name .Chart.Version | replace "+" "_" | trunc 63 | trimSuffix "-" }} +{{- end }} + +{{/* +Convert chart name to a string suitable as metric prefix +*/}} +{{- define "tgi.metricPrefix" -}} +{{- include "tgi.fullname" . | replace "-" "_" | regexFind "[a-zA-Z_:][a-zA-Z0-9_:]*" }} +{{- end }} + +{{/* +Common labels +*/}} +{{- define "tgi.labels" -}} +helm.sh/chart: {{ include "tgi.chart" . }} +{{ include "tgi.selectorLabels" . }} +{{- if .Chart.AppVersion }} +app.kubernetes.io/version: {{ .Chart.AppVersion | quote }} +{{- end }} +app.kubernetes.io/managed-by: {{ .Release.Service }} +{{- end }} + +{{/* +Selector labels +*/}} +{{- define "tgi.selectorLabels" -}} +app.kubernetes.io/name: {{ include "tgi.name" . }} +app.kubernetes.io/instance: {{ .Release.Name }} +{{- end }} + +{{/* +Create the name of the service account to use +*/}} +{{- define "tgi.serviceAccountName" -}} +{{- if .Values.serviceAccount.create }} +{{- default (include "tgi.fullname" .) .Values.serviceAccount.name }} +{{- else }} +{{- default "default" .Values.serviceAccount.name }} +{{- end }} +{{- end }} diff --git a/helm-charts/codegen-openshift/tgi/templates/deployment.yaml b/helm-charts/codegen-openshift/tgi/templates/deployment.yaml new file mode 100644 index 000000000..4d93f991b --- /dev/null +++ b/helm-charts/codegen-openshift/tgi/templates/deployment.yaml @@ -0,0 +1,47 @@ +# Copyright (C) 2024 Intel Corporation +# SPDX-License-Identifier: Apache-2.0 + +apiVersion: apps/v1 +kind: Deployment +metadata: + name: {{ include "tgi.fullname" . }} + labels: + {{- include "tgi.labels" . | nindent 4 }} +spec: + replicas: {{ .Values.replicaCount }} + selector: + matchLabels: + {{- include "tgi.selectorLabels" . | nindent 6 }} + template: + metadata: + labels: + {{- include "tgi.selectorLabels" . | nindent 8 }} + spec: + securityContext: {} + containers: + - name: {{ .Chart.Name }} + env: + - name: MODEL_ID + value: "{{ .Values.model.id }}" + - name: PORT + value: {{ .Values.port | quote }} + - name: NUMBA_CACHE_DIR + value: /data + - name: HF_TOKEN + valueFrom: + secretKeyRef: + key: HUGGING_FACE_HUB_TOKEN + name: hf-token + securityContext: {} + image: "{{ .Values.image.repository }}:{{ .Values.image.tag }}" + imagePullPolicy: {{ .Values.image.pullPolicy }} + args: + {{- toYaml .Values.args | nindent 12 }} + volumeMounts: {{- toYaml .Values.volumeMounts | nindent 10 }} + ports: + - name: http + containerPort: {{ .Values.port }} + protocol: TCP + resources: + {{- toYaml .Values.resources | nindent 12 }} + volumes: {{- toYaml .Values.volumes | nindent 6 }} diff --git a/helm-charts/codegen-openshift/tgi/templates/service.yaml b/helm-charts/codegen-openshift/tgi/templates/service.yaml new file mode 100644 index 000000000..7ed5529a9 --- /dev/null +++ b/helm-charts/codegen-openshift/tgi/templates/service.yaml @@ -0,0 +1,18 @@ +# Copyright (C) 2024 Intel Corporation +# SPDX-License-Identifier: Apache-2.0 + +apiVersion: v1 +kind: Service +metadata: + name: {{ include "tgi.fullname" . }} + labels: + {{- include "tgi.labels" . | nindent 4 }} +spec: + type: {{ .Values.service.type }} + ports: + - port: 8080 + targetPort: {{ .Values.port }} + protocol: TCP + name: tgi + selector: + {{- include "tgi.selectorLabels" . | nindent 4 }} diff --git a/helm-charts/codegen-openshift/tgi/values.yaml b/helm-charts/codegen-openshift/tgi/values.yaml new file mode 100644 index 000000000..45f8ad1ee --- /dev/null +++ b/helm-charts/codegen-openshift/tgi/values.yaml @@ -0,0 +1,30 @@ +# Copyright (C) 2024 Intel Corporation +# SPDX-License-Identifier: Apache-2.0 + +model: + id: ise-uiuc/Magicoder-S-DS-6.7B + +replicaCount: 1 + +port: 8080 + +image: + repository: ghcr.io/huggingface/text-generation-inference + tag: 2.1.0 + pullPolicy: IfNotPresent + +service: + type: ClusterIP + +resources: {} + +args: [] + +volumeMounts: + - mountPath: /data + name: model-volume + +volumes: + - emptyDir: + sizeLimit: 50Gi + name: model-volume diff --git a/helm-charts/codegen-openshift/values.yaml b/helm-charts/codegen-openshift/values.yaml new file mode 100644 index 000000000..de00d09b1 --- /dev/null +++ b/helm-charts/codegen-openshift/values.yaml @@ -0,0 +1,50 @@ +# Copyright (C) 2024 Intel Corporation +# SPDX-License-Identifier: Apache-2.0 + +global: + clusterDomain: "insert-your-cluster-domain-here" + huggingfacehubApiToken: "insert-your-huggingface-token-here" + +image: + repository: "insert-your-image-here" + tag: "latest" + pullPolicy: IfNotPresent + +source: + contextDir: CodeGen + gitRef: main + gitUri: "https://github.com/opea-project/GenAIExamples.git" + type: Git + +strategy: + type: Docker + dockerfilePath: Dockerfile + +replicaCount: 1 + +port: 7778 + +service: + type: ClusterIP + port: 7778 + +tgi: + model: + id: ise-uiuc/Magicoder-S-DS-6.7B + replicaCount: 1 + port: 8080 + image: + repository: ghcr.io/huggingface/text-generation-inference + tag: 2.1.0 + pullPolicy: IfNotPresent + service: + type: ClusterIP + resources: {} + args: [] + volumeMounts: + - mountPath: /data + name: model-volume + volumes: + - emptyDir: + sizeLimit: 50Gi + name: model-volume