Skip to content

Commit

Permalink
Add Azimuth UI related content
Browse files Browse the repository at this point in the history
  • Loading branch information
Scott Davidson committed Oct 25, 2023
1 parent 06123ef commit 26d13c6
Show file tree
Hide file tree
Showing 5 changed files with 29 additions and 3 deletions.
5 changes: 5 additions & 0 deletions Chart.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -22,3 +22,8 @@ version: 0.1.0
# follow Semantic Versioning. They should reflect the version the application is using.
# It is recommended to use it with quotes.
appVersion: "1.16.0"

icon: https://huggingface.co/datasets/huggingface/brand-assets/resolve/main/hf-logo.svg

annotations:
azimuth.stackhpc.com/label: HuggingFace LLM
1 change: 1 addition & 0 deletions templates/NOTES.txt
Original file line number Diff line number Diff line change
@@ -0,0 +1 @@
The LLM app allows users to deploy machine learning models using [text-generation-inference](https://github.com/huggingface/text-generation-inference) as a model serving backend and [gradio](https://github.com/gradio-app/gradio) as a web interface.
8 changes: 6 additions & 2 deletions templates/api/zenith-reservation.yml
Original file line number Diff line number Diff line change
Expand Up @@ -3,8 +3,12 @@ apiVersion: zenith.stackhpc.com/v1alpha1
kind: Reservation
metadata:
name: {{ .Release.Name }}-api
labels:
{{- include "azimuth-llm.labels" . | nindent 4 }}
annotations:
azimuth.stackhpc.com/service-label: {{ quote .Values.api.service.zenith.label }}
azimuth.stackhpc.com/service-icon-url: {{ .Values.api.service.zenith.iconUrl }}
{{- with .Values.api.service.zenith.description }}
azimuth.stackhpc.com/service-description: {{ quote . }}
{{- end }}
spec:
credentialSecretName: {{ .Release.Name }}-api-zenith-credential
{{- end -}}
6 changes: 6 additions & 0 deletions templates/ui/ui-zenith-reservation.yml
Original file line number Diff line number Diff line change
Expand Up @@ -5,6 +5,12 @@ metadata:
name: {{ .Release.Name }}-ui
labels:
{{- include "azimuth-llm.labels" . | nindent 4 }}
annotations:
azimuth.stackhpc.com/service-label: {{ quote .Values.ui.service.zenith.label }}
azimuth.stackhpc.com/service-icon-url: {{ .Values.ui.service.zenith.iconUrl }}
{{- with .Values.ui.service.zenith.description }}
azimuth.stackhpc.com/service-description: {{ quote . }}
{{- end }}
spec:
credentialSecretName: {{ .Release.Name }}-ui-zenith-credential
{{- end -}}
12 changes: 11 additions & 1 deletion values.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -30,14 +30,20 @@ api:
# Container image config
image:
repository: ghcr.io/huggingface/text-generation-inference
version: 1.1.0
# NOTE: versions > 0.9.4 are no longer Apache licensed :(
version: 0.9.4
# Service config
service:
name: text-generation-inference
type: ClusterIP
zenith:
enabled: true
skipAuth: false
label: Inference API
iconUrl:
description: |
The raw inference API endpoints for the deployed LLM.
Public API docs are available [here](https://huggingface.github.io/text-generation-inference/#/Text%20Generation%20Inference)
# Config
storage:
# Number of gpus to requests for each api pod instance
Expand Down Expand Up @@ -67,6 +73,10 @@ ui:
zenith:
enabled: true
skipAuth: false
label: Web Interface
iconUrl: https://raw.githubusercontent.com/gradio-app/gradio/5524e590577769b0444a5332b8d444aafb0c5c12/js/app/public/static/img/logo.svg
description: |
A web-based user inferface for interacting with the deployed LLM.
# The update strategy to use for the deployment
updateStrategy:
rollingUpdate:
Expand Down

0 comments on commit 26d13c6

Please sign in to comment.