Skip to content

Commit

Permalink
Merge pull request #795 from k8up-io/chart-job-stuck
Browse files Browse the repository at this point in the history
Remove K8upJobStuck alert
  • Loading branch information
ccremer authored Jan 12, 2023
2 parents 847cc39 + c702b82 commit fad59b6
Show file tree
Hide file tree
Showing 4 changed files with 3 additions and 18 deletions.
2 changes: 1 addition & 1 deletion charts/k8up/Chart.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -6,7 +6,7 @@ keywords:
- backup
- operator
- restic
version: 4.0.1
version: 4.0.2
sources:
- https://github.com/k8up-io/k8up
maintainers:
Expand Down
4 changes: 2 additions & 2 deletions charts/k8up/README.md
Original file line number Diff line number Diff line change
@@ -1,6 +1,6 @@
# k8up

![Version: 4.0.1](https://img.shields.io/badge/Version-4.0.1-informational?style=flat-square)
![Version: 4.0.2](https://img.shields.io/badge/Version-4.0.2-informational?style=flat-square)

Kubernetes and OpenShift Backup Operator based on restic

Expand All @@ -13,7 +13,7 @@ helm repo add k8up-io https://k8up-io.github.io/k8up
helm install k8up k8up-io/k8up
```
```bash
kubectl apply -f https://github.com/k8up-io/k8up/releases/download/k8up-4.0.1/k8up-crd.yaml
kubectl apply -f https://github.com/k8up-io/k8up/releases/download/k8up-4.0.2/k8up-crd.yaml
```

<!---
Expand Down
8 changes: 0 additions & 8 deletions charts/k8up/templates/prometheus/prometheusrule.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -31,14 +31,6 @@ spec:
annotations:
summary: "No K8up jobs were run in {{ "{{ $labels.namespace }}" }} within the last 24 hours. Check the operator, there might be a deadlock"
runbook_url: https://k8up.io/k8up/explanations/runbooks/K8upBackupNotRunning.html
- alert: K8upJobStuck
expr: k8up_jobs_queued_gauge{jobType="backup"} > 0 and on(namespace) k8up_schedules_gauge > 0
for: 24h
labels:
severity: critical
annotations:
summary: "K8up jobs are stuck in {{ "{{ $labels.namespace }}" }} for the last 24 hours."
runbook_url: https://k8up.io/k8up/explanations/runbooks/K8upJobStuck.html
{{- range .Values.metrics.prometheusRule.jobFailedRulesFor }}
- alert: K8up{{- . | title -}}Failed
expr: (sum(kube_job_status_failed) by(job_name, namespace) * on(job_name, namespace) group_right() kube_job_labels{label_k8up_syn_tools_type="{{- . -}}"}) > 0
Expand Down
7 changes: 0 additions & 7 deletions config/samples/prometheus/rules.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -23,10 +23,3 @@ groups:
severity: critical
annotations:
summary: "No K8up jobs were run in {{ $labels.namespace }} within the last 24 hours. Check the operator, there might be a deadlock"
- alert: K8upJobStuck
expr: k8up_jobs_queued_gauge{jobType="backup"} > 0 and on(namespace) k8up_schedules_gauge > 0
for: 24h
labels:
severity: critical
annotations:
summary: "K8up jobs are stuck in {{ $labels.namespace }} for the last 24 hours."

0 comments on commit fad59b6

Please sign in to comment.