Commit 1c514ef

saad-ali authored and Erin A Boyd committed
Deprecate Kubernetes on-call rotations
1 parent 4cb1ba4 commit 1c514ef

5 files changed: +60 -195 lines changed

contributors/devel/issues.md

+51-5
@@ -40,11 +40,57 @@ and this document will cover the basic ones.
4040

4141
Sometimes users ask for support requests in issues; these are usually requests
4242
from people who need help configuring some aspect of Kubernetes. These should be
43-
directed to our [support structures](https://github.com/kubernetes/community/blob/master/contributors/devel/on-call-user-support.md) and then closed. Also, if the issue is clearly abandoned or in
44-
the wrong place, it should be closed. Keep in mind that only issue reporter,
45-
assignees and component organization members can close issue. If you do not
46-
have such privilege, just comment your findings. Otherwise, first `/assign`
47-
issue to yourself and then `/close`.
43+
directed to our support structures (see below) and then closed. Also, if the issue
44+
is clearly abandoned or in the wrong place, it should be closed. Keep in mind that
45+
only the issue reporter, assignees, and component organization members can close an issue.
46+
If you do not have that privilege, just comment with your findings. Otherwise, first
47+
`/assign` the issue to yourself and then `/close` it.
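
The closing rule described above can be sketched as a small decision function. This is only an illustration: the `Issue` type, the `triage_comments` function, and its parameters are hypothetical names, not part of any Kubernetes tooling. Only the `/assign` and `/close` commands come from the text.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Issue:
    """Hypothetical, minimal view of an issue for triage purposes."""
    reporter: str
    assignees: frozenset = frozenset()


def triage_comments(issue, user, is_org_member, findings):
    """Return the comments a triager would post on a support request.

    Only the reporter, assignees, and component organization members
    can close an issue; everyone else just comments their findings.
    """
    can_close = (
        user == issue.reporter
        or user in issue.assignees
        or is_org_member
    )
    if not can_close:
        return [findings]
    # With close privileges: assign the issue to yourself first, then close it.
    return [findings, "/assign", "/close"]
```

For example, a triager who is neither the reporter, an assignee, nor an organization member would get back only the findings comment.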
48+
49+
### Support Structures
50+
51+
Support requests should be directed to the following:
52+
53+
* [User documentation](https://kubernetes.io/docs/) and
54+
[troubleshooting guide](https://kubernetes.io/docs/tasks/debug-application-cluster/troubleshooting/)
55+
56+
* [Stack Overflow](http://stackoverflow.com/questions/tagged/kubernetes) and
57+
[ServerFault](http://serverfault.com/questions/tagged/google-kubernetes)
58+
59+
* [Slack](https://kubernetes.slack.com) ([registration](http://slack.k8s.io))
60+
* Check out the [Slack Archive](http://kubernetes.slackarchive.io/) first.
61+
62+
* [Email/Groups](https://groups.google.com/forum/#!forum/kubernetes-users)
63+
64+
### User support response example
65+
66+
If you see support questions on kubernetes-dev@googlegroups.com or issues asking for
67+
support, try to redirect them to Stack Overflow. Example response:
68+
69+
```code
70+
Please re-post your question to [Stack Overflow]
71+
(http://stackoverflow.com/questions/tagged/kubernetes).
72+
73+
We are trying to consolidate the channels to which questions for help/support
74+
are posted so that we can improve our efficiency in responding to your requests,
75+
and to make it easier for you to find answers to frequently asked questions and
76+
how to address common use cases.
77+
78+
We regularly see messages posted in multiple forums, with the full response
79+
thread only in one place or, worse, spread across multiple forums. Also, the
80+
large volume of support issues on GitHub is making it difficult for us to use
81+
issues to identify real bugs.
82+
83+
Members of the Kubernetes community use Stack Overflow to field support
84+
requests. Before posting a new question, please search Stack Overflow for answers
85+
to similar questions, and also familiarize yourself with:
86+
87+
* [user documentation](http://kubernetes.io/docs/)
88+
* [troubleshooting guide](https://kubernetes.io/docs/tasks/debug-application-cluster/troubleshooting/)
89+
90+
Again, thanks for using Kubernetes.
91+
92+
The Kubernetes Team
93+
```
4894

4995
## Find the right SIG(s)
5096
Components are divided among [Special Interest Groups (SIGs)](https://github.com/kubernetes/community/blob/master/sig-list.md). Find a proper SIG for the ownership of the issue using the bot:

contributors/devel/on-call-build-cop.md

-50
This file was deleted.

contributors/devel/on-call-rotations.md

-43
This file was deleted.

contributors/devel/on-call-user-support.md

-83
This file was deleted.

contributors/devel/release/testing.md

+9-14
@@ -81,10 +81,11 @@ When a test is failing, it must be quickly escalated to the correct owner. Test
8181
are left to fail for days or weeks become toxic and create noise in the system health
8282
metrics.
8383

84-
The [build cop] is expected to ensure that the release blocking tests remain
84+
Each SIG is expected to ensure that the release-blocking tests that belong to it remain
8585
perpetually healthy by monitoring the test grid and escalating failures.
8686

87-
On test failures, the build cop will follow the [sig escalation](#sig-test-escalation) path.
87+
Failing tests that are not being addressed can be escalated by following the
88+
[sig escalation](#sig-test-escalation) path.
8889

8990
*Tests without a responsive owner should be assigned a new owner or disabled.*
9091

@@ -132,14 +133,11 @@ urgent than persistent failures, but still expected to have a root cause investi
132133

133134
## Broken test workflow
134135

135-
SIGs are expected to proactively monitor and maintain their tests. The build cop will also
136-
monitor the health of the entire project, but is intended as backup who will escalate
137-
failures to the owning SIGs.
136+
SIGs are expected to proactively monitor and maintain their tests.
138137

139138
- File an issue for the broken test so it can be referenced and discovered
140139
- Set the following labels: `priority/failing-test`, `sig/*`
141140
- Assign the issue to whoever is working on it
142-
- Mention the current build cop (TODO: publish this somewhere)
143141
- Root cause analysis of the test failure is performed by the owner
144142
- **Note**: The owning SIG for a test can reassign ownership of a resolution to another SIG only after getting
145143
approval from that SIG
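
As a rough sketch, the filing steps listed above (file an issue, set the labels, assign it) could be expressed as an issue payload. The function name, title format, and body text are hypothetical. The keys `title`, `body`, `labels`, and `assignees` are chosen to mirror the fields of GitHub's create-issue request, and no request is actually sent here.

```python
def failing_test_issue(test_name, sig, assignee):
    """Build a payload for an issue tracking a broken test.

    The labels follow the workflow text: `priority/failing-test` plus
    a `sig/*` label for the owning SIG. Title and body are illustrative.
    """
    if not sig:
        raise ValueError("an owning SIG is required for the sig/* label")
    return {
        "title": f"Failing test: {test_name}",
        "body": f"{test_name} is failing; root cause analysis is pending.",
        "labels": ["priority/failing-test", f"sig/{sig}"],
        "assignees": [assignee],
    }
```

Keeping the payload construction separate from any API call makes the labeling convention easy to check on its own.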
@@ -152,13 +150,11 @@ failures to the owning SIGs.
152150

153151
## SIG test escalation
154152

155-
The build cop should monitor the overall test health of the project, and ensure ownership for any given
156-
test does not fall through the cracks. When the build cop observer a test failure, they should first
157-
search to see if an issue has been filed already, and if not (optionally file an issue and) escalate to the SIG
158-
escalation point. If the escalation point is unresponsive within a day, the build cop should escalate to the SIG
159-
googlegroup and/or slack channel, mentioning the SIG leads. If escalation through the SIG googlegroup,
160-
slack channel and SIG leads is unsuccessful, the build cop should escalate to SIG release through the
161-
googlegroup and slack - mentioning the SIG leads.
153+
As a Kubernetes developer, if you observe a test failure, first search to see if an issue has been filed already,
154+
and if not (optionally file an issue and) escalate to the SIG escalation point.
155+
If the escalation point is unresponsive within a day, escalate to the SIG googlegroup and/or slack channel,
156+
mentioning the SIG leads. If escalation through the SIG googlegroup, slack channel and SIG leads is unsuccessful,
157+
escalate to SIG release through the googlegroup and slack, mentioning the SIG leads.
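
The escalation path above is an ordered sequence of contacts, which can be sketched as follows. The names `ESCALATION_PATH` and `next_escalation` are hypothetical, and counting unanswered escalations is an assumed way of tracking progress. The text only gives a one-day wait for the first step.

```python
# Ordered escalation targets, taken from the paragraph above.
ESCALATION_PATH = (
    "SIG escalation point",
    "SIG googlegroup and/or slack channel, mentioning the SIG leads",
    "SIG release googlegroup and slack, mentioning the SIG leads",
)


def next_escalation(unanswered_escalations):
    """Return whom to contact next, given how many escalations went unanswered.

    0 means nothing has been escalated yet. Once the path is exhausted,
    the last target (SIG release) is returned again.
    """
    if unanswered_escalations < 0:
        raise ValueError("unanswered_escalations must be non-negative")
    index = min(unanswered_escalations, len(ESCALATION_PATH) - 1)
    return ESCALATION_PATH[index]
```

Before starting at the first step, the text still asks you to search for an existing issue, which this sketch does not model.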
162158

163159
The SIG escalation points should be bootstrapped from the [community sig list].
164160

@@ -172,7 +168,6 @@ The SIG escalation points should be bootstrapped from the [community sig list].
172168
[community sig list]: https://github.com/kubernetes/community/blob/master/sig-list.md
173169
[triage tool]: https://storage.googleapis.com/k8s-gubernator/triage/index.html
174170
[test grid]: https://k8s-testgrid.appspot.com/
175-
[build cop]: https://github.com/kubernetes/community/blob/master/contributors/devel/on-call-build-cop.md
176171
[release-master-blocking]: https://k8s-testgrid.appspot.com/release-master-blocking#Summary
177172
[1.7-master-upgrade]: https://k8s-testgrid.appspot.com/1.7-master-upgrade#Summary
178173
[1.6-master-upgrade]: https://k8s-testgrid.appspot.com/1.6-master-upgrade#Summary
