
ADR for reviewer and committer responsibilities #44097

Draft: wants to merge 1 commit into main

Conversation

emmanuelbernard (Member)

We had a discussion on reviewer responsibilities on Zulip.
In that thread, I captured the responsibilities as a summary.

I expanded that work and formalized it in an ADR.
I also expanded the conversation on the impact of the time to merge a PR and how we could improve it.

This is a draft as we want to further discuss this collectively.

@quarkus-bot quarkus-bot bot added the area/adr label Oct 25, 2024
* Being a perfect solution is _not_ a goal; being _better_ than today is.
* Define ways to measure and improve the time to review.

== Scenarios (optional)
Member

Section to be dropped.


== Considered options

The main alternative is the status quo, which roughly is: Guillaume knows what we are intending to do, and the people who talk with Guillaume regularly are sufficiently aware.
Member

That's not true. While Guillaume provides extensive reviews in many areas, he is not the only one overseeing the whole thing.

Member Author

This sentence is not talking about the reviews themselves; it is talking about the process for the reviews.


There are three levels of reviewing.

. Anyone can review a Pull Request (PR) and give constructive feedback on different areas (code, documentation, feature experience, security, etc.).
Member

Is it intended to be a *? Or a 1.?

You used * above.

Member Author

Yes, because it's about three levels, I felt numbering was helpful. I can change it if that bothers too many people.

More eyes mean higher quality.
. A PR is reviewed by a _binding reviewer_ who is fluent in the area being touched: this person is responsible for signing off, in blood, on the usefulness and quality of the code and the soundness of the solution.
That’s the review of the expert and the most critical one for quality.
This review is mandatory.
Member

PRs are associated with areas (when they are not, they get the need-triage label, it's quickly identified, and the areas are set manually). Each area has a set (sometimes a set of one) of binding reviewers. They are listed in https://github.com/quarkusio/quarkus/blob/main/.github/quarkus-github-bot.yml.

.. Unfortunately, we do not always have (designated or effective) experts in all areas, so it's a constant work in progress, which might make this step optional in some situations.
Member

It is rarely the case - we try to avoid that as much as possible. When this happens, we require more extensive reviews from multiple reviewers (Guillaume, Georgios, George, Alexey, myself, Max...)

Member Author

It might be worth talking with the team; there seems to be a perception/reality disconnect for some.

And how do we use it as a mechanism to improve and to know where to get help?
The faster the flow, the higher an organisation measures in effectiveness, happiness, etc.

==== Proposition 1
Member

If there is only one, let's drop the 1.

Member Author

It's really the meat of the discussion; we could get more proposals from others. After the conversation, I anticipate that "TODO Open questions to resolve before closing" will be folded into the other sections.



Define and measure (via the GitHub APIs) an SLO for the time to push or drop/withdraw a PR.
Member

How would you define that magic number? As explained above, raising awareness is okay (and we have the lottery for this), but taking action is NOT.

And maybe this issue does not really exist? Do you have examples of PRs that took months to merge? It is not rare for a few weeks to be required, because Quarkus has a complex (and pretty large) code base. Some PRs have been discussed for multiple months, but that's a different issue.

Member Author

Yes, look above; your comment made me run some GitHub queries to try to get some numbers around staleness.
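(For illustration, a minimal sketch of the kind of staleness query mentioned here, against the GitHub Search API. The 21-day cutoff and the exclusion of drafts are assumptions for the example, not the queries that were actually run.)

```python
# Minimal sketch: count open, non-draft PRs with no update in the last N days.
# The repository is the one discussed in this thread; the 21-day cutoff is an
# assumption for illustration, not a value the team agreed on.
import datetime

import requests

REPO = "quarkusio/quarkus"
STALE_AFTER_DAYS = 21  # hypothetical staleness cutoff

cutoff = (datetime.date.today() - datetime.timedelta(days=STALE_AFTER_DAYS)).isoformat()
query = f"repo:{REPO} is:pr is:open draft:false updated:<{cutoff}"

resp = requests.get(
    "https://api.github.com/search/issues",
    params={"q": query, "per_page": 1},
    headers={"Accept": "application/vnd.github+json"},
    timeout=30,
)
resp.raise_for_status()
print(f"Open PRs not updated in {STALE_AFTER_DAYS} days: {resp.json()['total_count']}")
```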

This is not a perfect proxy, as some PRs do demand multi-week conversations, but measuring and dashboarding is our first step to knowing in which areas we have a problem.
Member

Are you volunteering? :-)


We could define an intended team SLO (say three weeks to start, but that's already quite long) and know which areas chronically fall behind.
Member

Three weeks is not that long.

There is also another issue. Let's imagine two PRs are opened at the same time. One gets expedited quickly and the other one stalls. There are multiple reasons for this:

  • we often trust regular contributors more - that isn't very objective, but that's the reality
  • some PRs are more complex to review than others: changing the native compilation configuration can be pretty touchy and requires the reviewer to test many possibilities
  • some PRs require expertise that even binding reviewers do not have (like Windows :-)) - cross-cutting PRs can also be tricky (because they require multiple binding reviews, and technical friction may happen)

Member Author

Yes, that's where it's important to have the conversation.

  • what are the useful SLOs (e.g. a 90th-percentile target vs an absolute number)?
  • is the latency between the last user comment and the next committer or binding reviewer comment a useful measure (e.g. 95% within 7 days)?

And even before defining the SLO, defining and measuring the SLI could probably be useful as an indicator.
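
(To make the SLI idea concrete, a rough sketch of computing a p90 "time to first review" from the GitHub REST API. The latency definition, the single page of 50 closed PRs, and the percentile choice are simplifying assumptions, not the definition the team would necessarily adopt.)

```python
# Rough SLI sketch: p90 of "time to first submitted review" over recently
# closed-and-merged PRs. The latency definition and the single page of 50 PRs
# are simplifying assumptions; a real dashboard would paginate and group by area.
import datetime
from statistics import quantiles

import requests

REPO = "quarkusio/quarkus"
API = "https://api.github.com"
HEADERS = {"Accept": "application/vnd.github+json"}


def iso(ts: str) -> datetime.datetime:
    return datetime.datetime.fromisoformat(ts.replace("Z", "+00:00"))


def first_review_latency_days(pr: dict) -> float | None:
    """Days between PR creation and its first submitted review, if any."""
    reviews = requests.get(
        f"{API}/repos/{REPO}/pulls/{pr['number']}/reviews", headers=HEADERS, timeout=30
    ).json()
    submitted = [iso(r["submitted_at"]) for r in reviews if r.get("submitted_at")]
    if not submitted:
        return None
    return (min(submitted) - iso(pr["created_at"])).total_seconds() / 86400


prs = requests.get(
    f"{API}/repos/{REPO}/pulls",
    params={"state": "closed", "per_page": 50},
    headers=HEADERS,
    timeout=30,
).json()

latencies = []
for pr in prs:
    if not pr.get("merged_at"):
        continue  # only consider PRs that were actually merged
    days = first_review_latency_days(pr)
    if days is not None:
        latencies.append(days)

if len(latencies) >= 2:
    p90 = quantiles(latencies, n=10)[-1]  # rough 90th-percentile estimate
    print(f"Sample: {len(latencies)} merged PRs, p90 time to first review: {p90:.1f} days")
```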



We can introduce the notion of binding reviewer leutenants (like in the Linux kernel) where a binding reviewer whose area is not meeting the SLO would train other peoples (actively delegating the PR work as a first phase before its own review, mentoring, answering questions, etc).
Member

Suggested change
We can introduce the notion of binding reviewer leutenants (like in the Linux kernel) where a binding reviewer whose area is not meeting the SLO would train other peoples (actively delegating the PR work as a first phase before its own review, mentoring, answering questions, etc).
We can introduce the notion of binding reviewer lieutenants (like in the Linux kernel), where a binding reviewer whose area is not meeting the SLO would train other people (actively delegating the PR work as a first phase before its own review, mentoring, answering questions, etc.).

Member

I like the idea, but I'm not sure about the implementation. For me, even lieutenants would require a track record of contributions before being named lieutenants, and if they have that, they could become binding reviewers. Otherwise, it would mean that the last reviewer (the one doing the merge) would need to do a more extensive review (which would add even more latency).

Member Author

That's good input. What's most important to me is to surface these rules that are in your head or in Guillaume's head (and certainly not in my head), and write them down.

So maybe it's more a binding reviewer in training for an area (a mentee concept) to help reach the contribution levels you mentioned earlier.

@cescoffier (Member)

I think we need to start with an ADR explaining the current rules and some magic GitHub invocation to track stale PRs. It does not mean that rules cannot be revisited, but having them written down would help.
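
(A hedged sketch of what such an invocation could look like: grouping stale open PRs by area/ label via the Search API. The 21-day cutoff is an assumption; the area/ label prefix follows the labels already used on this repository, e.g. area/adr.)

```python
# Sketch of the "invocation": which area labels currently have stale open PRs.
# The 21-day cutoff is an assumption; the area/ prefix matches the labels used
# on this repository (e.g. area/adr).
import datetime

import requests

REPO = "quarkusio/quarkus"
CUTOFF_DAYS = 21

cutoff = (datetime.date.today() - datetime.timedelta(days=CUTOFF_DAYS)).isoformat()
search = requests.get(
    "https://api.github.com/search/issues",
    params={
        "q": f"repo:{REPO} is:pr is:open draft:false updated:<{cutoff}",
        "per_page": 100,
    },
    headers={"Accept": "application/vnd.github+json"},
    timeout=30,
).json()

stale_per_area: dict[str, int] = {}
for item in search.get("items", []):
    for label in item.get("labels", []):
        name = label["name"]
        if name.startswith("area/"):
            stale_per_area[name] = stale_per_area.get(name, 0) + 1

for area, count in sorted(stale_per_area.items(), key=lambda kv: -kv[1]):
    print(f"{area}: {count} stale open PR(s)")
```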
