Scale: Refurbish section, and absorb relevant tutorials from community forum #150

amotl · 2024-11-27T18:28:17Z

About

Rework the whole section about scaling CrateDB Clusters. Feedback is very much welcome.

Preview

Current: https://cratedb.com/docs/guide/admin/clustering/scale-up-down.html
Next: https://cratedb-guide--150.org.readthedocs.build/admin/clustering/scale/

Details

The patch absorbs those tutorials previously published on the community forum. Thanks for your excellent work. 💯 ¹

References

I will add a bit of polishing to them on behalf of a subsequent iteration, if you don't mind. ↩

amotl · 2024-11-27T19:29:57Z

docs/admin/clustering/scale/index.md

+## Advices
+
+Please apply best-practice operation guidelines when managing your database
+cluster.
+
+:::{tip}
+:class: font-larger
+
+For safely adding or decommission nodes, it is a good idea to
+add and remove only one node at a time.
+:::
+
+:::{caution}
+:class: font-larger
+
+When removing nodes from a cluster, you must be conscious about maintaining a
+quorum of nodes in the cluster as a whole, even if some nodes would not have
+any data.
+:::


That's your suggestions, quickly slapped into the patch, @henrikingo. Please advise about any improvements.

How to safely decommission nodes #148

Thanks! I think for the last sentence, it's actually not about just removing nodes. Restarting or shutting down a node would do it too. Other than that this is really good!

Thanks. Do you have a suggestion for a good wording that includes all relevant details concisely?

When restarting, shutting down or removing nodes from a cluster, using the DECOMMISION command... you must be conscious about maintaining a quorum of nodes up and connected to the cluster as a whole, even if some nodes would no longer have any data.

Thank you very much. I've updated the patch using your suggestion.

hammerhead · 2024-11-28T08:42:57Z

docs/admin/clustering/scale/demand.md

+
+We will be able to identify the new nodes by using a custom attribute (`node.attr.storage=temporarynodes`) (see further down for details on how to configure this), so the first step is to configure the existing partitions so that they do not consider the new nodes as suitable targets for shard allocation.
+
+In CrateDB 5.1.2 or higher we can achieve this with:


I haven't tested it yet, but since 5.9.0, it shouldn't be needed anymore to change allocation for every single table, but default allocation rules can be set on the cluster level:

Added support to override routing.allocation.* cluster settings with a routing.allocation.* table setting. This can be used to define the default routing behavior for all tables with a cluster setting and reroute individual tables by assigning the table setting using ALTER TABLE SET.

Thanks. So, let's add this as an additional information bit at this very spot, relating to "CrateDB 5.9.0 and higher"? Do you have different suggestions? Please advise.

Hi. 1e60c06 adds a relevant note, see comment below about possibly refining it. Thanks!

docs/admin/clustering/scale/expand.md

docs/admin/clustering/scale/auto.md

docs/admin/clustering/scale/demand.md

Refactor `scale-up-down.md` and `kubernetes.rst` page into dedicated section to be able to expand it further.

Co-authored-by: Henrik Ingo <[email protected]>

Co-authored-by: Niklas Schmidtmer <[email protected]>

In CrateDB 5.9.0 or higher, you don't need to change shard allocations for each individual table any longer. Instead, default allocation rules can be set on the cluster level: > [CrateDB 5.9.0] added support to override `routing.allocation.*` cluster > settings with a `routing.allocation.*` table setting. This can be used to > define the default routing behavior for all tables with a cluster setting > and reroute individual tables by assigning the table setting using `ALTER > TABLE SET`.

amotl · 2024-11-28T18:16:14Z

docs/admin/clustering/scale/demand.md

+In CrateDB 5.1.2 or higher, you can achieve this with:
+```sql
+/* this applies the setting to all existing partitions and new partitions */
+ALTER TABLE test SET ("routing.allocation.exclude.storage" = 'temporarynodes');
+
+/* then we run this other command so that the setting does not apply to new partitions */
+ALTER TABLE ONLY test RESET ("routing.allocation.exclude.storage");
+```
+In CrateDB 5.9.0 or higher, you don't need to change shard allocations for each
+individual table any longer. Instead, default allocation rules can be set on
+the cluster level:
+> [CrateDB 5.9.0] added support to override `routing.allocation.*` cluster
+> settings with a `routing.allocation.*` table setting. This can be used to
+> define the default routing behavior for all tables with a cluster setting
+> and reroute individual tables by assigning the table setting using `ALTER
+> TABLE SET`.


@hammerhead suggested to educate readers about the improvements in CrateDB 5.9.0. Thanks!

Is it sensible to come up with example SQL statements here, similar like the statements above, to emphasize and clarify how a corresponding procedure would work?

NB: With new design elements in the documentation, we have a few more possibilites to convey information in a denser way, or optionally hide it from the main reading flow by using collapsibles. So, don't hesitate to increase the volume of documentation when relaying suggestions. I will try to give them a reasonable layout and formatting. Thanks!

If you are fine with the notice, it will be all good. In this case, please acknowledge.

Otherwise, if you have a few examples at hand, and think adding them would be good, just slap them into a comment, and I will add them to the document at this spot.

henrikingo

Sorry for not approving earlier today, was interrupted

amotl · 2024-11-28T21:58:41Z

No worries, and thanks. Let's merge the patch now, in order to serve the outcome to our readers in the spirit of release early and often?

@hammerhead: Please tell us about any improvements to the section you've suggested, also in retrospective. Thanks!

amotl force-pushed the amo/improve-scaling branch from f4b5706 to ddd1e4d Compare November 27, 2024 18:30

amotl mentioned this pull request Nov 27, 2024

How to safely decommission nodes #148

Closed

amotl force-pushed the amo/improve-scaling branch from 3221cc8 to 54da7a7 Compare November 27, 2024 19:26

amotl requested review from hammerhead, wierdvanderhaar and hlcianfagna November 27, 2024 19:26

amotl commented Nov 27, 2024

View reviewed changes

amotl marked this pull request as ready for review November 27, 2024 19:30

amotl changed the title ~~Scale: Refurbish section~~ Scale: Refurbish section, and absorb relevant tutorials from community forum Nov 27, 2024

amotl mentioned this pull request Nov 27, 2024

Consolidate Integration Tutorials I vs. II #102

Open

amotl force-pushed the amo/improve-scaling branch from 54da7a7 to 6383cdc Compare November 28, 2024 07:37