-
Notifications
You must be signed in to change notification settings - Fork 176
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
rocky9 image fails deployment #100
Comments
Hi @wespiard It seems that you are using legacy boot mode and not UEFI. Can you try with UEFI or there is a reason why UEFI cannot be used? According to canonical/curtin@5b89082 support for RHEL9 was added but not released yet. If you need legacy boot mode, I would suggest to patch snap download maas
unsquashfs maas_xxx.snap
snap pack ./squashfs-root
sudo snap install --dangerous maas_xxx.snap And don't forget to connect snap slots and plugs after that (since for snap connections maas | awk '$1 != "content" && $3 == "-" {print $2}' | xargs -r -n1 sudo snap connect
sudo snap restart maas |
@troyanov thank you for the reply. I'm not sure exactly what you mean. Are you saying the machine I'm trying to deploy is configured to PXE boot in legacy boot mode (via the BIOS)? Or is there a boot configuration in MaaS settings? I don't think I consciously made a decision to use legacy boot mode anywhere, so it may not be necessary. Thanks! |
Upon changing this setting in my BIOS (Dell PowerEdge R730), the MaaS event log just gets stuck in a loop after repeatedly failing to PXE boot, with this message: With a KVM connected to see what is happening, I get the following output:
Then it tries to boot from the next boot options (HDD, Optical drive, etc.) and fails. Then every 30 seconds or so it retries and fails, etc. |
I looked through the MaaS Should it be requesting |
@wespiard it seems that you have to stay with "legacy boot" option then. If patching |
So you are using external DHCP? Can you tell a bit more about your setup, do you have MAAS DHCP as a
IIRC |
This sounds like the easiest option for now. I'll give it a shot. |
Yeah, sorry. External DHCP server that I don't really have control over. The subnet our MaaS machines are on is part of a large corporate network. So I'm assuming that DHCP server is configured to point to MaaS for PXE booting?
So because it looks like 'pxelinux.0' is being requested even when configured for EUFI boot, is the external DHCP server not configured properly to handle it? |
It should, yes.
Since you've mentioned this in a very first message, I would assume that DHCP was configured correctly. |
I am wondering if this is sort of a fallback scenario? Like it tried to do UEFI boot and then did a fall back to legacy BIOS. |
I tried re-uploading the image to MaaS with
Looking through the curtin functions more, it looks like target is read here: rhel_ver = (distro.rpm_get_dist_id(target) I didn't want to follow the code too much, but I'm asssuming it's reading the version from the image itself, not just the parameters we supply to MaaS? Seems like the proper way to do it, I suppose. Also, I noticed this commit this morning: 65e270b Does MaaS actually need to be 3.3 for Rocky 9? My version is 3.2.7. I am okay using Rocky 8 for the time being if there isn't an easy solution for Rocky 9. Don't want to use up too much of your time as there aren't any specific reasons I need 9 over 8 as of now. |
Hey @wespiard, is this still an issue you're encountering? |
I just stuck with Rocky/RHEL 8. I don't have any test machines to play with right now, so I can't promise when I'll try rocky9 next. |
Alright, sounds good. |
I'm on MaaS 3.3.4 and I get the same error when trying to deploy Rocky 9.
|
this is because the maas snap is built on core22, which has a too old curtin, unfortunately (v21.3). the referenced commit above only made it into 23.1.1 the earliest. |
I can confirm 8/9 install used to fail with 3.3/edge. Works fine since I upgraded to latest/edge (3.5 alpha1 which comes with curtin 23.1.1). |
Where did you obtain 3.5 from? I need to deploy Rocky 9x on a few systems and getting the same issue. |
Via snap latest/edge. Or you can build your own package by cloning master branch of git repo if you will. |
We used this project to create a Rocky 9.1 template and the template is working, on UEFI and Legacy. Look like the issue is related to recent version of Rocky 9, current latest is 9.4. with Rocky 9.4;
tested on Maas 3.4.0 and 3.2.9 note: in order to build 9.1 without issue with packer, it's been installed using cdrom as source in the kickstart from the full iso of 9.1 from: https://dl.rockylinux.org/vault/rocky/9.1/isos/x86_64/ |
@pdion891 - I've documented the issue here with Rocky - https://discourse.maas.io/t/rocky-9-3-deployment/7744. This should resolve the boot issues for both legacy and UEFI. |
the bug report for the linked post is #191 also |
First off, I am not sure if this is a MaaS issue, an issue with the packer template, or a tool version issue. I am new to packer, but I've gotten RHEL 8 and Rocky 8 images to be deployed successfully.
I am running packer on the same machine that I'm hosting MaaS with. MaaS is installed via snap, and is version 3.2.7. It is running Ubuntu 20.04.6 LTS (GNU/Linux 5.4.0-144-generic x86_64). The CPU is a Xeon E3-1220 v6, which from what I have found supports x86-64-v2 extensions. Packer is version 1.8.6.
The error in the MaaS GUI's log that is reported is this:
Marking node failed - Missing boot image custom/amd64/ga-20.04/rocky9
After running
make
in therocky9
directory, I uploaded therocky9.tar.gz
file using the command in the readme, replacing$PROFILE
with my MaaS username:Then, after deploying a new machine in MaaS, I get the error stated above. The installation output log ends with the following:
Does the version of
curtin
matter on the build system, or the actual machine the image is being deployed on?For example,
curtin
is version 20.1 on my MaaS Ubuntu machine that I'm trying to build the rocky9 image on. But on the machine it's being deployed on,curtin
is version 22.1, as shown at the beginning of the installation log:curtin: Installation started. (22.1-0ubuntu1~20.04.1)
The text was updated successfully, but these errors were encountered: