-
Notifications
You must be signed in to change notification settings - Fork 558
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
ArcBox-Full deployment hangs Waiting for Kubernetes control plane to be in Provisioned phase... in multiple subscriptions #2087
Comments
Hey friend! Thanks for opening this issue. We appreciate your contribution and welcome you to our community! We are glad to have you here and to have your input on the Azure Arc Jumpstart. |
To Reproduce:
|
@mausolfj: Could you please provide the detailed error you got in Terraform, as I was not able to repro it. Meanwhile, I am trying again with Bastion enabled. Also, I recommend deleting the screenshot with SPN secrets and rotating the secrets. Thank you. |
Appreciate you looking into this Zaid. Additional context included below. ... module.client_vm.azurerm_virtual_machine_extension.custom_script: Still creating... [26m20s elapsed] ... there is no terraform error, just a deadline exceeded, essentially a timeout on the custom script installCAPI.sh provisioning of the capi vm kube control plane should take minutes, instead it hangs here for hours resulting in the custom script timeout exceeding the default time limit of 90 min. We have tried to figure out why we are hung here using kubectl and clusterctl commands but have yet to fully understand why the control plane is stuck in the provisioning state. Any suggestions or ideas you may have that may help us isolate or work around this deployment issue would be greatly appreciated. Thank you - Jeff |
@mausolfj : Yes, I did a deployment now and Terraform and CAPI is successful. |
@mausolfj can you please try to create a new service principal and test it? make sure you are assigning it with the proper permissions. |
Is your issue related to a Jumpstart scenario, ArcBox, HCIBox, or Agora?
[<!--- A link to the Jumpstart scenario you are working on. --->]
(https://azurearcjumpstart.io/azure_jumpstart_arcbox/full/)
Describe the issue or the bug
When attempting to deploy ArcBox-Full module.capi_vm[0].azurerm_virtual_machine_extension.custom_script is still creating... long after ArcBox-Client is complete. The jumpstart_logs/installCAPI.log shows the Rancher K3 cluster (arcbox-capi-mgmt) is ready, providers are installed, and we start Deploying the Kubernetes workload cluster. The following are created before an endless loop of "Waiting for Kubernetes control plane to be in Provisioned phase..." starts:
kubeadmconfigtemplate.bootstrap.cluster.x-k8s.io/arcbox-capi-data-oaeg-md-0 created
cluster.cluster.x-k8s.io/arcbox-capi-data-oaeg created
machinedeployment.cluster.x-k8s.io/arcbox-capi-data-oaeg-md-0 created
kubeadmcontrolplane.controlplane.cluster.x-k8s.io/arcbox-capi-data-oaeg-control-plane created
azurecluster.infrastructure.cluster.x-k8s.io/arcbox-capi-data-oaeg created
azureclusteridentity.infrastructure.cluster.x-k8s.io/cluster-identity created
azuremachinetemplate.infrastructure.cluster.x-k8s.io/arcbox-capi-data-oaeg-control-plane created
azuremachinetemplate.infrastructure.cluster.x-k8s.io/arcbox-capi-data-oaeg-md-0 created
Waiting for Kubernetes control plane to be in Provisioned phase...
Waiting for Kubernetes control plane to be in Provisioned phase...
Waiting for Kubernetes control plane to be in Provisioned phase...
To Reproduce
CAPI cluster provisioning completes and ArcBox-Full is provisioned.
Environment summary
$ az version
{
"azure-cli": "2.51.0",
"azure-cli-core": "2.51.0",
"azure-cli-telemetry": "1.1.0",
"extensions": {}
}
$ terraform version
Terraform v1.5.6
on linux_amd64
Have you looked at the Troubleshooting and Logs section?
Yes, I have reviewed the log files and then attempted ArcBox-Full deployment in another subscription with less stringent security measures but got the same results...
Screenshots
CAPI VM jumpstart_logs/installCAPI.logVSCode terminal window
CAPI cluster still stuck in provisioning state two hours later
more kubctl information
Additional context
The text was updated successfully, but these errors were encountered: