You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
terraform plan etc shows complete cluster replacement
Actual behavior
Error: exit status 1
Error: failed to create kubernetes rest client for read of resource: Get "http://localhost/api?timeout=32s": dial tcp [::1]:80: connect: connection refused
with module.a3ultra-benchmark-pr.module.kubectl_apply.module.kubectl_apply_manifests["9"].kubectl_manifest.apply_doc["0"],
on .terraform/modules/a3ultra-benchmark-pr/modules/management/kubectl-apply/kubectl/main.tf line 60, in resource "kubectl_manifest" "apply_doc":
60: resource "kubectl_manifest" "apply_doc" {
Version (gcluster --version)
tested with both v1.43 and experimental
The text was updated successfully, but these errors were encountered:
I've added kubectl-apply block but that didn't help
chajath
changed the title
A3 Ultra GKE deployment fails due to kubectl misconfig
A3 Ultra GKE deployment fails due to kubectl misconfig when regional/zonal cluster settings are mixed up
Dec 19, 2024
This is not possible today, but we have an open PR #3406 to address the feature request.
Once this PR is merged, you can pull develop and use ./gcluster deploy examples/gke-a3-ultragpu/gke-a3-ultragpu.yaml --force to solve your issue or wait for the release when this is pushed to main
I think the issue here was different and related to the cluster being provisioned as ZONAL vs REGIONAL which might be an assumption in the kubectl-apply module.
Describe the bug
I'm trying to provision A3Ultra cluster with the blueprint modeled after https://github.com/GoogleCloudPlatform/cluster-toolkit/blob/a3ultra-preview/examples/gke-a3-ultragpu/gke-a3-ultragpu.yaml but without kubectl-apply block, as we want to manage k8s resources outside of toolkit.
Steps to reproduce
Steps to reproduce the behavior:
Expected behavior
terraform plan
etc shows complete cluster replacementActual behavior
Version (
gcluster --version
)tested with both
v1.43
andexperimental
The text was updated successfully, but these errors were encountered: