I've been reading through The Book of Kubernetes by Alan Hohn and decided I would try to get the application described in the final chapter working with the Flannel CNI rather than Calico, which is what the application uses by default. The Longhorn storage system refuses to work with Flannel though. Only one longhorn-manager and longhorn-csi-plugin pods start (longhorn-manager-c46fd and longhorn-csi-plugin-nzw5v), both on node02 in example I'm providing. The longhorn-csi-plugin pods are attempting to connect to a socket called /csi/csi.sock (/var/lib/kubelet/plugins/driver.longhorn.io/csi.sock on the host system), which is only available on host02.
NAME READY STATUS RESTARTS AGE IP NODE NOMINATED NODE READINESS GATES
csi-attacher-698d78f74c-5lwfz 0/1 CrashLoopBackOff 50 (4m57s ago) 46h 10.244.5.36 host06 <none> <none>
csi-attacher-698d78f74c-cqdp7 0/1 Error 53 (5m50s ago) 46h 10.244.3.31 host04 <none> <none>
csi-attacher-698d78f74c-t68pd 1/1 Running 51 (5m30s ago) 46h 10.244.4.30 host05 <none> <none>
csi-provisioner-67f89896c9-7twtp 1/1 Running 51 (5m22s ago) 46h 10.244.4.31 host05 <none> <none>
csi-provisioner-67f89896c9-9jlsk 0/1 CrashLoopBackOff 48 (4m39s ago) 46h 10.244.5.34 host06 <none> <none>
csi-provisioner-67f89896c9-tmxdq 0/1 Error 52 (5m42s ago) 46h 10.244.3.32 host04 <none> <none>
csi-resizer-7966f575f5-27s6b 1/1 Running 50 (5m25s ago) 46h 10.244.4.32 host05 <none> <none>
csi-resizer-7966f575f5-67wcq 0/1 CrashLoopBackOff 50 (4m50s ago) 46h 10.244.5.35 host06 <none> <none>
csi-resizer-7966f575f5-zcv8m 0/1 CrashLoopBackOff 51 (12s ago) 46h 10.244.3.35 host04 <none> <none>
csi-snapshotter-845c75b5b6-8rp7f 0/1 CrashLoopBackOff 49 (4m30s ago) 46h 10.244.5.32 host06 <none> <none>
csi-snapshotter-845c75b5b6-trswb 0/1 Error 51 (5m33s ago) 46h 10.244.3.33 host04 <none> <none>
csi-snapshotter-845c75b5b6-z2kzn 1/1 Running 49 (5m16s ago) 46h 10.244.4.33 host05 <none> <none>
engine-image-ei-b0369a5d-6ccqj 1/1 Running 3 (27m ago) 46h 10.244.0.15 host01 <none> <none>
engine-image-ei-b0369a5d-6x486 1/1 Running 3 (26m ago) 46h 10.244.2.15 host03 <none> <none>
engine-image-ei-b0369a5d-fgnns 1/1 Running 3 (25m ago) 46h 10.244.5.30 host06 <none> <none>
engine-image-ei-b0369a5d-qj2xv 1/1 Running 3 (26m ago) 46h 10.244.3.36 host04 <none> <none>
engine-image-ei-b0369a5d-vg5n2 1/1 Running 3 (27m ago) 46h 10.244.1.42 host02 <none> <none>
engine-image-ei-b0369a5d-wlcbv 1/1 Running 3 (25m ago) 46h 10.244.4.36 host05 <none> <none>
instance-manager-37f197c012829700a256d86ef4357100 1/1 Running 0 26m 10.244.1.47 host02 <none> <none>
longhorn-csi-plugin-58wgq 1/3 CrashLoopBackOff 105 (3m14s ago) 45h 10.244.3.29 host04 <none> <none>
longhorn-csi-plugin-k4mmd 1/3 CrashLoopBackOff 104 (3m9s ago) 45h 10.244.5.29 host06 <none> <none>
longhorn-csi-plugin-mnz8z 1/3 CrashLoopBackOff 108 (28s ago) 45h 10.244.2.14 host03 <none> <none>
longhorn-csi-plugin-nzw5v 3/3 Running 52 (12m ago) 46h 10.244.1.45 host02 <none> <none>
longhorn-csi-plugin-r54wv 1/3 CrashLoopBackOff 107 (2m57s ago) 45h 10.244.4.34 host05 <none> <none>
longhorn-csi-plugin-v6b24 1/3 CrashLoopBackOff 108 (39s ago) 45h 10.244.0.13 host01 <none> <none>
longhorn-driver-deployer-56f94fbbd7-psgsk 1/1 Running 17 (25m ago) 46h 10.244.1.44 host02 <none> <none>
longhorn-manager-c46fd 1/1 Running 6 (26m ago) 46h 10.244.1.40 host02 <none> <none>
longhorn-manager-fnp7s 0/1 CrashLoopBackOff 38 (61s ago) 45h 10.244.5.31 host06 <none> <none>
longhorn-manager-jpxrv 0/1 CrashLoopBackOff 38 (97s ago) 45h 10.244.3.34 host04 <none> <none>
longhorn-manager-kzlsm 0/1 CrashLoopBackOff 37 (118s ago) 45h 10.244.0.14 host01 <none> <none>
longhorn-manager-lnmtv 0/1 CrashLoopBackOff 38 (110s ago) 45h 10.244.2.13 host03 <none> <none>
longhorn-manager-s54g6 0/1 CrashLoopBackOff 38 (68s ago) 45h 10.244.4.29 host05 <none> <none>
longhorn-ui-74b6b85c94-7k4v5 1/1 Running 5 (26m ago) 46h 10.244.1.46 host02 <none> <none>
longhorn-ui-74b6b85c94-gfb22 1/1 Running 6 (26m ago) 46h 10.244.1.41 host02 <none> <none>
Logs from failing pods
longhorn-manager-fnp7s
W0624 19:29:48.578629 1 client_config.go:618] Neither --kubeconfig nor --master was specified. Using the inClusterConfig. This might not work.
time="2024-06-24T19:29:48Z" level=info msg="Starting longhorn conversion webhook server" func=webhook.StartWebhook file="webhook.go:24"
time="2024-06-24T19:29:48Z" level=info msg="Waiting for conversion webhook to become ready" func=webhook.StartWebhook file="webhook.go:43"
time="2024-06-24T19:29:48Z" level=warning msg="Failed to check endpoint https://localhost:9501/v1/healthz" func=webhook.isServiceAvailable file="webhook.go:78" error="Get \"https://localhost:9501/v1/healthz\": dial tcp [::1]:9501: connect: connection refused"
time="2024-06-24T19:29:48Z" level=info msg="Active TLS secret longhorn-system/longhorn-webhook-tls (ver=5907) (count 2): map[listener.cattle.io/cn-longhorn-admission-webhook.longhor-59584d:longhorn-admission-webhook.longhorn-system.svc listener.cattle.io/cn-longhorn-conversion-webhook.longho-6a0089:longhorn-conversion-webhook.longhorn-system.svc listener.cattle.io/fingerprint:SHA1=A5D6010EC627A8877305D8828712E103B103C5B3]" func="memory.(*memory).Update" file="memory.go:42"
time="2024-06-24T19:29:48Z" level=info msg="Listening on :9501" func=server.ListenAndServe.func2 file="server.go:77"
time="2024-06-24T19:29:48Z" level=info msg="Starting apiregistration.k8s.io/v1, Kind=APIService controller" func="controller.(*controller).run" file="controller.go:144"
time="2024-06-24T19:29:48Z" level=info msg="Starting apiextensions.k8s.io/v1, Kind=CustomResourceDefinition controller" func="controller.(*controller).run" file="controller.go:144"
time="2024-06-24T19:29:48Z" level=info msg="Starting /v1, Kind=Secret controller" func="controller.(*controller).run" file="controller.go:144"
time="2024-06-24T19:29:48Z" level=info msg="Building conversion rules..." func="server.(*WebhookServer).runConversionWebhookListenAndServe.func1" file="server.go:195"
time="2024-06-24T19:29:48Z" level=info msg="Updating TLS secret for longhorn-system/longhorn-webhook-tls (count: 2): map[listener.cattle.io/cn-longhorn-admission-webhook.longhor-59584d:longhorn-admission-webhook.longhorn-system.svc listener.cattle.io/cn-longhorn-conversion-webhook.longho-6a0089:longhorn-conversion-webhook.longhorn-system.svc listener.cattle.io/fingerprint:SHA1=A5D6010EC627A8877305D8828712E103B103C5B3]" func="kubernetes.(*storage).saveInK8s" file="controller.go:225"
time="2024-06-24T19:29:50Z" level=info msg="Started longhorn conversion webhook server on localhost" func=webhook.StartWebhook file="webhook.go:47"
time="2024-06-24T19:29:51Z" level=warning msg="Failed to check endpoint https://longhorn-conversion-webhook.longhorn-system.svc:9501/v1/healthz" func=webhook.isServiceAvailable file="webhook.go:78" error="Get \"https://longhorn-conversion-webhook.longhorn-system.svc:9501/v1/healthz\": context deadline exceeded (Client.Timeout exceeded while awaiting headers)"
time="2024-06-24T19:29:54Z" level=warning msg="Failed to check endpoint https://longhorn-conversion-webhook.longhorn-system.svc:9501/v1/healthz" func=webhook.isServiceAvailable file="webhook.go:78" error="Get \"https://longhorn-conversion-webhook.longhorn-system.svc:9501/v1/healthz\": context deadline exceeded (Client.Timeout exceeded while awaiting headers)"
longhorn-csi-plugin-58wgq
I0624 19:28:16.188843 64939 main.go:135] Version: v2.9.2
I0624 19:28:16.188921 64939 main.go:136] Running node-driver-registrar in mode=
I0624 19:28:16.188948 64939 main.go:157] Attempting to open a gRPC connection with: "/csi/csi.sock"
W0624 19:28:26.189744 64939 connection.go:232] Still connecting to unix:///csi/csi.sock
W0624 19:28:36.189205 64939 connection.go:232] Still connecting to unix:///csi/csi.sock
W0624 19:28:46.190515 64939 connection.go:232] Still connecting to unix:///csi/csi.sock
E0624 19:28:46.190535 64939 main.go:160] error connecting to CSI driver: context deadline exceeded