Kubernetes 1.18.3

merge0303 · June 16, 2020, 9:59pm

Our Consul 1.7.1 is running fine on an older version of K8s (1.16.2).
We're now testing on a K8s 1.18.3 cluster and having problems reaching port 8500 at hostIP:8500, the way a client agent would normally be reached on a worker node.

Is this expected to work on K8s 1.18.3? Is anyone doing this?
Maybe I need a later version of Consul?

lkysow · June 17, 2020, 4:55pm

Where are you running Kubernetes? This might be an issue with your environment not supporting hostPorts. The next release of consul-helm will support running in hostNetwork mode, which might help.

merge0303 · June 22, 2020, 9:28pm

@lkysow Thanks for the reply. We were running Consul 1.7.1 on K8s 1.18.3 with RHEL8.2 host images, on a private cloud OpenStack cluster. I wanted to upgrade to Consul 1.8.0 before responding, so we're now running Consul 1.8.0, RHEL8.2 and K8s 1.18.3.

What we’re observing is that the “client agent” instances bind to hostIP:8500 and everything comes up fine (three server agents and three client agents). When the cluster is first deployed, it all works just like our existing production setup (RHEL7.6/K8s 1.16).

But with RHEL8.2 hosts, if a Consul client agent daemonset pod is deleted, the replacement pod always fails to bind to hostIP:8500. There are no errors in the client agent logs; it all looks just fine. However, the replacement client agent does not respond to any HTTP queries at hostIP:8500.

Again, everything does work just fine on first bring-up. Only when a daemonset pod is deleted and replaced do we see the issue with the hostIP:8500 binding.

If we revert to RHEL7.6 with K8s 1.16, the Consul client agent pods work just fine (as they have for years) and can be deleted and restarted as deployment conditions require.
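
For anyone trying to reproduce the symptom described above, a small probe run from the worker node before and after deleting the daemonset pod might look like the sketch below. The function names are hypothetical, and the choice of `/v1/status/leader` as a lightweight read-only endpoint is an assumption; any Consul HTTP API path would do.

```python
import http.client
import urllib.error
import urllib.request


def agent_url(host_ip, port=8500, path="/v1/status/leader"):
    """Build the URL a local client would use to reach the agent via hostPort."""
    return f"http://{host_ip}:{port}{path}"


def agent_responds(host_ip, port=8500, timeout=3.0):
    """Return True if the agent answers HTTP 200 at host_ip:port, else False.

    Connection refused, timeouts, and HTTP error statuses all count as
    "not responding" for the purposes of this check.
    """
    try:
        with urllib.request.urlopen(agent_url(host_ip, port), timeout=timeout) as resp:
            return resp.status == 200
    except (urllib.error.URLError, http.client.HTTPException, OSError):
        return False
```

Calling `agent_responds("<hostIP>")` right after the first deployment and again once the replacement pod reports ready would show whether the hostPort path is what changed, independent of what the agent logs say.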

lkysow · June 29, 2020, 4:30pm

Hmm, I’ve never seen that before. I’m not sure it’s Consul-specific. Have you seen any similar issues reported by other Kubernetes users on GitHub?

merge0303 · July 16, 2020, 3:36pm

This appears to be unrelated to Consul itself. It looks like an issue with kube-proxy and the host machine kernel’s version of iptables not working well together. We see a stale iptables entry for the hostPort/podIpAddr left behind after the pod is deleted, and this seems to interfere when a new client agent pod is started using the same hostPort with a new pod IP (hostPort/newPodIpAddr). There are probably not many users trying this with K8s 1.18.3 on RHEL8.2 yet.
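
As a rough illustration of how such a stale entry could be spotted, the sketch below scans the text output of `iptables-save -t nat` for DNAT rules on a given host port whose destination is not one of the currently running pod IPs. The function name and the rule layout it expects (`--dport N ... -j DNAT --to-destination IP:PORT`) are assumptions; the real chain names and rule shape depend on whichever component installs the hostPort rules in a given cluster.

```python
import re

# Matches the host port and the DNAT target (pod IP and port) on one rule line.
DNAT_RE = re.compile(
    r"--dport (\d+)\b.*--to-destination (\d+\.\d+\.\d+\.\d+):(\d+)"
)


def stale_hostport_rules(iptables_save_output, host_port, live_pod_ips):
    """Return DNAT rule lines for host_port that target a pod IP not in live_pod_ips."""
    stale = []
    for line in iptables_save_output.splitlines():
        if "-j DNAT" not in line:
            continue  # only DNAT rules forward a hostPort to a pod
        match = DNAT_RE.search(line)
        if not match:
            continue
        dport, pod_ip = int(match.group(1)), match.group(2)
        if dport == host_port and pod_ip not in live_pod_ips:
            stale.append(line.strip())
    return stale
```

Feeding it the saved NAT table from a worker node, port `8500`, and the set of pod IPs currently scheduled on that node would list any leftover rule still pointing at the deleted pod's address.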
