Error: Consul cluster not able to elect a leader

shweshi · April 3, 2022, 5:52pm

I have deployed a consul cluster with 3 nodes. i have setup --bootstrap-expect to 3. After deployment leader is elected and things works fine. However if i do a new deployment with some changes, using Terraform, aws ecs task. Following things are happening.

3 new tasks are getting created. These 3 new nodes will join the cluster.
The old 3 nodes will take some time to get stopped. Till then i can see there are 6 ips in the peers API.
Once the previous 3 nodes are stopped including the leader the election will start but consul cluster is not able to elect a leader.

I can see the following logs might be useful.
lost leadership because received a requestVote with a newer term

Also i can see the term value is diff in the 3 nodes… how it went out of sync?

The election is starting as soon as the old leader is stopped. but for some reason the leader is not able to elect.

How to fix this? is there something wrong in deployment? what is the best practice to remove the old cluster and bring back the new cluster with leader?

Thanks.

Wolfsrudel · April 3, 2022, 7:24pm

To get a feel for the problem: Can you trigger a force-leave on the old nodes before they are stopped?

Ultimately, I see a cluster with an even number of server nodes, since the old nodes are still valid voting members until the AutoPilot cleans up. If you get the cluster to completely forget about the old nodes and the node count is odd again, it might work.

No guarantee for anything.

shweshi · April 5, 2022, 4:42pm

thanks @Wolfsrudel on new node deployment gracefully exiting the old node servers seems to be working.

either consul leave or setting leave_on_terminate = true configuration works.

Topic		Replies	Views
Consul server not a voter Consul	3	318	February 11, 2024
Cluster leadership instability Consul	0	531	January 14, 2022
Failed leadership election with three node cluster in GKE (Consul v1.5.2) Consul	4	403	February 20, 2023
Consul failing to commit leader election results Consul	9	1834	November 22, 2022
Unable to bring up single node leader when other members in cluster are down Consul	1	792	October 20, 2021

Error: Consul cluster not able to elect a leader

Related topics