How to ReAllocate Nomad job after client recovery

maxim-design · December 29, 2022, 12:45pm

When a client becomes ineligible for any reason (such as a crash), Nomad moves the allocations of jobs running on that client to another healthy client (based on constraints, spread, etc.).
However, when the client comes back to life and rejoins the cluster, or a new client connects to the cluster, Nomad doesn't move those allocations back for a better, more even spread (not until it runs into resource problems).

I was wondering how I can ensure that when a failed client recovers (or a new client connects to the cluster), Nomad returns all the previously moved allocations to it?

jrasell · January 3, 2023, 12:45pm

Hi @maxim-design,

When a client (re)joins the cluster, a number of evaluations are generated to assess required allocation placements, including blocked workloads and jobs of type system. Unfortunately, Nomad does not track previously migrated workloads or perform periodic rebalancing, which is what would produce the behaviour you want.

The Nomad backlog does have #10039, which details a rebalancing feature; however, I am not aware of it being on the roadmap at present. Adding a +1 is always useful for internal prioritisation, along with any comments that have not already been made.
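
Until something like that exists, any rebalancing has to be driven from outside Nomad. As a rough illustration only, here is a minimal Python sketch of the planning half: given a snapshot of which allocations sit on which client, it picks the allocations that exceed an even share. The function name `plan_rebalance` and the input shape are hypothetical and are not part of Nomad. Stopping the selected allocations (for example with `nomad alloc stop`) asks the scheduler to place replacements, but the scheduler still decides where they land, so this does not guarantee they return to the recovered client.

```python
from typing import Dict, List


def plan_rebalance(allocs_by_node: Dict[str, List[str]]) -> List[str]:
    """Return allocation IDs that exceed an even per-node share.

    `allocs_by_node` is a hypothetical snapshot mapping each eligible
    client name to the allocation IDs currently running on it. A client
    that has just rejoined would appear with an empty list.
    """
    if not allocs_by_node:
        return []

    total = sum(len(allocs) for allocs in allocs_by_node.values())
    # Ceiling division: the most allocations any node holds in an even spread.
    target = -(-total // len(allocs_by_node))

    to_stop: List[str] = []
    # Sort node names so the plan is deterministic for a given snapshot.
    for node in sorted(allocs_by_node):
        # Everything beyond the even share is a candidate for rescheduling.
        to_stop.extend(allocs_by_node[node][target:])
    return to_stop


if __name__ == "__main__":
    # Assumed example: client-b has just recovered and holds nothing.
    snapshot = {
        "client-a": ["alloc-1", "alloc-2", "alloc-3", "alloc-4"],
        "client-b": [],
    }
    for alloc_id in plan_rebalance(snapshot):
        print(alloc_id)
```

This only works on allocation counts; it ignores resources, constraints, and spread targets, all of which the scheduler still applies when it places the replacements.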

Thanks,
jrasell and the Nomad team

maxim-design · January 4, 2023, 1:32pm

Thanks for the information. I will definitely add my +1 to the mentioned thread, and will be eagerly awaiting it.
