Poststop task receiving main task result/status

zalken · July 14, 2021, 2:18am

I’m running batch jobs. Is it possible, in a task configured with a poststop lifecycle hook, to receive/forward status from the main task execution’s result?

Use case:
When the main task fails, send a message to notify that this specific task is in error.

Docs currently mention that poststop task can be used to “recover from failure”:

They are useful for performing post-processing that isn’t available in the main tasks or for recovering from failures in the main tasks.

But I’m not sure to understand how failures can be distinguished from success, in this case.

Any help much appreciated!

jrasell · July 14, 2021, 9:09am

Hi @zalken,

I am not aware of any Nomad native manner in which the exit status or such is passed between tasks. The approach I would potentially look at is either having the poststop task check the Nomad API for the status, or have the main task write a breadcrumb file to the shared alloc dir that the poststop task can then read to determine what actions to perform.

Thanks,
jrasell and the Nomad team

zalken · July 14, 2021, 11:29am

Thanks for this answer. That makes sense, and corresponds to the scenarios I imagined.

Considering that, for example, a failed artifact download doesn’t even hit the driver/configured command, I guess there’s no real way in the main task to write something to shared alloc dir, no matter when the failure happen - ie when initialising the task, or during its execution? It would save a roundtrip compared to checking Nomad API.

lgfa29 · July 21, 2021, 12:12am

Hi @zalken

I’m not sure if it helps, but maybe the test job file we use in our E2E test suite could give you some ideas on how to do what you are looking for?

github.com

hashicorp/nomad/blob/main/e2e/lifecycle/inputs/batch.nomad

# lifecycle hook test job for batch jobs. touches, removes, and tests
# for the existence of files to assert the order of running tasks.
# all tasks should exit 0 and the alloc dir should contain the following
# files: ./init-ran, ./main-ran, ./poststart-run

job "batch-lifecycle" {

  datacenters = ["dc1"]

  type = "batch"

  constraint {
    attribute = "${attr.kernel.name}"
    value     = "linux"
  }

  group "test" {

    task "init" {

This file has been truncated. show original

jeteve · February 8, 2024, 10:29am

Given I have the nomad client in my running image, how would I query the reflective nomad API?

Topic		Replies	Views
How to get exit status of jobs from inside poststop task? Nomad	0	178	August 17, 2023
How to only run a poststop lifecycle task if main task hasn't failed Nomad	0	333	December 24, 2021
Can one Nomad task in task group take down other task if fails? Nomad	0	238	May 11, 2023
Use consul connect with nomad poststop hooks Nomad	0	230	August 24, 2022
Nomad alloc status always failed Nomad	4	428	April 1, 2021

Poststop task receiving main task result/status

Related topics