SLURM sacct shows 'batch' and 'extern&

2019-05-29 03:31发布

问题:

I have submitted a job to a SLURM queue, the job has run and completed. I then check the completed jobs using the sacct command. But looking at the results of the sacct command I notice additional results that I did not expect:

       JobID                        JobName      State      NCPUS  Timelimit
5297048                                test  COMPLETED          1   00:10:00  
5297048.bat+                          batch  COMPLETED          1           
5297048.ext+                         extern  COMPLETED          1       

Can anyone explain what the 'batch' and 'extern' jobs are and what their purpose is. Why does the extern job always complete even when the primary job fails.

I have attempted to search the documentation but have not found a satisfactory and complete answer.

EDIT: Here's the script I am submitting to produce the above sacct output:

#!/bin/bash
echo test_script > done.txt

With the following sbatch command:

sbatch -A BRIDGE-CORE-SL2-CPU --nodes=1 --ntasks=1 -p skylake --cpus-per-task 1 -J jobname -t 00:10:00 --output=./output.out --error=./error.err < test.sh

回答1:

A job is comprised of several job steps. Each job step is shown independently. In your output you have $JOBID, which stands for the overall reservation, $JOBID.batch, which represents the main script you submitted.

Regarding external... I'm not sure, but I guess that you started one job step and you named it as "external". In that case, that is the information of that job step.

If you show us the script you submitted we can clarify some soubts.



标签: slurm