Checking job status#
There are two methods to check your job status.
Method 1: show_job#
We provide a show_job script. It groups, filters, and sorts your job information
and adds summary statistics to produce clean, readable output.
[lexg@m3-login3 ~]$ show_job
Job Summary
+-----------------------+--------------------------------+
| Summary | Value |
+-----------------------+--------------------------------+
| Cluster | M3 |
| User Name | Lex Gallon (Monash University) |
| Num of Submitted Jobs | 0 |
| Num of Running Job | 0 |
| Num of CPU Cores | 0 |
+-----------------------+--------------------------------+
********************
Job Details on M3
********************
+----+------+---------+---------+-----------+-----+-------+--------------+----------+-------------+---------+
| ID | USER | ACCOUNT | JOBNAME | PARTITION | QOS | STATE | RUNNING TIME | WALLTIME | NO OF NODES | DETAILS |
+----+------+---------+---------+-----------+-----+-------+--------------+----------+-------------+---------+
+----+------+---------+---------+-----------+-----+-------+--------------+----------+-------------+---------+
*********************
Job History on M3 sorted by attribute <start> start from 2024-09-17T17:02:05
*********************
+----------+---------------------+----------+---------+----------+----------+-----------------------+-----------+
| Job ID | Job Name | Username | Account | Priority | Hostname | Requested Memory (MB) | Status |
+----------+---------------------+----------+---------+----------+----------+-----------------------+-----------+
| 39883309 | interactive_session | lexg | nq46 | 106198 | m3i023 | 4096 | COMPLETED |
| 39883037 | Desktop | lexg | nq46 | 106198 | m3j002 | 4096 | CANCELLED |
| 39882958 | interactive_session | lexg | nq46 | 106198 | m3i029 | 4096 | COMPLETED |
| 39882955 | interactive_session | lexg | nq46 | 106198 | m3i029 | 4096 | CANCELLED |
| 39882906 | interactive_session | lexg | nq46 | 106198 | m3i022 | 4096 | COMPLETED |
| 39882703 | smux-job | lexg | nq46 | 106198 | m3i020 | 4096 | COMPLETED |
| 39878936 | bash | lexg | nq46 | 132851 | m3n103 | 4096 | FAILED |
| 39878241 | smux-job | lexg | nq46 | 106198 | m3i018 | 4096 | TIMEOUT |
| 39878231 | interactive_session | lexg | nq46 | 106194 | m3n103 | 65536 | CANCELLED |
| 39877919 | interactive_session | lexg | nq46 | 106194 | m3n103 | 65536 | CANCELLED |
| 39877874 | tmux | lexg | nq46 | 132851 | m3n103 | 4096 | COMPLETED |
| 39877731 | bash | lexg | nq46 | 132851 | m3n102 | 4096 | TIMEOUT |
| 39777081 | bash | lexg | nq46 | 132846 | m3a120 | 4096 | FAILED |
+----------+---------------------+----------+---------+----------+----------+-----------------------+-----------+
Run show_job --help for more information about the command.
Hint
To check the status of a single job, use show_job [JOBID].
Method 2: Slurm commands#
To display all of your running and pending jobs, use squeue -u `whoami`.
Hint
whoami returns your M3 username, so you do not have to type it out.
$ squeue -u `whoami`
JOBID PARTITION NAME USER ST TIME NODES NODELIST(REASON)
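As a small sketch, the squeue call above can be wrapped in a shell function that also filters by job state. The function name my_jobs is hypothetical and not part of M3 or Slurm; -u and -t (--states) are standard squeue options, and $(whoami) is the modern equivalent of the backtick form.

```shell
# Hypothetical helper (not provided on M3): list your own jobs,
# optionally filtered by a Slurm job state such as PENDING or RUNNING.
my_jobs() {
    if [ -n "$1" ]; then
        # -t restricts the listing to the given state(s), comma-separated
        squeue -u "$(whoami)" -t "$1"
    else
        # No argument: show all of your pending and running jobs
        squeue -u "$(whoami)"
    fi
}
```

For example, my_jobs PENDING would list only the jobs still waiting in the queue, while my_jobs with no argument behaves like the plain command above.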
To view the status of a single job:
$ scontrol show job [JOBID]
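scontrol show job prints many Key=Value fields for the job. If only the state is of interest, it can be extracted from that output. The sketch below assumes the usual JobState=VALUE field in the output; the helper name job_state is hypothetical.

```shell
# Hypothetical helper: read `scontrol show job` output on stdin and
# print only the value of the JobState field (e.g. RUNNING, PENDING).
job_state() {
    grep -o 'JobState=[A-Z_]*' | head -n 1 | cut -d= -f2
}

# Example use on the cluster (replace [JOBID] with a real job ID):
#   scontrol show job [JOBID] | job_state
```

Note that scontrol generally reports only jobs that are pending, running, or recently finished; for older jobs, the job history shown by show_job or the Slurm sacct command is the usual place to look.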