Checking the status of M3#
On M3, users can check the status of all nodes via the show_cluster
command.
The output of this command should be similar to:
$ show_cluster
NODE TYPE PARTITION* CPU Mem (MB) GPU/Phi STATUS
(Free) (Free) (Free)
---------------------------------------------------------------------------------------
m3c001 K80 desktop 0 0 0 Busy
m3c002 K80 desktop 0 0 0 Busy
m3c003 K80 desktop 0 0 0 Busy
m3c004 K80 desktop 0 0 0 Busy
m3c005 K80 desktop 0 0 0 Busy
m3c006 K80 desktop 0 0 0 Busy
m3c007 K80 desktop 0 0 0 Busy
m3c008 K80 desktop 0 0 0 Busy
m3c009 K80 OFFLINE REASON: Not responding Offline
m3c010 K80 desktop 0 64 0 Busy
m3c011 K80 desktop 0 0 0 Busy
m3c012 K80 desktop 0 0 0 Busy
m3c013 K80 desktop 0 0 0 Busy
m3c014 K80 desktop 0 0 0 Busy
m3d100 CPU comp 48 732 0 Idle
m3d101 CPU comp 48 732 0 Idle
m3d112 CPU comp 16 482 0 Running
m3d113 CPU comp 48 732 0 Idle
m3d114 CPU comp 48 732 0 Idle
m3d115 CPU comp 48 732 0 Idle
The STATUS field explained#
The STATUS
field can show:
Idle - Node is completely free. No jobs running on the node.
Running - Some jobs are running on the node but it still has available resources for new jobs.
Busy - Node is completely busy. There are no free resources on the node. No new jobs can start on this node.
Offline - Node is offline and unavailable due to a system issue.
Reserved - Node has been booked by other users and is ONLY available for them.