Checking the status of M3#

On M3, users can check the status of all nodes via the show_cluster command. The output of this command should be similar to:

$ show_cluster
 NODE            TYPE      PARTITION*         CPU     Mem (MB)   GPU/Phi         STATUS
                                           (Free)       (Free)    (Free)
 ---------------------------------------------------------------------------------------
 m3c001             K80           desktop         0         0         0           Busy
     m3c002             K80           desktop         0         0         0           Busy
     m3c003             K80           desktop         0         0         0           Busy
     m3c004             K80           desktop         0         0         0           Busy
     m3c005             K80           desktop         0         0         0           Busy
     m3c006             K80           desktop         0         0         0           Busy
     m3c007             K80           desktop         0         0         0           Busy
     m3c008             K80           desktop         0         0         0           Busy
     m3c009             K80 OFFLINE REASON:                   Not responding        Offline
     m3c010             K80           desktop         0        64         0           Busy
     m3c011             K80           desktop         0         0         0           Busy
     m3c012             K80           desktop         0         0         0           Busy
     m3c013             K80           desktop         0         0         0           Busy
     m3c014             K80           desktop         0         0         0           Busy
     m3d100             CPU              comp        48       732         0           Idle
     m3d101             CPU              comp        48       732         0           Idle
     m3d112             CPU              comp        16       482         0        Running
     m3d113             CPU              comp        48       732         0           Idle
     m3d114             CPU              comp        48       732         0           Idle
     m3d115             CPU              comp        48       732         0           Idle

The STATUS field explained#

The STATUS field can show:

  • Idle - Node is completely free. No jobs running on the node.

  • Running - Some jobs are running on the node but it still has available resources for new jobs.

  • Busy - Node is completely busy. There are no free resources on the node. No new jobs can start on this node.

  • Offline - Node is offline and unavailable due to a system issue.

  • Reserved - Node has been booked by other users and is ONLY available for them.