Attention

This page is manually updated, so it may be slightly out of date from time to time.

Please email help@massive.org.au if you have any issues.

MASSIVE System Status and Known Issues

System Status

If you are having difficulty accessing MASSIVE, you should first check the status of the system. The status of all Monash eResearch systems can be checked at a glance on statuspage.io The status of M3/MASSIVE is updated based on automatic monitoring of

  • Are the login nodes accessible by SSH

  • Are the data transfer nodes accessible by SSH

  • What is the read and write bandwidth to the Lustre file systems

  • What is the read and write bandwidth to the NFS file systems

Archived System Status and Known Issues

20th May 2020 AEST

M3 is experiencing an ongoing service interruption relating to the /home file system. This disruption began at 16:30pm AEST.

16th Apr 2020 AEST

M3 is experiencing an ongoing service interruption relating to the Lustre file system. This disruption began at 11:54am AEST.

16th Dec 2019 5:15pm AEDT

M3 is experiencing an ongoing service interruption relating to the Lustre file system. This disruption began at 4:43pm AEDT.

The system status is as follows:

  • The slurm job scheduler is operational and accepting jobs

  • The home directory and software stack services are operational

  • The Lustre file systems /projects and /scratch are operational, however there may be slow performance from time to time until we resolve the issue

12th Dec 2019 10:10am AEDT

M3 is operational following yesterday’s scheduled maintenance. There are no major incidents impacting the service.

6th Dec 2019 3:00pm AEDT

M3 is operational, and there are no major incidents impacting the service. Specifically:

  • The slurm job scheduler is operational and accepting jobs

  • The home directory and software stack services are operational

  • The Lustre file systems /projects and /scratch are operational, however there may be slow performance from time to time until we resolve the issue

Please email help@massive.org.au if you notice any system issues.

29th Nov 2019 3:30 pm AEDT

M3 is experiencing a service interruption. We are actively working to resolve the issue.

27th Nov 2019

M3 is experiencing an ongoing service interruption relating to the Lustre file system. Read and write operations to /projects may be slower than normal. We have submitted a detailed event log to our vendor and are in the process of applying recommended fixes to the Lustre hosts.

The job scheduler is still accepting jobs and users can still request desktop sessions.

3rd Oct 2019

We are still troubleshooting the M3 filesystem issue, the job scheduler is still accepting jobs and users can still request desktop sessions.