Monthly maintenance Sept 12, 2022 7am-11am
Regular monthly maintenance will take place Monday Sept. 12th, 2022 7-11am
NOTICES
- New user training sessions are available every month. New training sessions on specific tools and concepts will be added soon.
You can find a list and links to sign up here: https://www.rc.fas.harvard.edu/upcoming-training/ - New partition tool 'spart' now available. This new command will show users the partitions they have access to and information about those partitions
- The New England Research Cloud (NERC) is now available, including to Harvard researchers. This self-service resource provides on-prem (MGHPCC) on-demand cloud services. For more info, see their site at https://nerc.mghpcc.org or links to the service here: https://docs.rc.fas.harvard.edu/kb/nerc/
- If you have a 2021/2022 publication that made use of the FASRC cluster and you don't find it listed here, please do let us know: https://www.rc.fas.harvard.edu/cluster/publications/
GENERAL MAINTENANCE
- Slurm upgrade to 22.05.3
- Audience: All cluster job users
- Impact: Jobs and the scheduler will be paused during upgrade and resume once maintenance completes
- tempfs change in Slurm (Cannon - This already exists in FASSE)
- Audience: All cluster job users who use local (on node) /tmp scratch
- Impact: After this change, any job using /tmp on a node will now have its own virtual slice of /tmp (does not currently affect /scratch).
This will affect jobs that try to share data across nodes by using /tmp. More details: https://slurm.schedmd.com/job_container.conf.html
- Data Center network upgrade
- 100Gb cutover POSTPONED
- Upgrade of Boston switch firmware may cause brief interruptions during maintenance period
- Computer exchange for certain partitions
- Audience: AMD nodes associated with shakhnovich, hernquist, doshi-velez, ni_lab
- Impact: These AMD nodes will be retired and exchanged with Intel nodes
- Decommission of partitions
- Audience: remotedesktop, august, hernquist-dev
- Impact: These partitions are deprecated and will no longer be available
- Login node and VDI node reboots
- Audience: Anyone logged into a a login node or VDI/OOD node
- Impact: Login and VDI/OOD nodes will be unavailable while updating and rebooting
- Scratch cleanup ( https://docs.rc.fas.harvard.edu/kb/policy-scratch/ )
- Audience: Cluster users
- Impact: Files older than 90 days will be removed.
- Reminder: Scratch 90-day file retention purging runs occur regularly not just during maintenance periods.