Welcome to the NIU HPC facilities, which are powerful tools for study and research. To start, please pay attention to the vital information on this page.
The new NIU compute cluster named Metis (also the Greek goddess of good counsel, planning, cunning and wisdom) was commissioned in September 2023. Therefore, the new accounts are expected to use Metis. All users of our older system, Gaea, are welcome to copy their projects to the Metis but can continue to use Gaea to complete the current tasks.
Both systems are very similar in architecture; current users can use Gaea accounts and passwords to access Metis, and the Gaea documentation pages are compatible with the Metis system. However, there are several essential differences listed below:
@gaea ~] ssh -Y username@metis.niu.edu
@metis ~]$
rsync -av username@gaea.niu.edu:/home/username/ /home/username/
@metis ~]$
rsync -av username@gaea.niu.edu:/data1/projectName/projectFolder /lstr/sahara/projectName/
These nodes work as gateways between a user and the rest of the systems. These are the only cluster computers accessible from public networks. They are shared between all users and provide tools to run software installed on the clusters and to develop custom applications. While powerful, the login nodes have limited resources and should be used responsibly:
Any production jobs should be submitted via the batch system. We impose a combined limit on the use of the login node resource usage, and any process exceeding 30% of available CPUs or memory will be killed after 30 min of the run time. If more extended interactive tests are necessary, please request an interactive batch system job
The PBS batch system manages all compute (a.k.a "worker") nodes. The batch system allows reserving a part of cluster nodes to run a particular application (a job). The more resources the job requires, the larger the cost of a user mistake:
The default unrestricted batch queues on Metis and Gaea allow correspondingly 6 and 12 jobs to run simultaneously. Still, there are no limits on the number of submitted jobs, which will be scheduled to start when possible. Additionally, jobs can be routed (via the amount of requested walltime, number of nodes and/or processors, etc.) to special queues allowing more jobs to run. The total number of jobs allowed to run simultaneously is 512 on Metis and 60 on Gaea. To list available queues, use "qstat -q"
command from the login nodes.
The accessible disk space is configured to be used as described below -
"quota -s"
command from the login node.We only keep a previous day snapshot of /home folders at the Metis cluster, available from metis.niu.edu via /nfs/ihfs/home_yesterday
. We strongly encourage users to use personal GitHub repositories for code development and frequently backup essential data and results to remote locations.