HPC Cluster Hardware Resources: Difference between revisions

From HPC Docs
Jump to navigation Jump to search
No edit summary
 
(55 intermediate revisions by the same user not shown)
Line 1: Line 1:
This page currently contains no content. (code411)
== Networking/Interconnects ==
All nodes in the chart listed below on this page contain multiple network connections.  All nodes except gpu-node006 through gpu-node008 contain 25Gbps Ethernet network interfaces. The gpu-node006 through gpu-node008 nodes contain 10Gbps Ethernet network interfaces.
 
== Network Storage ==
The ELSA cluster utilized network based storage that is shared among all nodes for storing personal files, project/research files, course files and the HPC applications. There are two pairs of identical storage servers. One pari is located in the STEM cluster room and another in the Green Hall datacenter.  Data from the STEM storage servers are regularly (i.e. nightly) replicated to the ones in Green Hall. The is a total of approximately 6.3PB of raw storage available. The storage servers are Linux-based utilizing the [https://zfsonlinux.org/ ZFS on Linux filesystem] and [[wikipedia:Network_File_System|NFS]] for sharing with the cluster nodes.
 
== Data Transfer Node ==
A [https://fasterdata.es.net/science-dmz/DTN/ data transfer node (DTN)] is used for transferring large files in and out of a cluster. It is designed to handle high-speed, high-volume transfers. The ELSA DTN contains 122.9TB (raw) of  SSD storage for temporarily holding large file transfers. It also has a 100Gbps Ethernet interface to maximize throughput.
 
== PerfSONAR ==
[https://www.perfsonar.net/about/what-is-perfsonar/ PerfSONAR] is a network performance testing and monitoring system. It regularly runs tests bandwidth and latency tests and if issues arise, it helps pinpoint the location in the network path causing the issue.
 
== Node Configurations ==
The following describes the contents of columns in the tables below.
* '''Node Name''' = name of the node server. Login nodes (login001 & login002) are accessible via the '''elsa.hpc.tcnj.edu''' load-balancer (e.g. using SSH) from the campus network (wired or wireless) or via the [https://tcnj.teamdynamix.com/TDClient/KB/ArticleDet?ID=17531 TCNJ VPN]. Other nodes are not meant to be directly accessed.
* '''Processor Family''' = the generation of processor in the node. Intel: Skylake Gold > Skylake Silver > Broadwell; AMD: Epyc Genoa > Epyc Rome
* '''Available Cores''' = these are the processing cores that compute jobs can use
* '''Reserved Cores''' = these cores are reserved for system use and not available to user jobs
* '''RAM Memory''' = how much memory the node contains
* '''GPU Count''' = number of GPU accelerators in the node
* '''NVIDIA GPU Type''' = the model of the GPU accelerators in the node. NVIDIA: L40S > RTX2080 >= GTX1080Ti > GTX1080
* '''Queue Membership''' = which queues (SLURM partitions) this node is a member of. Nodes can be a member of multiple queues. Note some queues are used for internal purposes (e.g. remoteviz, interactive) and should not be used for submitting your jobs except under certain conditions. Please see the [[HPC_Cluster_Job_Scheduler#ELSA_Job_Partitions.2FQueues|SLURM Partitions]] page for more information on the specification of these queues/partition.


{| class="wikitable"
{| class="wikitable"
|-
|-
! Node<br>Name !! Processor<br>Family !! Available<br>Cores !! Reserved<br>Cores !! RAM<br>Memory !! GPU<br>Count !! GPU<br>Type !! Queue<br>Membership
! Node<br>Name !! Processor<br>Family !! Available<br>Cores !! Reserved<br>Cores !! RAM<br>Memory !! GPU<br>Count !! NVIDIA<br>GPU Type !! Queue<br>Membership(s) !! Notes
|-
| login001 || virtual || 8 || 0 || 8G || 0 || n/a || n/a || Public hostname<br>'''elsa.hpc.tcnj.edu'''
|-
| login002 || virtual || 8 || 0 || 8G || 0 || n/a || n/a || Public hostname<br>'''elsa.hpc.tcnj.edu'''
|-
| osg-login || virtual || 4 || 0 || 4G || 0 || n/a || n/a || Dedicated for '''Open Science Grid''' job submissions
|-
| node001 || AMD Epyc Genoa || 190 || 2 || 1,536G || 0 || n/a || amdtest <!-- short, normal, long, interactive, nolimit --> || &nbsp;
|-
| node002 || AMD Epyc Genoa || 190 || 2 || 1,536G || 0 || n/a || amdtest <!-- short, normal, long, interactive, nolimit --> || &nbsp;
|-
| node003 || AMD Epyc Genoa || 190 || 2 || 1,536G || 0 || n/a || amdtest <!-- short, normal, long, interactive, nolimit --> || &nbsp;
|-
| node004 || AMD Epyc Genoa || 190 || 2 || 1,536G || 0 || n/a || amdtest <!-- short, normal, long, interactive, nolimit --> || &nbsp;
|-
| node005 || AMD Epyc Genoa || 190 || 2 || 1,536G || 0 || n/a || amdtest <!-- short, normal, long, interactive, nolimit --> || &nbsp;
|-
| node006 || AMD Epyc Genoa || 190 || 2 || 1,536G || 0 || n/a || amdtest <!-- short, normal, long, interactive, nolimit --> || &nbsp;
<!--
|-
| node007 || Intel Nehalem || 7 || 1 || 48G || 0 || n/a || short, interactive || &nbsp;
|-
| node008 || Intel Nehalem || 7 || 1 || 48G || 0 || n/a || short, interactive || &nbsp;
|-
| node009 || Intel Nehalem || 7 || 1 || 48G || 0 || n/a || short, interactive || &nbsp;
|-
| node010 || Intel Nehalem || 7 || 1 || 48G || 0 || n/a || short, interactive || &nbsp;
|-
| node011 || Intel Nehalem || 7 || 1 || 48G || 0 || n/a || short, normal, interactive || &nbsp;
|-
| node012 || Intel Nehalem || 7 || 1 || 48G || 0 || n/a || short, normal, interactive || &nbsp;
|-
| node013 || Intel Nehalem || 7 || 1 || 48G || 0 || n/a || short, normal, interactive || &nbsp;
|-
| node014 || Intel Nehalem || 7 || 1 || 48G || 0 || n/a || short, normal, interactive || &nbsp;
|-
| node015 || Intel Nehalem || 7 || 1 || 48G || 0 || n/a || short, normal, interactive || &nbsp;
|-
| node016 || Intel Nehalem || 7 || 1 || 48G || 0 || n/a || short, normal, interactive || &nbsp;
|-
| node017 || Intel Nehalem || 7 || 1 || 48G || 0 || n/a || short, normal, interactive || &nbsp;
|-
| node018 || Intel Nehalem || 7 || 1 || 48G || 0 || n/a || short, normal, interactive, nolimit || &nbsp;
|-
| node019 || Intel Nehalem || 7 || 1 || 48G || 0 || n/a || short, normal, interactive, nolimit || &nbsp;
|-
| node020 || Intel Nehalem || 7 || 1 || 48G || 0 || n/a || short, normal, interactive, nolimit || &nbsp;
|-
| node021 || Intel Nehalem || 7 || 1 || 48G || 0 || n/a || short, normal, long, interactive, nolimit || &nbsp;
|-
| node022 || Intel Nehalem || 7 || 1 || 48G || 0 || n/a || short, normal, long, interactive, nolimit || &nbsp;
|-
| node023 || Intel Nehalem || 7 || 1 || 48G || 0 || n/a || short, normal, long, interactive, nolimit || &nbsp;
|-
| node024 || Intel Nehalem || 7 || 1 || 48G || 0 || n/a || short, normal, long, interactive, nolimit || &nbsp;
|-
| node025 || Intel Nehalem || 7 || 1 || 48G || 0 || n/a || short, normal, long, interactive, nolimit || &nbsp;
|-
| node026 || Intel Nehalem || 7 || 1 || 48G || 0 || n/a || short, normal, long, interactive, nolimit || &nbsp;
|-
| node027 || Intel Nehalem || 7 || 1 || 48G || 0 || n/a || short, normal, long, interactive, nolimit || &nbsp;
|-
| node028 || Intel Nehalem || 7 || 1 || 48G || 0 || n/a || short, normal, long, interactive, nolimit || &nbsp;
|-
| node029 || Intel Nehalem || 7 || 1 || 48G || 0 || n/a || short, normal, long, interactive, nolimit || &nbsp;
|-
| node030 || Intel Nehalem || 7 || 1 || 48G || 0 || n/a || short, normal, long, interactive, nolimit || &nbsp;
|-
| node031 || Intel Nehalem || 7 || 1 || 48G || 0 || n/a || short, normal, long, interactive, nolimit || &nbsp;
|-
| node032 || Intel Nehalem || 7 || 1 || 48G || 0 || n/a || short, normal, long, interactive, nolimit || &nbsp;
|-
| node033 || Intel Nehalem || 7 || 1 || 48G || 0 || n/a || short, normal, long, interactive, nolimit || &nbsp;
|-
| node034 || Intel Nehalem || 7 || 1 || 48G || 0 || n/a || short, normal, long, interactive, nolimit || &nbsp;
|-
| node035 || Intel Nehalem || 7 || 1 || 48G || 0 || n/a || short, normal, long, interactive, nolimit || &nbsp;
|-
| node036 || Intel Nehalem || 7 || 1 || 48G || 0 || n/a || short, normal, long, interactive, nolimit || &nbsp;
|-
| node037 || Intel Nehalem || 7 || 1 || 48G || 0 || n/a || short, normal, long, interactive, nolimit || &nbsp;
|-
| node038 || Intel Nehalem || 7 || 1 || 48G || 0 || n/a || short, normal, long, interactive, nolimit || &nbsp;
|-
| node039 || Intel Nehalem || 7 || 1 || 48G || 0 || n/a || short, normal, long, interactive, nolimit || &nbsp;
|-
| node040 || Intel Nehalem || 7 || 1 || 48G || 0 || short, normal, long, interactive, nolimit || &nbsp;
|-
| node041 || Intel Nehalem || 7 || 1 || 48G || 0 || n/a || short, normal, long, interactive, nolimit || &nbsp;
|-
| node042 || Intel Nehalem || 7 || 1 || 48G || 0 || n/a || short, normal, long, interactive, nolimit || &nbsp;
|-
| node043 || Intel Nehalem || 7 || 1 || 48G || 0 || n/a || short, normal, long, interactive, nolimit || &nbsp;
|-
| node044 || Intel Nehalem || 7 || 1 || 48G || 0 || n/a || short, normal, long, interactive, nolimit || &nbsp;
|-
| node045 || Intel Nehalem || 7 || 1 || 48G || 0 || n/a || short, normal, long, interactive, nolimit || &nbsp;
|-
|-
| node001 || Nehalem || 7 || 1 || 48G || 0 || n/a || Example
| node046 || Intel Nehalem || 7 || 1 || 48G || 0 || n/a || short, normal, long, interactive, nolimit || &nbsp;
|-
|-
| node002 || Nehalem || 7 || 1 || 48G || 0 || n/a || Example
| node047 || Intel Nehalem || 7 || 1 || 48G || 0 || n/a || short, normal, long, interactive, nolimit || &nbsp;
|-
|-
| node003 || Nehalem || 7 || 1 || 48G || 0 || n/a || Example
| node048 || Intel Nehalem || 7 || 1 || 48G || 0 || n/a || short, normal, long, interactive, nolimit || &nbsp;
|-
|-
| node004 || Nehalem || 7 || 1 || 48G || 0 || n/a || Example
| node049 || Intel Nehalem || 7 || 1 || 48G || 0 || n/a || short, normal, long, interactive, nolimit || &nbsp;
|-
|-
| node005 || Nehalem || 7 || 1 || 48G || 0 || n/a || Example
| node050 || Intel Nehalem || 7 || 1 || 48G || 0 || n/a || short, normal, long, interactive, nolimit || &nbsp;
|-
|-
| node006 || Nehalem || 7 || 1 || 48G || 0 || n/a || Example
| node051 || Intel Nehalem || 7 || 1 || 48G || 0 || n/a || short, normal, long, interactive, nolimit || &nbsp;
|-
|-
| node007 || Nehalem || 7 || 1 || 48G || 0 || n/a || Example
| node052 || Intel Nehalem || 7 || 1 || 48G || 0 || n/a || short, normal, long, interactive, nolimit || &nbsp;
|-
|-
| node008 || Nehalem || 7 || 1 || 48G || 0 || n/a || Example
| node053 || Intel Nehalem || 7 || 1 || 48G || 0 || n/a || short, normal, long, interactive, nolimit || &nbsp;
|-
|-
| node009 || Nehalem || 7 || 1 || 48G || 0 || n/a || Example
| node054 || Intel Nehalem || 7 || 1 || 48G || 0 || n/a || short, normal, long, interactive, nolimit || &nbsp;
|-
|-
| node010 || Nehalem || 7 || 1 || 48G || 0 || n/a || Example
| node055 || Intel Nehalem || 7 || 1 || 48G || 0 || n/a || short, normal, long, interactive, nolimit || &nbsp;
|-
|-
| node011 || Nehalem || 7 || 1 || 48G || 0 || n/a || Example
| node056 || Intel Nehalem || 7 || 1 || 48G || 0 || n/a || short, normal, long, interactive, nolimit || &nbsp;
|-
|-
| node012 || Nehalem || 7 || 1 || 48G || 0 || n/a || Example
| node057 || Intel Nehalem || 7 || 1 || 48G || 0 || n/a || short, normal, long, interactive, nolimit || &nbsp;
|-
|-
| node013 || Nehalem || 7 || 1 || 48G || 0 || n/a || Example
| node058 || Intel Nehalem || 7 || 1 || 48G || 0 || n/a || short, normal, long, interactive, nolimit || &nbsp;
|-
|-
| node014 || Nehalem || 7 || 1 || 48G || 0 || n/a || Example
| node059 || Intel Nehalem || 7 || 1 || 48G || 0 || n/a || short, normal, long, interactive, nolimit || &nbsp;
|-
|-
| node015 || Nehalem || 7 || 1 || 48G || 0 || n/a || Example
| node060 || Intel Nehalem || 7 || 1 || 48G || 0 || n/a || short, normal, long, interactive, nolimit || &nbsp;
-->
|-
|-
| node016 || Nehalem || 7 || 1 || 48G || 0 || n/a || Example
| node061 || Intel Skylake Gold || 30 || 2 || 192G || 0 || n/a || short, normal, long, interactive, nolimit || &nbsp;
|-
|-
| node017 || Nehalem || 7 || 1 || 48G || 0 || n/a || Example
| node062 || Intel Skylake Gold || 30 || 2 || 192G || 0 || n/a || short, normal, long, interactive, nolimit || &nbsp;
|-
|-
| node018 || Nehalem || 7 || 1 || 48G || 0 || n/a || Example
| node063 || Intel Skylake Gold || 30 || 2 || 192G || 0 || n/a || short, normal, long, interactive, nolimit || &nbsp;
|-
|-
| node019 || Nehalem || 7 || 1 || 48G || 0 || n/a || Example
| node064 || Intel Skylake Gold || 30 || 2 || 192G || 0 || n/a || short, normal, long, interactive, nolimit || &nbsp;
|-
|-
| node020 || Nehalem || 7 || 1 || 48G || 0 || n/a || Example
| node065 || Intel Skylake Gold || 30 || 2 || 192G || 0 || n/a || short, normal, long, interactive, nolimit || &nbsp;
|-
|-
| node021 || Nehalem || 7 || 1 || 48G || 0 || n/a || Example
| node066 || Intel Skylake Gold || 30 || 2 || 192G || 0 || n/a || short, normal, long, interactive, nolimit || &nbsp;
|-
|-
| node022 || Nehalem || 7 || 1 || 48G || 0 || n/a || Example
| node067 || Intel Skylake Gold || 30 || 2 || 192G || 0 || n/a || short, normal, long, interactive, nolimit || &nbsp;
|-
|-
| node023 || Nehalem || 7 || 1 || 48G || 0 || n/a || Example
| node068 || Intel Skylake Gold || 30 || 2 || 192G || 0 || n/a || short, normal, long, interactive, nolimit || &nbsp;
|-
|-
| node024 || Nehalem || 7 || 1 || 48G || 0 || n/a || Example
| node069 || Intel Skylake Gold || 30 || 2 || 192G || 0 || n/a || short, normal, long, interactive, nolimit || &nbsp;
|-
|-
| node025 || Nehalem || 7 || 1 || 48G || 0 || n/a || Example
| node070 || Intel Skylake Gold || 30 || 2 || 192G || 0 || n/a || short, normal, long, interactive, nolimit || &nbsp;
|-
|-
| node026 || Nehalem || 7 || 1 || 48G || 0 || n/a || Example
| node071 || Intel Skylake Gold || 30 || 2 || 192G || 0 || n/a || short, normal, long, interactive, nolimit || &nbsp;
|-
|-
| node027 || Nehalem || 7 || 1 || 48G || 0 || n/a || Example
| node072 || Intel Skylake Gold || 30 || 2 || 192G || 0 || n/a || short, normal, long, interactive, nolimit || &nbsp;
|-
|-
| node028 || Nehalem || 7 || 1 || 48G || 0 || n/a || Example
| node073 || Intel Skylake Gold || 30 || 2 || 192G || 0 || n/a || short, normal, long, interactive, nolimit || &nbsp;
|-
|-
| node029 || Nehalem || 7 || 1 || 48G || 0 || n/a || Example
| node074 || Intel Skylake Gold || 30 || 2 || 192G || 0 || n/a || short, normal, long, interactive, nolimit || &nbsp;
|-
|-
| node030 || Nehalem || 7 || 1 || 48G || 0 || n/a || Example
| node075 || Intel Skylake Gold || 30 || 2 || 192G || 0 || n/a || short, normal, long, interactive, nolimit || &nbsp;
|-
|-
| node031 || Nehalem || 7 || 1 || 48G || 0 || n/a || Example
| node076 || Intel Skylake Gold || 30 || 2 || 192G || 0 || n/a || short, normal, long, interactive, nolimit || &nbsp;
|-
|-
| node032 || Nehalem || 7 || 1 || 48G || 0 || n/a || Example
| node077 || Intel Skylake Gold || 30 || 2 || 192G || 0 || n/a || short, normal, long, interactive, nolimit || &nbsp;
|-
|-
| node033 || Nehalem || 7 || 1 || 48G || 0 || n/a || Example
| node078 || Intel Skylake Gold || 30 || 2 || 192G || 0 || n/a || short, normal, long, interactive, nolimit || &nbsp;
|-
|-
| node034 || Nehalem || 7 || 1 || 48G || 0 || n/a || Example
| node079 || Intel Skylake Gold || 30 || 2 || 192G || 0 || n/a || short, normal, long, interactive, nolimit || &nbsp;
|-
|-
| node035 || Nehalem || 7 || 1 || 48G || 0 || n/a || Example
| node080 || Intel Skylake Gold || 30 || 2 || 192G || 0 || n/a || short, normal, long, interactive, nolimit || &nbsp;
|-
|-
| node036 || Nehalem || 7 || 1 || 48G || 0 || n/a || Example
| node081 || Intel Skylake Gold || 30 || 2 || 192G || 0 || n/a || short, normal, long, interactive, nolimit || &nbsp;
|-
|-
| node037 || Nehalem || 7 || 1 || 48G || 0 || n/a || Example
| node082 || Intel Skylake Gold || 30 || 2 || 192G || 0 || n/a || short, normal, long, interactive, nolimit || &nbsp;
|-
|-
| node038 || Nehalem || 7 || 1 || 48G || 0 || n/a || Example
| node083 || Intel Skylake Gold || 30 || 2 || 192G || 0 || n/a || short, normal, long, interactive, nolimit || &nbsp;
|-
|-
| node039 || Nehalem || 7 || 1 || 48G || 0 || n/a || Example
| node084 || Intel Skylake Gold || 30 || 2 || 192G || 0 || n/a || short, normal, long, interactive, nolimit || &nbsp;
|-
|-
| node040 || Nehalem || 7 || 1 || 48G || 0 || n/a || Example
| node085 || Intel Skylake Gold || 30 || 2 || 192G || 0 || n/a || interactive || &nbsp;
|-
|-
| node041 || Nehalem || 7 || 1 || 48G || 0 || n/a || Example
| node086 || AMD Rome || 62 || 2 || 512G || 0 || n/a || amd || &nbsp;
|-
|-
| node042 || Nehalem || 7 || 1 || 48G || 0 || n/a || Example
| node087 || AMD Rome || 62 || 2 || 512G || 0 || n/a || grahamlab || Restricted use
|-
|-
| node043 || Nehalem || 7 || 1 || 48G || 0 || n/a || Example
| gpu-node001 || Intel Broadwell || 19 || 1 || 256G || 4 || GTX 1080 || gpu || GPU has 8G VRAM
|-
|-
| node044 || Nehalem || 7 || 1 || 48G || 0 || n/a || Example
| gpu-node002 || Intel Broadwell || 19 || 1 || 256G || 4 || GTX 1080 || gpu || GPU has 8G VRAM
|-
|-
| node045 || Nehalem || 7 || 1 || 48G || 0 || n/a || Example
| gpu-node003 || Intel Broadwell || 19 || 1 || 256G || 4 || GTX 1080 || gpu || GPU has 8G VRAM
|-
|-
| node046 || Nehalem || 7 || 1 || 48G || 0 || n/a || Example
| gpu-node004 || Intel Broadwell || 19 || 1 || 256G || 4 || GTX 1080 || gpu || GPU has 8G VRAM
|-
|-
| node047 || Nehalem || 7 || 1 || 48G || 0 || n/a || Example
| gpu-node005 || Intel Broadwell || 19 || 1 || 256G || 4 || GTX 1080 || shortgpu || GPU has 8G VRAM
|-
|-
| node048 || Nehalem || 7 || 1 || 48G || 0 || n/a || Example
| gpu-node006 || Intel Skylake Silver || 19 || 1 || 192G || 8 || GTX 1080Ti || gpu || GPU has 11G VRAM
|-
|-
| node049 || Nehalem || 7 || 1 || 48G || 0 || n/a || Example
| gpu-node007 || Intel Skylake Silver || 19 || 1 || 192G || 8 || GTX 1080Ti || gpu || GPU has 11G VRAM
|-
|-
| node050 || Nehalem || 7 || 1 || 48G || 0 || n/a || Example
| gpu-node008 || Intel Skylake Silver || 19 || 1 || 192G || 8 || GTX 1080Ti || gpu || GPU has 11G VRAM
|-
|-
| node051 || Nehalem || 7 || 1 || 48G || 0 || n/a || Example
| gpu-node009 || Intel Skylake Gold|| 31 || 1 || 384G || 4 || RTX 2080 || gpu || GPU has 8G VRAM
|-
|-
| node052 || Nehalem || 7 || 1 || 48G || 0 || n/a || Example
| gpu-node010 || Intel Skylake Gold|| 31 || 1 || 384G || 4 || RTX 2080 || gpu || GPU has 8G VRAM
|-
|-
| node053 || Nehalem || 7 || 1 || 48G || 0 || n/a || Example
| gpu-node011 || Intel Skylake Gold|| 31 || 1 || 384G || 4 || RTX 2080 || gpu || GPU has 8G VRAM
|-
|-
| node054 || Nehalem || 7 || 1 || 48G || 0 || n/a || Example
| gpu-node012 || Intel Skylake Gold|| 31 || 1 || 384G || 4 || RTX 2080 || gpu || GPU has 8G VRAM
|-
|-
| node055 || Nehalem || 7 || 1 || 48G || 0 || n/a || Example
| gpu-node013 || Intel Skylake Gold|| 31 || 1 || 384G || 4 || RTX 2080 || gpu || GPU has 8G VRAM
|-
|-
| node056 || Nehalem || 7 || 1 || 48G || 0 || n/a || Example
| gpu-node014 || Intel Skylake Gold|| 31 || 1 || 384G || 4 || RTX 2080 || gpu || GPU has 8G VRAM
|-
|-
| node057 || Nehalem || 7 || 1 || 48G || 0 || n/a || Example
| gpu-node015 || Intel Skylake Gold|| 31 || 1 || 384G || 4 || RTX 2080 || gpu || GPU has 8G VRAM
|-
|-
| node058 || Nehalem || 7 || 1 || 48G || 0 || n/a || Example
| gpu-node016 || Intel Skylake Gold|| 31 || 1 || 384G || 4 || RTX 2080 || gpu || GPU has 8G VRAM
|-
|-
| node059 || Nehalem || 7 || 1 || 48G || 0 || n/a || Example
| gpu-node017 || Intel Skylake Gold|| 31 || 1 || 384G || 4 || RTX 2080 || gpu || GPU has 8G VRAM
|-
| gpu-node018 || Intel Skylake Gold|| 31 || 1 || 384G || 4 || RTX 2080 || gpu || GPU has 8G VRAM
|-
| gpu-node019 || AMD Epyc Genoa || 30 || 2 || 512G || 4 || L40S || gputest || GPU has 48G VRAM
|-
| gpu-node020 || AMD Epyc Genoa || 30 || 2 || 512G || 4 || L40S || gputest || GPU has 48G VRAM
|-
| gpu-node021 || AMD Epyc Genoa || 30 || 2 || 512G || 4 || L40S || gputest || GPU has 48G VRAM
|-
| viz-node001 || Intel Skylake Gold|| 38 || 2 || 768G || 1 || Tesla V100 || remoteviz || '''Tesla V100 (high-end)''' remote visualization option
|-
| viz-node002 || Intel Skylake Gold|| 38 || 2 || 768G || 1 || Tesla V100 || remoteviz || '''Tesla V100 (high-end)''' remote visualization option
|-
| dev-node001 || Intel Broadwell || 20 || 0 || 256G || 1 || GTX 1080 || dev || '''GTX1080 (low-end)''' remote visualization option
|-
|-
| node060 || Nehalem || 7 || 1 || 48G || 0 || n/a || Example
|}
|}

Latest revision as of 16:25, 6 September 2024

Networking/Interconnects

All nodes in the chart listed below on this page contain multiple network connections. All nodes except gpu-node006 through gpu-node008 contain 25Gbps Ethernet network interfaces. The gpu-node006 through gpu-node008 nodes contain 10Gbps Ethernet network interfaces.

Network Storage

The ELSA cluster utilized network based storage that is shared among all nodes for storing personal files, project/research files, course files and the HPC applications. There are two pairs of identical storage servers. One pari is located in the STEM cluster room and another in the Green Hall datacenter. Data from the STEM storage servers are regularly (i.e. nightly) replicated to the ones in Green Hall. The is a total of approximately 6.3PB of raw storage available. The storage servers are Linux-based utilizing the ZFS on Linux filesystem and NFS for sharing with the cluster nodes.

Data Transfer Node

A data transfer node (DTN) is used for transferring large files in and out of a cluster. It is designed to handle high-speed, high-volume transfers. The ELSA DTN contains 122.9TB (raw) of SSD storage for temporarily holding large file transfers. It also has a 100Gbps Ethernet interface to maximize throughput.

PerfSONAR

PerfSONAR is a network performance testing and monitoring system. It regularly runs tests bandwidth and latency tests and if issues arise, it helps pinpoint the location in the network path causing the issue.

Node Configurations

The following describes the contents of columns in the tables below.

  • Node Name = name of the node server. Login nodes (login001 & login002) are accessible via the elsa.hpc.tcnj.edu load-balancer (e.g. using SSH) from the campus network (wired or wireless) or via the TCNJ VPN. Other nodes are not meant to be directly accessed.
  • Processor Family = the generation of processor in the node. Intel: Skylake Gold > Skylake Silver > Broadwell; AMD: Epyc Genoa > Epyc Rome
  • Available Cores = these are the processing cores that compute jobs can use
  • Reserved Cores = these cores are reserved for system use and not available to user jobs
  • RAM Memory = how much memory the node contains
  • GPU Count = number of GPU accelerators in the node
  • NVIDIA GPU Type = the model of the GPU accelerators in the node. NVIDIA: L40S > RTX2080 >= GTX1080Ti > GTX1080
  • Queue Membership = which queues (SLURM partitions) this node is a member of. Nodes can be a member of multiple queues. Note some queues are used for internal purposes (e.g. remoteviz, interactive) and should not be used for submitting your jobs except under certain conditions. Please see the SLURM Partitions page for more information on the specification of these queues/partition.
Node
Name
Processor
Family
Available
Cores
Reserved
Cores
RAM
Memory
GPU
Count
NVIDIA
GPU Type
Queue
Membership(s)
Notes
login001 virtual 8 0 8G 0 n/a n/a Public hostname
elsa.hpc.tcnj.edu
login002 virtual 8 0 8G 0 n/a n/a Public hostname
elsa.hpc.tcnj.edu
osg-login virtual 4 0 4G 0 n/a n/a Dedicated for Open Science Grid job submissions
node001 AMD Epyc Genoa 190 2 1,536G 0 n/a amdtest  
node002 AMD Epyc Genoa 190 2 1,536G 0 n/a amdtest  
node003 AMD Epyc Genoa 190 2 1,536G 0 n/a amdtest  
node004 AMD Epyc Genoa 190 2 1,536G 0 n/a amdtest  
node005 AMD Epyc Genoa 190 2 1,536G 0 n/a amdtest  
node006 AMD Epyc Genoa 190 2 1,536G 0 n/a amdtest  
node061 Intel Skylake Gold 30 2 192G 0 n/a short, normal, long, interactive, nolimit  
node062 Intel Skylake Gold 30 2 192G 0 n/a short, normal, long, interactive, nolimit  
node063 Intel Skylake Gold 30 2 192G 0 n/a short, normal, long, interactive, nolimit  
node064 Intel Skylake Gold 30 2 192G 0 n/a short, normal, long, interactive, nolimit  
node065 Intel Skylake Gold 30 2 192G 0 n/a short, normal, long, interactive, nolimit  
node066 Intel Skylake Gold 30 2 192G 0 n/a short, normal, long, interactive, nolimit  
node067 Intel Skylake Gold 30 2 192G 0 n/a short, normal, long, interactive, nolimit  
node068 Intel Skylake Gold 30 2 192G 0 n/a short, normal, long, interactive, nolimit  
node069 Intel Skylake Gold 30 2 192G 0 n/a short, normal, long, interactive, nolimit  
node070 Intel Skylake Gold 30 2 192G 0 n/a short, normal, long, interactive, nolimit  
node071 Intel Skylake Gold 30 2 192G 0 n/a short, normal, long, interactive, nolimit  
node072 Intel Skylake Gold 30 2 192G 0 n/a short, normal, long, interactive, nolimit  
node073 Intel Skylake Gold 30 2 192G 0 n/a short, normal, long, interactive, nolimit  
node074 Intel Skylake Gold 30 2 192G 0 n/a short, normal, long, interactive, nolimit  
node075 Intel Skylake Gold 30 2 192G 0 n/a short, normal, long, interactive, nolimit  
node076 Intel Skylake Gold 30 2 192G 0 n/a short, normal, long, interactive, nolimit  
node077 Intel Skylake Gold 30 2 192G 0 n/a short, normal, long, interactive, nolimit  
node078 Intel Skylake Gold 30 2 192G 0 n/a short, normal, long, interactive, nolimit  
node079 Intel Skylake Gold 30 2 192G 0 n/a short, normal, long, interactive, nolimit  
node080 Intel Skylake Gold 30 2 192G 0 n/a short, normal, long, interactive, nolimit  
node081 Intel Skylake Gold 30 2 192G 0 n/a short, normal, long, interactive, nolimit  
node082 Intel Skylake Gold 30 2 192G 0 n/a short, normal, long, interactive, nolimit  
node083 Intel Skylake Gold 30 2 192G 0 n/a short, normal, long, interactive, nolimit  
node084 Intel Skylake Gold 30 2 192G 0 n/a short, normal, long, interactive, nolimit  
node085 Intel Skylake Gold 30 2 192G 0 n/a interactive  
node086 AMD Rome 62 2 512G 0 n/a amd  
node087 AMD Rome 62 2 512G 0 n/a grahamlab Restricted use
gpu-node001 Intel Broadwell 19 1 256G 4 GTX 1080 gpu GPU has 8G VRAM
gpu-node002 Intel Broadwell 19 1 256G 4 GTX 1080 gpu GPU has 8G VRAM
gpu-node003 Intel Broadwell 19 1 256G 4 GTX 1080 gpu GPU has 8G VRAM
gpu-node004 Intel Broadwell 19 1 256G 4 GTX 1080 gpu GPU has 8G VRAM
gpu-node005 Intel Broadwell 19 1 256G 4 GTX 1080 shortgpu GPU has 8G VRAM
gpu-node006 Intel Skylake Silver 19 1 192G 8 GTX 1080Ti gpu GPU has 11G VRAM
gpu-node007 Intel Skylake Silver 19 1 192G 8 GTX 1080Ti gpu GPU has 11G VRAM
gpu-node008 Intel Skylake Silver 19 1 192G 8 GTX 1080Ti gpu GPU has 11G VRAM
gpu-node009 Intel Skylake Gold 31 1 384G 4 RTX 2080 gpu GPU has 8G VRAM
gpu-node010 Intel Skylake Gold 31 1 384G 4 RTX 2080 gpu GPU has 8G VRAM
gpu-node011 Intel Skylake Gold 31 1 384G 4 RTX 2080 gpu GPU has 8G VRAM
gpu-node012 Intel Skylake Gold 31 1 384G 4 RTX 2080 gpu GPU has 8G VRAM
gpu-node013 Intel Skylake Gold 31 1 384G 4 RTX 2080 gpu GPU has 8G VRAM
gpu-node014 Intel Skylake Gold 31 1 384G 4 RTX 2080 gpu GPU has 8G VRAM
gpu-node015 Intel Skylake Gold 31 1 384G 4 RTX 2080 gpu GPU has 8G VRAM
gpu-node016 Intel Skylake Gold 31 1 384G 4 RTX 2080 gpu GPU has 8G VRAM
gpu-node017 Intel Skylake Gold 31 1 384G 4 RTX 2080 gpu GPU has 8G VRAM
gpu-node018 Intel Skylake Gold 31 1 384G 4 RTX 2080 gpu GPU has 8G VRAM
gpu-node019 AMD Epyc Genoa 30 2 512G 4 L40S gputest GPU has 48G VRAM
gpu-node020 AMD Epyc Genoa 30 2 512G 4 L40S gputest GPU has 48G VRAM
gpu-node021 AMD Epyc Genoa 30 2 512G 4 L40S gputest GPU has 48G VRAM
viz-node001 Intel Skylake Gold 38 2 768G 1 Tesla V100 remoteviz Tesla V100 (high-end) remote visualization option
viz-node002 Intel Skylake Gold 38 2 768G 1 Tesla V100 remoteviz Tesla V100 (high-end) remote visualization option
dev-node001 Intel Broadwell 20 0 256G 1 GTX 1080 dev GTX1080 (low-end) remote visualization option