Storage on SHARK
The SHARK cluster has multiple data storage types. This section details the different storage types in use.
A number of best practices for reading and writing data (I/O), and basic housekeeping of your files can be found here: Best Practices | Best Practices Shared File System
HPC storage does not have any backups!!
Storage solutions
HPC Isilon storage.
This is fast storage for direct access to your data on the cluster, which can be purchased from the IT&DI department through Topdesk. Once purchased this storage will be NFS v4 mounted on all the nodes on the cluster. The default mountpoint will be /exports/.
Access to this mountpoint is handled by an Active Directory group. The default mount access rights are set by an Ansible playbook. To grant users access to this share you need to have them added to the Active Directory group attached to the share. To find out which group is attached to your data storage use the following commandls -aldh /exports/<storage-share-name> | awk '{print $4}'
No backups or snapshots are available for the HPC Isilon storage.research LTS Isilon storage
This is slow storage for archiving data which can be purchased from the IT&DI department through Topdesk. Once purchased this storage will be NFS v4 mounted on all the execution/gpu/mem nodes on the cluster with read only access, on the login nodes you will have read and write access. The default mountpoint will be /exports/archive/. Access to this mountpoint is handled by an Active Directory group. The default mount access rights are set by an Ansible playbook. To grant users access to this share you need to have them added to the Active Directory group attached to the share. To find out which group is attached to your data storage use the following commandls -aldh /exports/archive/<storage-share-name> | awk '{print $4}'
How to stage your data:
From within your submit script
Create a directory on the local /tmp
mkdir -p /tmp/$USER/$SLURM_JOB_ID
Change permissions so only the owner can view the files
chmod 700 /tmp/$USER
Copy your data
cp <path>/to/Data /tmp/$USER/$SLURM_JOB_ID/
Execute your binary and save output to /tmp/$USER/$SLURM_JOB_ID/
At the end of your sbatch script move/copy your output to your HPC/LTS storage.
Now remove the complete directory on the staged directory `rm -Rf /tmp/$USER/$SLURM_JOB_ID/'
Importand information for /tmp directory
Data is scratch and will be treated as such
If needed data can be removed by the admins without notice !!
There is no backup for /tmp
Data and empty folders will be removed automatically
There is no quota do not abuse this FileSystem, data can and will be removed if necessary
Set the security to read,write,execute for the owner only
chmod 700
Special directories on the cluster
/bam-export/ This directory is created for sharing your Binary Alignment/Map files. This is a temporarel share, first in first out principle. Data here should not contain any patient related data and can be deleted at any time (make sure you have a copy somewhere else). This directory can be used in the UCSC Genome Browser to view your data tracks. The files here are accessible through a webbrowser with the following URL https://barmsijs.lumc.nl/. The files can only be accessed if you know the exact file name and the files are on the /bam-export/ directory word readable.
/home The /home directory is an Isilon HPC export mounted on the /home directory for all the nodes in the cluster. This export is limited to 10 Gb per person. Your home directory is automatically created the first time you log into the cluster and is the same as your username. By default your /home/ directory is world readable and world executable. This directory should not be used for data storage.
mount point | Storage | Size | Usage | owner | group | security rights | mount option login nodes | mount option compute nodes |
---|---|---|---|---|---|---|---|---|
/home | HPC Isilon | 10GB | For small personel storage | AD username | AD Group | rwxr-xr-x | read/write | read/write |
/exports | HPC Isilon | - | Mount point for depertment/project storage | AD username | AD Group | rwxrws--- | read/write | read/write |
/exports/archive | LTS Isilon | - | Mount point for department/project Long Term Storage | AD username | AD Group | rwxrws--- | read/write | read/only |
/bam-export | HPC Isilon | 2TB | For displaying BAM files on https://barmsijs.lumc.nl | AD username | AD Group | rwxrwxrwx | read/write | read/write |
How to get access to a HPC or LTS share.
The shares are mounted on a folder on Shark and an Active Directory group grants access to the folder where the share is mounted.
To get access to the storage share only your Department ACPr (Automation Contact Person) can grant access to the share through a self service desk form Aanvraag: Mutaties netwerkrechten of folders (add user to a security group)