Storage on FRCE

There are several options for file storage on the FRCE cluster, each with its own advantages and disadvantages. Shares available to all users include

Available disk shares
DirectoryQuotaAvailabilityData Protection
/home/username256GBcluster-wideVolume snapshots are kept for 30 days
/scratch/cluster_scratch/username5TBcluster-wideno protection
/scratch/local or /tmpno quota, limited to volume sizenot shared between hostsno protection

/home

Each user has a home directory called /home/username which is accessible from every HPC system. The /home directories have a quota of 256GB which cannot be increased. It is commonly used for config files, code, scripts, and analysis results. The share is relatively slow and applications should not process data on the share. Volume-level snapshots (backups) of the directories are made daily and kept for 30 days. Users can restore their own files from the past 14 days from the directory /home/.snapshot/timestamp/user. Restores from later than that will require a Service Now ticket directed to the EIT Storage group.

/scratch/cluster_scratch

This share is mounted on all FRCE nodes and is a high-performance network share. While the intent is for transient storage of raw data there is no defined lifetime for storage. The system does not ever delete data from this share. It is the responsibility of the user to manage file storage.

/scratch/local

This share is also mounted as /tmp and is local to each server. Files stored on one system will not be available on other systems, including the head node. Intended for data needed only during the duration of a single job and the files may be deleted when the systems reboot. Performance is comparable to that of /scratch/cluster_scratch.


Snapshots of most NFS shares are made daily and kept for a varying number of days depending on the requirements of the share owner. These snapshots are user accessible, and any user can retrieve earlier versions of their files. Substitute your account name for username in this example.

# First, list the available snapshots by date
$ ls -lc /home/.snapshot
total 504
drwxr-xr-x 500 root root 257536 Aug 30 23:30 2263164_shared-home_shared-home
drwxr-xr-x 500 root root 257536 Aug 31 23:30 2266190_shared-home_shared-home
drwxr-xr-x 500 root root 257536 Sep  1 23:30 2269261_shared-home_shared-home
drwxr-xr-x 500 root root 257536 Sep  2 23:30 2272677_shared-home_shared-home
drwxr-xr-x 500 root root 257536 Sep  3 23:30 2276645_shared-home_shared-home
drwxr-xr-x 500 root root 257536 Sep  4 23:30 2281308_shared-home_shared-home
drwxr-xr-x 500 root root 257536 Sep  5 23:30 2285730_shared-home_shared-home
drwxr-xr-x 500 root root 257536 Sep  6 23:30 2289821_shared-home_shared-home
drwxr-xr-x 500 root root 257536 Sep  7 23:30 2293117_shared-home_shared-home
drwxr-xr-x 500 root root 257536 Sep  8 23:30 2296740_shared-home_shared-home
drwxr-xr-x 502 root root 258560 Sep  9 23:30 2301115_shared-home_shared-home
drwxr-xr-x 502 root root 258560 Sep 10 23:30 2305897_shared-home_shared-home
drwxr-xr-x 503 root root 259072 Sep 11 23:30 2310713_shared-home_shared-home
drwxr-xr-x 504 root root 259584 Sep 12 23:30 2315649_shared-home_shared-home
# Select the new snapshot taken before the date you need to recover from
$ cd /home/.snapshot/2301115_shared-home_shared-home/username
# Get a list of files/directories that can be recovered
$ ls -l
< ... file list ...>
# Copy the needed file back to your directory
$ cp -a .bashrc /home/username/.bashrc-recovered
$ cd
# Compare the dates on the new and old files
$ ls -l .bashrc*
-rw-r--r-- 1 username username 1496 Sep 13 09:36 .bashrc
-rw-r--r-- 1 username username 1446 Jul 25 09:19 .bashrc-recovered