_index.md 6.97 KB
Newer Older
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
+++
title = "Handling Data"
description = "How to work with and transfer data to/from HCC resources."
weight = "30"
+++

<span id="title-text"> HCC-DOCS : Handling Data </span>
=======================================================

Created by <span class="author"> Derek Weitzel</span>, last modified by
<span class="editor"> Carrie Brown</span> on Sep 18, 2018

<span
class="aui-icon aui-icon-small aui-iconfont-warning confluence-information-macro-icon"></span>

HCC currently has no storage that is suitable for HIPAA or other PID
data sets.  Users are not permitted to store such data on HCC machines.

All HCC machines have three separate areas for every user to store data,
each intended for a different purpose.   In addition, we have a transfer
service that utilizes [Globus Connect](Globus-Connect_6357013.html).

<span
class="confluence-embedded-file-wrapper image-center-wrapper confluence-embedded-manual-size"><img src="assets/images/332256/35325560.png" class="confluence-embedded-image image-center" width="1000" /></span>

Home Directory
--------------

<span
class="aui-icon aui-icon-small aui-iconfont-info confluence-information-macro-icon"></span>

You can access your home directory quickly using the $HOME environmental
variable (i.e. '`cd $HOME'`).

Your home directory (i.e. `/home/[group]/[username]`) is meant for items
that take up relatively small amounts of space.  For example:  source
code, program binaries, configuration files, etc.  This space is
quota-limited to **20GB per user**.  The home directories are backed up
for the purposes of best-effort disaster recovery.  This space is not
intended as an area for I/O to active jobs.  **/home** is mounted
**read-only** on cluster worker nodes to enforce this policy.

Common Directory
----------------

<span
class="aui-icon aui-icon-small aui-iconfont-info confluence-information-macro-icon"></span>

You can access your common directory quickly using the $COMMON
environmental variable (i.e. '`cd $COMMON`')

The common directory operates similarly to work and is mounted with
**read and write capability to worker nodes all HCC Clusters**. This
means that any files stored in common can be accessed from Crane, Tusker
and Sandhills making this directory ideal for items that need to be
accessed from multiple clusters such as reference databases and shared
data files.

<span
class="aui-icon aui-icon-small aui-iconfont-warning confluence-information-macro-icon"></span>

Common is not designed for heavy I/O usage. Please continue to use your
work directory for active job output to ensure the best performance of
your jobs.

Quotas for common are **30 TB per group**, with larger quotas available
for purchase if needed. However, files stored here will **not be backed
up** and are **not subject to purge** at this time. Please continue to
backup your files to prevent irreparable data loss.

Additional information on using the common directories can be found in
the documentation on [Using the /common File System](30444241.html)

High Performance Work Directory
-------------------------------

<span
class="aui-icon aui-icon-small aui-iconfont-info confluence-information-macro-icon"></span>

You can access your work directory quickly using the $WORK environmental
variable (i.e. '`cd $WORK'`).

<span
class="aui-icon aui-icon-small aui-iconfont-error confluence-information-macro-icon"></span>

The `/work` directories are **not backed up**. Irreparable data loss is
possible with a mis-typed command. See [Preventing File
Loss](Preventing-File-Loss_29065313.html) for strategies to avoid this.

Every user has a corresponding directory under /work using the same
naming convention as `/home` (i.e. `/work/[group]/[username]`).  We
encourage all users to use this space for I/O to running jobs.  This
directory can also be used when larger amounts of space are temporarily
needed.  There is a **50TB per group quota**; space in /work is shared
among all users.  It should be treated as short-term scratch space, and
**is not backed up**<span style="color: rgb(255,0,0);"><span
style="color: rgb(0,0,0);">Please use the `hcc-du` command to check your
own and your group's usage, and back up and clean up your files at
reasonable intervals in $WORK.</span></span>

### Purge Policy

HCC has a **purge policy on /work** for files that become dormant.
 After **6 months of inactivity on a file (26 weeks)**, an automated
purge process will reclaim the used space of these dormant files.  HCC
provides the **`hcc-purge`** utility to list both the summary and the
actual file paths of files that have been dormant for **24 weeks**.
 This list is periodically generated; the timestamp of the last search
is included in the default summary output when calling `hcc-purge` with
no arguments.  No output from `hcc-purge` indicates the last scan did
not find any dormant files.  `hcc-purge -l` will use the less pager to
list the matching files for the user.  The candidate list can also be
accessed at the following path:` /lustre/purge/current/${USER}.list`.
 This list is updated twice a week, on Mondays and Thursdays.

<span
class="aui-icon aui-icon-small aui-iconfont-error confluence-information-macro-icon"></span>

/work is intended for recent job output and not long term storage.
Evidence of circumventing the purge policy by users will result in
consequences including account lockout.

 

If you have space requirements outside what is currently provided,
please
email <a href="mailto:hcc-support@unl.edu" class="external-link">hcc-support@unl.edu</a> and
we will gladly discuss alternatives.

[Attic](Using-Attic_11635580.html)
----------------------------------

Attic is a near line archive available for purchase at HCC.  Attic
provides reliable large data storage that is designed to be more
reliable then `/work`, and larger than `/home`. Access to Attic is done
through [Globus Connect](Globus-Connect_6357013.html).

More details on Attic can be found on HCC's
<a href="https://hcc.unl.edu/attic" class="external-link">Attic</a>
website.

<span style="color: rgb(0,0,0);line-height: 1.4285715;font-size: 20.0px;">[Globus Connect](Globus-Connect_6357013.html)</span>
------------------------------------------------------------------------------------------------------------------------------

For moving large amounts of data into or out of HCC resources, users are
highly encouraged to consider using [Globus
Connect](Globus-Connect_6357013.html).

Using Box
---------

You can use your [UNL
Box.com](Integrating-Box-with-HCC_8192521.html) account to download and
upload files from any of the HCC clusters.

 

Attachments:
------------

<img src="assets/images/icons/bullet_blue.gif" width="8" height="8" />
[HCCStorageOptions\_cb\_edits.pdf](attachments/332256/30444364.pdf)
(application/pdf)  
<img src="assets/images/icons/bullet_blue.gif" width="8" height="8" />
[HCCStorageOptions\_cb\_edits.png](attachments/332256/30444365.png)
(image/png)  
<img src="assets/images/icons/bullet_blue.gif" width="8" height="8" />
[StorageOptions.png](attachments/332256/35325560.png) (image/png)