diff --git a/content/handling_data/_index.md b/content/handling_data/_index.md index 5dee5c2ffdbfe64ce59d5892fbc85144665fa3c6..d1f8c5cb845cee99fc9c3a0eb65042a212172ef8 100644 --- a/content/handling_data/_index.md +++ b/content/handling_data/_index.md @@ -39,6 +39,7 @@ If you are not sure if a dataset the software requires is already available on S Many public datasets are commonly used for running jobs across various scientific fields. To avoid any [per-user or per-group quota issues]({{<relref "data_storage">}}), HCC can host these datasets on a system-wide location on Swan excluded from the purge policy, such that the entire HCC community can benefit from using a shared copy. HCC currently hosts a few public datasets on Swan that can be accessed via data modules: + - **biodata/1.0** - [Static data resources for bioinformatics/computational biology]({{<relref "biodata_module" >}}) - **mldata/1.0** - Static data resources for machine-learning/AI (e.g., ImageNet, TCGA, CAMELYON, TCIA) - **mridata/1.0** - Static data resources for MRI/NeuroImaging (e.g., Penn Memory Center 3T ASHS 1.0 Atlas)