@@ -39,6 +39,7 @@ If you are not sure if a dataset the software requires is already available on S
Many public datasets are commonly used for running jobs across various scientific fields. To avoid [per-user or per-group quota issues]({{< relref "data_storage" >}}), HCC can host these datasets in a system-wide location on Swan that is excluded from the purge policy, so that the entire HCC community can use a single shared copy.
HCC currently hosts a few public datasets on Swan that can be accessed via data modules:
- **biodata/1.0** - [Static data resources for bioinformatics/computational biology]({{< relref "biodata_module" >}})
- **mldata/1.0** - Static data resources for machine-learning/AI (e.g., ImageNet, TCGA, CAMELYON, TCIA)
- **mridata/1.0** - Static data resources for MRI/NeuroImaging (e.g., Penn Memory Center 3T ASHS 1.0 Atlas)
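
As a minimal sketch of how one of these data modules might be used, the snippet below inspects and then loads `mldata/1.0`. It assumes the standard `module` command is available in the shell (as on Swan); the source does not list what each module defines, so `module show` is used to display that rather than guessing any variable or path names.

```shell
# Pick one of the data modules listed above.
DATA_MODULE="mldata/1.0"

# The `module` command is assumed to exist only on the cluster,
# so skip the calls when running anywhere else.
if type module >/dev/null 2>&1; then
    # Print what the module would change (paths, variables) without loading it.
    module show "$DATA_MODULE"
    # Apply the module's settings to the current shell or job script.
    module load "$DATA_MODULE"
else
    echo "module command not found; run this on Swan" >&2
fi
```

The same lines can be placed in a job submission script before the commands that read the dataset.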