To systematically alleviate user inconvenience and management workload caused by SSD capacity issues on computing nodes
Please note that in the following description, "system SSD" refers to the SSD where /local_dataset is located, while "additional SSD" refers to SSDs other than the system SSD, such as those where /data2/local_dataset is located.
Server Classification
Computing nodes and storage have been classified as follows.
A. Low-Capacity Nodes These are nodes equipped with only a system SSD of less than 1TB.
B. Low-Capacity Node with Additional SSD This refers to a node equipped with a system SSD of less than 1TB and an additional SSD.
C. High-Capacity Node This refers to nodes equipped with a system SSD of 1TB or more.
D. High-Capacity Nodes with Additional SSDs Refers to nodes equipped with a system SSD of 1TB or more and an additional SSD.
E. Remote Storage Refers to NAS and Ceph storage, excluding the SSDs of computing nodes.
Classification of Datasets
The types of datasets that can be uploaded to computing nodes and the applicable policies are summarized as follows.
Note: In the context below, a "dataset" refers to the directory containing files loaded for model training and inference, or files saved during training and inference.
Please delete the original containers of datasets (such as .tar and .zip files) so that they do not remain on the computing node.
Additionally, datasets that do not comply with the upload policy may be deleted at any time without prior notice. Please take note of this.
Other Notes
The policies applied to each node according to the above classifications are summarized in the table below.
🟧 : Upload allowed only to system SSD 🟦 : Upload allowed only to additional SSD 🟩 : Upload allowed to both system and additional SSD ⭕ : Upload allowed to remote storage (/data, /ceph_data) ❌ : Upload not allowed to this node or storage
| Type | Node List | Low-capacity datasets (<50GB) | Large datasets (>50GB) | Streaming Dataset | Pretrained weights, compile cache | | --- | --- | --- | --- | --- | --- | | A. Low capacity | moana-u[2-6] | 🟧 | ❌ | Varies by size | ❌ | | B. Low capacity + additional | ariel-v[1-13] | 🟧 | 🟦 | Varies by capacity | ❌ | | C. High capacity | moana-r[2, 5], y[1-7], u[1, 8], ariel-m2, n1 aurora-g1 | 🟧 | 🟩 | Varies by capacity | ❌ | | D. High capacity + Additional | moana-r[1, 3, 4], ariel-k[1, 2], g[1-5], aurora-g[2-8] | 🟧 | 🟩 | Varies by capacity | ❌ | | E. Ceph, NAS | - | ❌ | ❌ | ⭕ (Pilot Operation) | ⭕ |