Before I bark up the wrong tree, I would like your experience to point me in the right direction.
Use case:
Content 1: 500TB to 750TB, up to 1PB of storage. Not permanent: it gets deleted when the project is finished. Every file is multiple GBs in size. Speed matters.
Content 1 is made of 3 parts that can and should be on separate physical drives.
Assuming total of 750TB
-> 1a = 100TB
-> 1b = 300TB
-> 1c = 350TB
Content 2: rough guess 150 to 200TB, permanent, used across multiple projects. File size is usually less than 1GB, perhaps a couple of hundred MBs, some much smaller. Tens of thousands of files.
Processing
Step 1: only Content 1 is being worked on (all the files). Generate 1a, read 1a to generate 1b, then read 1a again to generate 1c.
Step 2: only Content 2 is being worked on (just a few files).
Step 3: all of Content 1 plus a selected few files from Content 2 are being worked on at the same time.
Users: for now just one, myself.
Notes:
- I'm allergic to the word NAS because the network connection is dead slow (a bottleneck); it would render the great speed of the CPU and NVMe RAIDs useless.
- Content 1: RAID 0, speed matters. If lost it's not a big deal, I can regenerate it (re-HPC).
- Content 2: I would rather have a backup somewhere
This storage will be built and used over time, bit by bit, drive by drive (if too expensive). The drives will be used separately as they're purchased, before I bring them all together to make the 750TB-1PB. So they have to be coherent within each sub-group (1a, 1b, 1c and 2) and RAIDable (if that word exists), e.g. same capacity.
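To make the sizing concrete, here is a rough Python sketch of how many same-capacity drives each sub-group would need. The 30.72TB drive size is only a placeholder assumption, not a drive I have picked, and Content 2 uses the upper 200TB guess; `GROUPS_TB`, `drives_needed` and `plan` are names made up for this illustration. It counts raw capacity only (no filesystem overhead, no redundancy), which fits RAID 0 for Content 1 but would understate Content 2 if that ends up mirrored or on parity.

```python
import math

# Target capacities in TB, taken from the post (Content 2 uses the upper guess).
GROUPS_TB = {"1a": 100, "1b": 300, "1c": 350, "2": 200}


def drives_needed(target_tb, drive_tb):
    """Smallest number of equal-capacity drives whose raw total meets target_tb."""
    return math.ceil(target_tb / drive_tb)


def plan(drive_tb):
    """Map each group to (drive count, raw capacity in TB) for one drive size."""
    result = {}
    for name, target in GROUPS_TB.items():
        count = drives_needed(target, drive_tb)
        result[name] = (count, round(count * drive_tb, 2))
    return result


if __name__ == "__main__":
    DRIVE_TB = 30.72  # hypothetical drive size, an assumption for illustration only
    for name, (count, raw_tb) in plan(DRIVE_TB).items():
        print(f"{name}: {count} drives, {raw_tb} TB raw")
```

Swapping `DRIVE_TB` for another size shows how quickly the drive count per group changes, which is the number that decides how many bays or slots each set needs.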
Why am I asking? Because I will be buying and using this storage over several months while it is being built up. I currently have workstation cases and a mix of workstation motherboards (ASUS® PRO WS WRX90E-SAGE SE) and server motherboards (SUPERMICRO H13SSL-N), so I'm tempted to start with M.2 drives in an Asus Hyper M.2 x16 Gen5 Card or similar temporary solutions. That could end up as a big spaghetti: not a coherent set of drives I can later bring together under one roof as the 1PB storage (not a NAS) on a different, future 48-RAM-slot motherboard, e.g. TURIN2D48G-2L+/500W.
What storage approach would you go for now, for immediate use, taking into account that it's meant to become 4 separate sets of storage (1a, 1b, 1c and 2)? What type of drives would you choose for each of 1a, 1b, 1c and 2: U.2, U.3, M.2, SATA SSD, HDD? And how do I keep the flexibility of moving the drives around?