Hey Guys, I need a little bit help. I need to store a lot of small files. The files are binary like images for example or text files. Both file types are about 100k large. I need to be able to recall the file by a path or an unique ID. Like a normal path in a Unix / Win OS, a String like in S3 (or any rational database) or a hash like with IPFS. My problem is, that there going to be a lot of files. Millions? Billions? Just as an example... Let's say, you want to have a copy of github. How to they store their hosted files?
About 10 Years ago I had to code a similar project. It was a website, where customers could buy photos, that were printed and framed, ready to put them on the wall. And for this project I used a MySQL Database to store the Data. It was ok in the beginning. Then DSLRs got more popular and the file size increased. And the number of files increased. And one time MySQL Crashed and the MySQL Own Recovery Process took hours to get the Database online again.
On the other hand, a friend had this year a problem with his project. He got a lot of traffic and the inodes overfilled the FS with billions of small files.
That's why I have currently no idea, how to handle this amount of files.
BTW. It's a non profit project, so there is not really a budget.
About 10 Years ago I had to code a similar project. It was a website, where customers could buy photos, that were printed and framed, ready to put them on the wall. And for this project I used a MySQL Database to store the Data. It was ok in the beginning. Then DSLRs got more popular and the file size increased. And the number of files increased. And one time MySQL Crashed and the MySQL Own Recovery Process took hours to get the Database online again.
On the other hand, a friend had this year a problem with his project. He got a lot of traffic and the inodes overfilled the FS with billions of small files.
That's why I have currently no idea, how to handle this amount of files.
BTW. It's a non profit project, so there is not really a budget.