Hi Greg,
I only had time to skim through it. However, one thing I find is missing in the Data Management part is something like this :

- If you have tens of thousands of files (or more), you are probably doing it wrong. Look for archiving those files in chunks, or combining them in some way.

On a cluster, having tens of thousands of files will be a killer, but even on a personnal computer, it will make your data management a nightmare.
I have seen users who think that having 2.5M files in a directory is "ok"...

Thanks,

Maxime

Le 2015-11-17 06:10, Greg Wilson a écrit :
Hi everyone,

I've started drafting a short guide to "good enough" practices in scientific computing, and would be grateful for your input. You can view the outline at http://swcarpentry.github.io/good-enough-practices-in-scientific-computing/, and leave comments at https://github.com/swcarpentry/good-enough-practices-in-scientific-computing (either as issues or pull requests - if you don't have permission, just email me and I'll post it).

Thanks,
Greg



_______________________________________________
Discuss mailing list
[email protected]
http://lists.software-carpentry.org/mailman/listinfo/discuss_lists.software-carpentry.org

Reply via email to