Hi folks, We're planning to set up a 5-node Hadoop cluster. I'm thinking of just leaving dfs.replication at 3, which is the default. Each data node will have 500 GB of local storage for DFS use.
How do I calculate the amount of usable DFS space given the replication setting and the number of nodes in this case? Is there a formula I can use? Any help is greatly appreciated. Thanks
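
For reference, a rough back-of-the-envelope estimate is raw capacity divided by the replication factor: (number of data nodes x storage per node) / dfs.replication. The sketch below works through the numbers from this post. It assumes all 5 nodes are data nodes and that every block is stored at the configured replication factor. The function name and the `non_dfs_fraction` parameter are made up for illustration and are not part of Hadoop; the parameter is only a placeholder for whatever share of each disk ends up unavailable to DFS (reserved space, temporary files, and so on).

```python
def usable_dfs_capacity_gb(nodes, per_node_gb, replication, non_dfs_fraction=0.0):
    """Rough estimate of usable DFS space in GB.

    Hypothetical helper for illustration only (not a Hadoop API).
    non_dfs_fraction is an assumed share of each disk that DFS cannot use.
    """
    if replication < 1:
        raise ValueError("replication must be at least 1")
    if not 0.0 <= non_dfs_fraction < 1.0:
        raise ValueError("non_dfs_fraction must be in [0, 1)")

    # Total raw disk space across all data nodes.
    raw_gb = nodes * per_node_gb
    # Remove the assumed share that DFS cannot use.
    available_gb = raw_gb * (1.0 - non_dfs_fraction)
    # Every block is stored `replication` times, so divide by that factor.
    return available_gb / replication


# Numbers from the post: 5 nodes, 500 GB each, dfs.replication = 3.
estimate = usable_dfs_capacity_gb(nodes=5, per_node_gb=500, replication=3)
print(f"{estimate:.1f} GB")  # prints "833.3 GB"
```

So with these figures the cluster would hold roughly 833 GB of actual data at best; the real number will be somewhat lower once non-DFS usage on each disk is taken into account.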