Hi all; I want to learn how can i estimate the hardware nedeed for hadoop cluster. is there any standart or other things?
for example I have 10TB data, and i will analiyze it... My replication factor will be 2. How much ram do i need for one node? how can I estimate it? How much disk do i need for one node ? how can I estimate it? How many core - CPU do i need for one node? thanks in advance..