Hi Team,

I have 10 disks over which I am running my HDFS. Of these, disk5 is where hadoop.tmp.dir is configured. When I run my jobs, I see much heavier I/O on this disk than on the others.

Can you guide me to the standard practice to follow so that this I/O is distributed across the other disks as well? What is the recommended way to set the hadoop.tmp.dir parameter?

Any help would be highly appreciated. Below is the I/O while I am running a huge job.
Device:    tps    Blk_read/s   Blk_wrtn/s     Blk_read      Blk_wrtn
sda       2.11       37.65       226.20     313512628    1883809216
sdb       1.47       96.44       152.48     803144582    1269829840
sdc       1.45       93.03       153.10     774765734    1274979080
sdd       1.46       95.06       152.73     791690022    1271944848
sde       1.47       92.70       153.24     772025750    1276195288
sdf       1.55       95.77       153.06     797567654    1274657320
sdg      10.10      364.26      1951.79    3033537062   16254346480
sdi       1.46       94.82       152.98     789646630    1274014936
sdh       1.44       94.09       152.57     783547390    1270598232
sdj       1.44       91.94       153.37     765678470    1277220208
sdk       1.52       97.01       153.02     807928678    1274300360

*------------------------*
Cheers !!!
Siddharth Tiwari
Have a refreshing day !!!
"Every duty is holy, and devotion to duty is the highest form of worship of God."
"Maybe other people will try to limit me but I don't limit myself"