Since you're asking about pain points in general: when I first started using Hadoop, my biggest pain was the lack of clear, complete documentation for the MR and DFS user APIs (along with other small points).
It would be nice to see work done to add at least one example or semi-example to the javadocs for every Input/OutputFormat, Mapper/Reducer implementation, etc. I believe examples and snippets help new devs a ton (far more than descriptions of behavior alone).

On Fri, Mar 2, 2012 at 9:45 PM, Kunaal <kunalbha...@alumni.cmu.edu> wrote:
> I am doing a general poll on what are the most prevalent pain points that
> people run into with Hadoop? These could be performance related (memory
> usage, IO latencies), usage related or anything really.
>
> The goal is to look for what areas this platform could benefit the most in
> the near future.
>
> Any feedback is much appreciated.
>
> Thanks,
> Kunal.

--
Harsh J