I agree with others in this list that Java provides faster software development, the IO cost in Java is practically the same as in C/C++, etc. In short, most pieces of distributed software can be written in Java without any performance hiccups, as long as it is only system metadata that is handled by Java.
One problem is when data-flow has to occur in Java. Each record that is read from the storage has to be de-serialized, uncompressed and then processed. This processing could be very slow in Java compared to when written in other languages, especially because of the creation/destruction of too many objects. It would have been nice if the map/reduce task could have been written in C/C++, or better still, if the sorting inside the MR framework could occur in C/C++. thanks, dhruba On Mon, Oct 11, 2010 at 4:50 PM, helwr <[email protected]> wrote: > > Check out this thread: > https://www.quora.com/Why-was-Hadoop-written-in-Java > -- > View this message in context: > http://lucene.472066.n3.nabble.com/Why-hadoop-is-written-in-java-tp1673148p1684291.html > Sent from the Hadoop lucene-users mailing list archive at Nabble.com. > -- Connect to me at http://www.facebook.com/dhruba
