You might want to take a look at Dumbo for use in writing hadoop jobs with python.
On Saturday, October 9, 2010, Shi Yu <[email protected]> wrote: > Wondering how Hadoop running with python and other languages. Java is easy to > develop, however, not very efficient to handle numerical computation with > objects like sparse matrices. Maybe hadoop will have Matlab, R extensions as > well? Hope to see it happens. > > > > On 2010-10-10 1:07, Arvind Kalyan wrote: > > On Sat, Oct 9, 2010 at 9:40 PM, elton sky<[email protected]> wrote: > > > > I always have this question but couldn't find proper answer for this. For > system level applications, c/c++ is preferable. But why this one using > java? > > > > > Look at the system (software) requirements for running Hadoop: > http://hadoop.apache.org/common/docs/current/single_node_setup.html#PreReqs > > Imagine how it would be, if it were to be written in C/C++. > > While C/C++ might give you a performance improvement at run-time, it can be > a total nightmare to develop and maintain. Especially if the network gets to > be heterogeneous. > > > > > >
