CPython would be an option for integration, but not from the Java side. Each map call would involve a cross-VM call, so processing a typical GB of data would likely involve millions of calls to the mapper. At 1 ms per call, that is just infeasible.
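A back-of-the-envelope sketch of that arithmetic, in Python. The 1 ms per-call figure comes from the quoted JPype measurement; the 200-byte average record size is purely a hypothetical assumption, chosen so that 1 GB yields a few million map calls. The function names are illustrative only.

```python
def estimate_records(total_bytes, avg_record_bytes):
    """Number of map calls, assuming one call per record."""
    return total_bytes // avg_record_bytes


def cross_call_overhead_seconds(num_calls, per_call_ms=1.0):
    """Total time spent purely on cross-VM call overhead."""
    return num_calls * per_call_ms / 1000.0


# Assumption: 1 GB of input and 200-byte records (hypothetical).
calls = estimate_records(10**9, 200)
overhead = cross_call_overhead_seconds(calls)
print(f"{calls} calls, about {overhead / 60:.0f} minutes of call overhead")
```

Under these assumptions the overhead alone is well over an hour per GB, before the mapper does any real work. Smaller records make it proportionally worse.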
The other option would be to build on the C ports of Hadoop, but I have lots of Java code that I can't leave behind for these apps.

On 2/5/08 8:23 PM, "Markus Weimer" <[EMAIL PROTECTED]> wrote:

> So, Jython was out of the question
> as I don't know any way to access a CPython extension from Jython. I found
> JPype to do the job and to do it well (the overhead per Cross-VM call was
> around 1ms on my laptop). So for those craving for a state-of-the-art Python
> with decent extensions and access to Java code, this might be an option.
