I think you'll want to look at the Hadoop Streaming or Hadoop Pipes API. Further down the line, I think somebody will want to implement a Haskell library to deal with the Avro serialization protocol when it becomes possible to write non-JVM mappers and reducers directly. This JIRA issue covers the RPC part of the Avro-Hadoop integration work:
https://issues.apache.org/jira/browse/HADOOP-6659 Looks like folks have already implemented support for Thrift and Protocol Buffers, so implementing a library for Avro would likely be pretty similar. -md begin C K Kashyap quotation: > Dear Haskellers, > > A big part of my new job requires tuning app's on Hadoop. I was wondering if > there is a way to push some Haskell code in the mix. I did some googling on > "Hadoop/Haskell" and came across Holumbus - but looks like that is parallel > to Hadoop. > > I was thinking in the lines of doing a Haskell implementation that could run > in a Hadoop cluster - has anyone tried anything like that? > > -- > Regards, > Kashyap > _______________________________________________ > Haskell-Cafe mailing list > Haskell-Cafe@haskell.org > http://www.haskell.org/mailman/listinfo/haskell-cafe _______________________________________________ Haskell-Cafe mailing list Haskell-Cafe@haskell.org http://www.haskell.org/mailman/listinfo/haskell-cafe