Hi, I'm trying to implement a way of using the hadoop-streaming-2.2.0.jar to export a subset of data ( timerange ) to a mapper and reduce application written in another language. However I have been unable to get anything but all the data from HBase table.
Looking at the code and forums, it seems that as hadoop-streaming doesnt support the new API it isn't possible to give it scan parameters to set the timerange or other filters. I found some classes online (http://cp1985chenpeng.iteye.com/blog/1315076) that implement the funuctionality of the newer API in a say that hadoop-streaming seems to be ok with, but when it gets the the mapreduce.Job part of processing it still just returns the whole table rather than the rows between the timeframe I am specifying. Is there a known way that I should be able to do this? -- -Ian Brooks Senior server administrator - Sensewhere
