Ted Dunning skrev:

I would say that it would be easier to use a system that has a full
extension language such as grool or JAQL than pig.  Resampling and
discretization are really pretty straightforward applications of map reduce
and should normally be collected as components into a larger composite
mapper.

I was thinking we would use Pig as that larger composite mapper. If we wanted to add discretization to Mahout we would then add it to Pig. They seem to have a framework to do a lot of the things I want in a pre processing module.

But I don't know Pig enough to say if that could work for all the things we might want to do at pre processing time with Mahout.

> I should also have said that Pig is progressing very quickly.

When do you think Pig might be "stable"?


   karl

Reply via email to