Ted Dunning skrev:
I would say that it would be easier to use a system that has a full extension language such as grool or JAQL than pig. Resampling and discretization are really pretty straightforward applications of map reduce and should normally be collected as components into a larger composite mapper.
I was thinking we would use Pig as that larger composite mapper. If we wanted to add discretization to Mahout we would then add it to Pig. They seem to have a framework to do a lot of the things I want in a pre processing module.
But I don't know Pig enough to say if that could work for all the things we might want to do at pre processing time with Mahout.
> I should also have said that Pig is progressing very quickly. When do you think Pig might be "stable"? karl
