On Mon, Feb 22, 2010 at 1:55 PM, Ted Dunning <ted.dunn...@gmail.com> wrote:

> I see pig as useful for data preparation, but for any numerical tasks, it
> is likely to be completely hopeless.
>

PIG will be a great tool for experimenting quickly with algorithms. But with
people here trying to standardize the input/output process on Vector, it
will be tough for the small group here to port that to PIG, or to help PIG
scripts reuse it. As long as the input and output of PIG-based algorithms
are based on VectorWritable, I don't see any problem with including PIG. But
bear in mind that the previous PIG submission,
https://issues.apache.org/jira/browse/MAHOUT-106, still hasn't moved into
the trunk. If anyone is willing to help standardize on using PIG with
vectors as input, they are more than welcome.

One thing we definitely don't want at this point is for every algorithm to
have a different input format.

Robin
