On Mon, Feb 22, 2010 at 1:55 PM, Ted Dunning <ted.dunn...@gmail.com> wrote:
> I see pig as useful for data preparation, but for any numerical tasks, it
> is likely to be completely hopeless.

Pig will be a great tool for experimenting quickly with algorithms. But with people here trying to focus on using Vector to standardize the input/output process, it will be tough for the small group here to port that to Pig, or to help Pig scripts reuse it.

As long as the input and output of Pig-based algorithms are based on VectorWritable, I don't see any reason not to include Pig. But bear in mind that the previous Pig submission, https://issues.apache.org/jira/browse/MAHOUT-106, still hasn't moved into trunk. If anyone is willing to help standardize on using Pig with vectors as input, they are more than welcome.

One thing we definitely don't want at this point is for every algorithm to have its own input format.

Robin