El 15/05/2013 20:15, Pat Ferrel escribió:
+1 To this. It should be easier to switch between sequential and distributed mode of jobs.Scaling some jobs requires splitting files to get multiple mappers and in other cases files must be combined into one to even run the job. Scaling is one huge reason to use Mahout, so it seems like it should be easier and input should universally be a dir of parts *or* single file.
