On 15/05/2013 20:15, Pat Ferrel wrote:
Scaling some jobs requires splitting files to get multiple mappers, and in other 
cases files must be combined into one to even run the job. Scaling is one huge 
reason to use Mahout, so it seems like it should be easier, and input should 
universally be a dir of parts *or* a single file.
+1 to this. It should be easier to switch jobs between sequential and distributed modes.
