Re: LZ4 file extensions from Mahout recommender

2013-07-04 Thread Sean Owen
This is nothing to do with Mahout, but how your Hadoop cluster is configured. I assume you have turned map / reduce output compression and are using the LZO codec. On Thu, Jul 4, 2013 at 11:06 AM, Sugato Samanta wrote: > Hello, > > I was trying to execute the recommendation using movie lens data

LZ4 file extensions from Mahout recommender

2013-07-04 Thread Sugato Samanta
Hello, I was trying to execute the recommendation using movie lens data ( http://www.grouplens.org/node/73). The mahout code is running fine but the output files are being generated in LZ4 format. Does any one know how to uncompress this type of file in linux? Cloudera version: cdh4.2 Linux versi

RE: significance of FEATURES in SGD

2013-07-04 Thread Chandra Mohan, Ananda Vel Murugan
Is there any way to parallelize SGD to make it train faster? I have 2million training samples, it takes at least 5-6 hours to complete the training. I tried new group training. It takes around 40 minutes. I understand it has just 1 records. Is there any tuning parameter to improve the perfor