This is nothing to do with Mahout, but how your Hadoop cluster is
configured. I assume you have turned map / reduce output compression
and are using the LZO codec.
On Thu, Jul 4, 2013 at 11:06 AM, Sugato Samanta wrote:
> Hello,
>
> I was trying to execute the recommendation using movie lens data
Hello,
I was trying to execute the recommendation using movie lens data (
http://www.grouplens.org/node/73). The mahout code is running fine but the
output files are being generated in LZ4 format. Does any one know how to
uncompress this type of file in linux?
Cloudera version: cdh4.2
Linux versi
Is there any way to parallelize SGD to make it train faster? I have 2million
training samples, it takes at least 5-6 hours to complete the training. I tried
new group training. It takes around 40 minutes. I understand it has just 1
records. Is there any tuning parameter to improve the perfor