[
https://issues.apache.org/jira/browse/MAHOUT-474?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Han Hui Wen updated MAHOUT-474:
--------------------------------
Affects Version/s: 0.4
Description:
!https://issues.apache.org/jira/secure/attachment/12451985/RowSimilarityJob-CooccurrencesMapper-SimilarityReducer.jpg!
From above picture ,we can see that the output of pairwiseSimilarity is very
large ,we should compress them.
SequenceFileOutputFormat.setOutputCompressionType(job, style);
SequenceFileOutputFormat.setCompressOutput(job, compress);
SequenceFileOutputFormat.setOutputCompressorClass(job, codecClass)
was:
!https://issues.apache.org/jira/secure/thumbnail/12451985/12451985_RowSimilarityJob-CooccurrencesMapper-SimilarityReducer.jpg!
From above picture ,we can see that the output of pairwiseSimilarity is very
large ,we should compress them.
SequenceFileOutputFormat.setOutputCompressionType(job, style);
SequenceFileOutputFormat.setCompressOutput(job, compress);
SequenceFileOutputFormat.setOutputCompressorClass(job, codecClass)
Fix Version/s: 0.4
Component/s: Collaborative Filtering
> Should compress output of Job pairwiseSimilarity and Job asMatrix
> -----------------------------------------------------------------
>
> Key: MAHOUT-474
> URL: https://issues.apache.org/jira/browse/MAHOUT-474
> Project: Mahout
> Issue Type: Improvement
> Components: Collaborative Filtering
> Affects Versions: 0.4
> Reporter: Han Hui Wen
> Fix For: 0.4
>
>
> !https://issues.apache.org/jira/secure/attachment/12451985/RowSimilarityJob-CooccurrencesMapper-SimilarityReducer.jpg!
> From above picture ,we can see that the output of pairwiseSimilarity is very
> large ,we should compress them.
> SequenceFileOutputFormat.setOutputCompressionType(job, style);
> SequenceFileOutputFormat.setCompressOutput(job, compress);
> SequenceFileOutputFormat.setOutputCompressorClass(job, codecClass)
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.