[
https://issues.apache.org/jira/browse/MAHOUT-457?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Schelter updated MAHOUT-457:
--------------------------------------
Attachment: MAHOUT-457-2.patch
> ItemSimilarityJob and RecommenderJob don't work on Amazon ElasticMapReduce
> --------------------------------------------------------------------------
>
> Key: MAHOUT-457
> URL: https://issues.apache.org/jira/browse/MAHOUT-457
> Project: Mahout
> Issue Type: Bug
> Components: Collaborative Filtering
> Reporter: Sebastian Schelter
> Attachments: MAHOUT-457-2.patch, MAHOUT-457.patch
>
>
> I'm currently evaluating ItemSimilarityJob and RecommenderJob on
> ElasticMapReduce, it seems we have some small problems with S3, mostly due to
> the fact that we need to use Filesystem.get(path.toUri(), conf) instead of
> Filesystem.get(conf) in the code. I will create a patch for that the next
> days.
> I'm writing this mail because I encountered another problem I currently can't
> solve. RecommenderJob is emulating MultipleInputs (which is currently missing
> in Hadoop 0.20 AFAIK) by reading data from a combined path that is built like
> that:
> new Path(prePartialMultiplyPath1 + "," + prePartialMultiplyPath2)
> My Job always fails with this exception here:
> java.lang.IllegalArgumentException: Invalid hostname in URI
> s3:/testingbucket-12345/tmp/prePartialMultiply2
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.