[ 
https://issues.apache.org/jira/browse/MAHOUT-457?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12895810#action_12895810
 ] 

Sebastian Schelter commented on MAHOUT-457:
-------------------------------------------

{quote}
I think you are missing a slash in the URI:

s3:/testingbucket-12345/tmp/prePartialMultiply2

What happens if you try:

s3://testingbucket-12345/tmp/prePartialMultiply2

Andrew
{quote}

Hi Andrew,

I only specified the input path of the job (correctly, with s3:// at 
the beginning), and the paths for the M/R jobs executed before 
"partialMultiply" worked correctly. I guess it must have something to do with 
the fact that this path contains two comma-separated URLs.

I don't know why there's only one slash in the stack trace; maybe that was a 
typo on my part, as I had to change the real name of the bucket before posting here.
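
One possible explanation for the single slash, offered as a guess rather than a 
confirmed diagnosis: if the combined string is normalized as one path before it 
is split on the comma, a step that collapses "//" to "/" would damage the second 
URL only. The sketch below imitates that with plain java.net.URI and no Hadoop 
classes. The normalize() and split() helpers and the name prePartialMultiply1 
are assumptions made for illustration, not code from Mahout or Hadoop.

```java
import java.net.URI;

public class CombinedPathSketch {

    // Assumption: imitates a normalization step that collapses "//" to "/"
    // in everything after the scheme and authority of the combined string.
    static String normalize(String location) {
        int authorityStart = location.indexOf("://") + 3;
        int pathStart = location.indexOf('/', authorityStart);
        String prefix = location.substring(0, pathStart);
        String path = location.substring(pathStart).replace("//", "/");
        return prefix + path;
    }

    // Splits the combined string back into its comma-separated parts.
    static String[] split(String combined) {
        return combined.split(",");
    }

    public static void main(String[] args) {
        // The first path name is assumed; only the second appears in the exception.
        String first = "s3://testingbucket-12345/tmp/prePartialMultiply1";
        String second = "s3://testingbucket-12345/tmp/prePartialMultiply2";

        String combined = normalize(first + "," + second);
        for (String part : split(combined)) {
            // A URI of the form "s3:/bucket/..." has no authority, so no host.
            System.out.println(part + " -> host = " + URI.create(part).getHost());
        }
    }
}
```

In this sketch the first part keeps its bucket as the host, while the second 
part comes out as s3:/testingbucket-12345/tmp/prePartialMultiply2 with no host, 
which matches the URI in the exception. If that is what happens, adding each 
input path to the job separately instead of concatenating them into one Path 
might avoid the problem.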

> ItemSimilarityJob and RecommenderJob don't work on Amazon ElasticMapReduce
> --------------------------------------------------------------------------
>
>                 Key: MAHOUT-457
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-457
>             Project: Mahout
>          Issue Type: Bug
>          Components: Collaborative Filtering
>            Reporter: Sebastian Schelter
>         Attachments: MAHOUT-457.patch
>
>
> I'm currently evaluating ItemSimilarityJob and RecommenderJob on 
> ElasticMapReduce. It seems we have some small problems with S3, mostly due to 
> the fact that we need to use FileSystem.get(path.toUri(), conf) instead of 
> FileSystem.get(conf) in the code. I will create a patch for that in the next 
> few days.
> I'm writing this mail because I encountered another problem I currently can't 
> solve. RecommenderJob is emulating MultipleInputs (which is currently missing 
> in Hadoop 0.20 AFAIK) by reading data from a combined path that is built like 
> this:
>     new Path(prePartialMultiplyPath1 + "," + prePartialMultiplyPath2)
> My Job always fails with this exception here:
>     java.lang.IllegalArgumentException: Invalid hostname in URI 
> s3:/testingbucket-12345/tmp/prePartialMultiply2

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
