[ 
https://issues.apache.org/jira/browse/MAHOUT-641?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13013912#comment-13013912
 ] 

Sean Owen commented on MAHOUT-641:
----------------------------------

I'll commit it. More broadly I think there is some inconsistency in the code 
about who passes a Configuration when, and a lot of code ignores or makes its 
own Configuration. Configuration is the context of Hadoop operations, and 
probably needs to be passed around most everywhere in Mahout (rather than be 
created anew). The code probably "gets away" with not doing it in several 
cases, but not this one.

I cleaned up a small version of this in MAHOUT-633, having to do with obtaining 
a FileSystem to delete files properly. It's a broader issue and not one to 
solve right now perhaps.

> DistributedRowMatrix hadoop jobs ignore Configuration set via setConf()
> -----------------------------------------------------------------------
>
>                 Key: MAHOUT-641
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-641
>             Project: Mahout
>          Issue Type: Bug
>          Components: Math
>    Affects Versions: 0.4, 0.5
>         Environment: Mahout 0.4 and 0.5-SNAPSHOT run with Hadoop 0.20.2 on 
> Mac OS 10.6 and Linux x86_64 2.6.18
>            Reporter: Jonathan Traupman
>              Labels: configuration, hadoop
>             Fix For: 0.5
>
>         Attachments: MAHOUT-641.patch
>
>   Original Estimate: 24h
>  Remaining Estimate: 24h
>
> I am using the Distributed Lanczos solver which uses the DistributedRowMatrix 
> class for it's internal calculation. In our environment, I need to set some 
> Configuration properties (specifically hadoop.job.ugi & hadoop.queue.name), 
> which are when set via the setConf() method on DistributedLanczosSolver. 
> These are correctly passed to DistributedRowMatrix via its setConf() method, 
> but are not passed into the Hadoop JobConfs created by the various static 
> routines in MatrixMultiplicationJob, TimesSquaredJob, and TransposeJob.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to