[ 
https://issues.apache.org/jira/browse/MAHOUT-994?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13240015#comment-13240015
 ] 

tom pierce commented on MAHOUT-994:
-----------------------------------

It would make a lot of sense to me to unify with other Hadoop projects and 
start using the hadoop launcher script whenever we can - no need to maintain a 
custom reinvented wheel there.

As far as the job jar is concerned, I don't think it's customary to include 
Hadoop libs - seems redundant, and potentially dangerous when there are 
mismatched versions of classes in the classpath.  I know there are ways to 
specify/determine which versions of things to use and make this kind of thing 
work, but it has been my experience that it always becomes overcomplicated 
after a while - it is just bad juju.

Looks like the Hadoop classes in 
examples/target/dependency/hadoop-core-0.20.204.0.jar are tacked onto the 
classpath inside bin/mahout, which is why standalone mode currently 'just 
works' on a box with no Hadoop install.  This seems less than ideal, though I 
can see why it is done.
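
To make the launcher idea concrete, here is a rough sketch of how a script 
could look for the hadoop launcher without insisting on HADOOP_HOME.  This is 
not what bin/mahout does today; the function name find_hadoop_launcher and 
the lookup order (PATH first, then HADOOP_HOME as a legacy fallback) are my 
own assumptions for illustration.

```shell
#!/bin/sh
# Hypothetical helper, not taken from bin/mahout: print the path of a usable
# hadoop launcher, or return non-zero if none can be found.
find_hadoop_launcher() {
  if command -v hadoop >/dev/null 2>&1; then
    # A hadoop launcher is already on the PATH; use it as-is.
    command -v hadoop
  elif [ -n "${HADOOP_HOME:-}" ] && [ -x "$HADOOP_HOME/bin/hadoop" ]; then
    # Legacy fallback for installs that still export HADOOP_HOME.
    printf '%s\n' "$HADOOP_HOME/bin/hadoop"
  else
    return 1
  fi
}

if launcher=$(find_hadoop_launcher); then
  # Only report the choice here; a real script would exec the launcher.
  echo "using Hadoop launcher: $launcher"
else
  echo "no Hadoop launcher found; falling back to standalone mode"
fi
```

With something like this, the bundled hadoop-core jar would only matter in 
the fallback branch, and a box with a real Hadoop install would pick up its 
classes from the launcher.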
                
> mahout script shouldn't rely on HADOOP_HOME since that was deprecated in all 
> major Hadoop branches
> --------------------------------------------------------------------------------------------------
>
>                 Key: MAHOUT-994
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-994
>             Project: Mahout
>          Issue Type: Bug
>          Components: Integration
>    Affects Versions: 0.6
>            Reporter: Roman Shaposhnik
>
> Mahout should follow the Pig and Hive example and not rely explicitly on 
> HADOOP_HOME and HADOOP_CONF_DIR

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        
