Pass hadoop configuration to methods that use FileSystem operations, even if
they don't invoke map/reduce jobs
--------------------------------------------------------------------------------------------------------------
Key: MAHOUT-651
URL: https://issues.apache.org/jira/browse/MAHOUT-651
Project: Mahout
Issue Type: Improvement
Components: Clustering
Affects Versions: 0.4
Reporter: Robert Mahfoud
Fix For: 0.5
Some classes in the Classification component internally use the hadoop's
FileSystem class, however, they instantiate the hadoop configuration locally in
the method using {{new Configuration()}}. This limits the ability to integrate
these tools within applications that manage and enrich their own configuration
rather than rely on the default hadoop resources that get loaded when calling
{{new Configuration()}}.
The fix is simply to make these methods take a {{Configuration}} parameter
rather than creating a new instance when needed. An example for an that creates
a new {{Configuration}} instances is:
{{org.apache.mahout.clustering.kmeans.KMeansUtil.configureWithClusterInfo(Path,
List<Cluster>)}}
This problem could also exists beyond the Clustering module, but this issue
only addresses the Clustering code.
--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira