[
https://issues.apache.org/jira/browse/YARN-702?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13670153#comment-13670153
]
Siddharth Seth commented on YARN-702:
-------------------------------------
bq. The issue in hbase is that we'd like to have one single conf that gets the
new settings but the minicluster creates a new copy of the conf passed in
instead of augmenting it. Thus when hbase MR jobs run, we need to copy out a
whole bunch of settings back out.
There's some discussion about this on HBASE-7904.
https://issues.apache.org/jira/browse/HBASE-7904?focusedCommentId=13611311&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-13611311
In the steps mentioned, is the same config object used ? Since MR is the last
step, is it possible to replace this configuration with the conf returned by
the MiniMRCluster (getConfiguration()).
If the MiniMRCluster were to use the conf object passed in as a parameter, it
would end up adding a bunch of keys to this config (Everything defined in
yarn-default and yarn-site) - effectively the same as a getConfiguration on the
cluster.
I think it's useful for the cluster to run with a different config, and not
share this with the multiple jobs that may be submitted to it. What may be
possible is for the MiniMRCluster to copy these configs into the user specified
config. That would be very similar to a blanket copy.
As mentioned before, I think it'll be useful to expose a limited configuration
set (RM address, isMiniCluster, etc) which can then be merged into client
configs. Alternately, have the MiniCluster merge in this limited set. This
could be a start towards separating cluster specific configuration and client
specific configuration.
> minicluster classpath construction requires user to set yarn.is.minicluster
> in the job conf
> -------------------------------------------------------------------------------------------
>
> Key: YARN-702
> URL: https://issues.apache.org/jira/browse/YARN-702
> Project: Hadoop YARN
> Issue Type: Improvement
> Components: nodemanager
> Affects Versions: 2.0.4-alpha
> Reporter: Sandy Ryza
> Assignee: Sandy Ryza
>
> YARN-129 improved classpath construction for miniclusters by, when
> yarn.is.minicluster is set, adding the current JVM's classpath to the
> ContainerLaunchContext for the MR AM and tasks. An issue with this is that
> it requires the user to set yarn.is.minicluster on the mapreduce side in the
> job conf, if they are not copying to RM conf into the jobconf.
> I think it would be better to bypass the ContainerLaunchContext and instead
> have the nodemanager check the property, and if it is true, do the classpath
> additions there.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira