[
https://issues.apache.org/jira/browse/DRILL-4591?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15292759#comment-15292759
]
Paul Rogers commented on DRILL-4591:
------------------------------------
A simpler, but still useful, solution is to add three minor revisions, all
backward compatible.
1. Split drill-env.sh into three files. Drill defaults move into
drill-config.sh. Distribution-specific settings go into a new (optional)
distrib-env.sh file. User-specified settings stay in drill-env.sh (which now
holds ONLY user entries.)
2. Check for a jars directory within $DRILL_CONFIG_DIR. If present, add it to
the class path.
4. If $DRILL_CONFIG_DIR points to a location other than $DRILL_HOME/conf, then
add $DRILL_HOME/conf to the class path after $DRILL_CONFIG_DIR. That way, Drill
will pick up the default logback.xml file if the user does not provide a custom
version.
The above allows the user to set up a Drill site directory as follows:
my-drill
|- drill-override.conf
|- drill-env.sh
|- jars
|- myCustom.jar
Launch Drill with:
drillbit.sh --config /path/to/my-drill
Or
export DRILL_CONF_DIR=/path/to/my-drill
drillbit.sh
The result is that all user files go into the custom config directory, no user
entries or files go anywhere in $DRILL_HOME. Upgrading is now trivial. When
Drill is run under YARN, only the config files need be uploaded to DFS for a
config change rather than the entire Drill distribution, resulting in faster
start of a YARN-managed Drill cluster.
The one item that this abbreviated proposal does not address is node-specific
configuration. However, that seems a rare case. Users that use node-specific
settings probably need to change only drill-env.sh, which can be done by
sourcing a node-specific script inside drill-env.sh.
> Extend config system with distrib, site, node property files
> ------------------------------------------------------------
>
> Key: DRILL-4591
> URL: https://issues.apache.org/jira/browse/DRILL-4591
> Project: Apache Drill
> Issue Type: Improvement
> Reporter: Paul Rogers
> Attachments: Drill-on-YARNDirectoryStructures.pdf
>
>
> Today Drill provides the drill-override.conf file to set Drill properties,
> and the drill-env.sh file to provide custom launch properties. Today, most
> users seem to have a copy of DRILL_HOME per node, and thus they copy these
> two files per-node. The result is that the two files act as both the overall
> "site" configuration (for all nodes) and the "per-node" configuration for
> that one node.
> In addition, some distributions of Drill (such as MapR), modify the "user"
> config files with settings for that distribution. Now, the same files hold
> settings for the distribution, site and node.
> The approach works, but is awkward. Ideally, provide the option to have three
> sets of files: for the distribution, site, and node.
> The proposal is to extend configuration to provide additional levels:
> * Drill defaults (drill default and module conf files, code in
> drill-config.sh)
> * Distribution settings (special JVM settings, say)
> * Site settings (standard log or spill file locations)
> * Node settings
> * Launch settings (environment variables, -Dname=value options)
> The improvement becomes more important if a user employs NFS, MapR FS or YARN
> to automatically deploy the site-wide files. In that case, the site files
> cannot also act as per-node files.
> The improvement also simplifies upgrades. Today, users must copy
> customizations from and old to a new install. With the revision, Drill files
> are complely separated from user files, making upgrades (of software) easier.
> For backward compatibility, the site and node directories are optional and
> ignored if the environment variables are not set. The site and node config
> files should be optional: skip them if they do not exist.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)