from:"Allen Wittenauer $JIRA$"

[jira] [Updated] (HADOOP-9902) Shell script rewrite

2014-08-18 Thread Allen Wittenauer (JIRA)


 [ 
https://issues.apache.org/jira/browse/HADOOP-9902?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Allen Wittenauer updated HADOOP-9902:
-

Status: Open  (was: Patch Available)

 Shell script rewrite
 

 Key: HADOOP-9902
 URL: https://issues.apache.org/jira/browse/HADOOP-9902
 Project: Hadoop Common
  Issue Type: Improvement
  Components: scripts
Affects Versions: 3.0.0
Reporter: Allen Wittenauer
Assignee: Allen Wittenauer
  Labels: releasenotes
 Fix For: 3.0.0

 Attachments: HADOOP-9902-10.patch, HADOOP-9902-11.patch, 
 HADOOP-9902-12.patch, HADOOP-9902-13-branch-2.patch, HADOOP-9902-13.patch, 
 HADOOP-9902-14.patch, HADOOP-9902-15.patch, HADOOP-9902-2.patch, 
 HADOOP-9902-3.patch, HADOOP-9902-4.patch, HADOOP-9902-5.patch, 
 HADOOP-9902-6.patch, HADOOP-9902-7.patch, HADOOP-9902-8.patch, 
 HADOOP-9902-9.patch, HADOOP-9902.patch, HADOOP-9902.txt, hadoop-9902-1.patch, 
 more-info.txt


 Umbrella JIRA for shell script rewrite.  See more-info.txt for more details.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

[jira] [Updated] (HADOOP-9902) Shell script rewrite

2014-08-18 Thread Allen Wittenauer (JIRA)


 [ 
https://issues.apache.org/jira/browse/HADOOP-9902?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Allen Wittenauer updated HADOOP-9902:
-

Status: Patch Available  (was: Open)

 Shell script rewrite
 

 Key: HADOOP-9902
 URL: https://issues.apache.org/jira/browse/HADOOP-9902
 Project: Hadoop Common
  Issue Type: Improvement
  Components: scripts
Affects Versions: 3.0.0
Reporter: Allen Wittenauer
Assignee: Allen Wittenauer
  Labels: releasenotes
 Fix For: 3.0.0

 Attachments: HADOOP-9902-10.patch, HADOOP-9902-11.patch, 
 HADOOP-9902-12.patch, HADOOP-9902-13-branch-2.patch, HADOOP-9902-13.patch, 
 HADOOP-9902-14.patch, HADOOP-9902-15.patch, HADOOP-9902-16.patch, 
 HADOOP-9902-2.patch, HADOOP-9902-3.patch, HADOOP-9902-4.patch, 
 HADOOP-9902-5.patch, HADOOP-9902-6.patch, HADOOP-9902-7.patch, 
 HADOOP-9902-8.patch, HADOOP-9902-9.patch, HADOOP-9902.patch, HADOOP-9902.txt, 
 hadoop-9902-1.patch, more-info.txt


 Umbrella JIRA for shell script rewrite.  See more-info.txt for more details.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

[jira] [Updated] (HADOOP-9902) Shell script rewrite

2014-08-18 Thread Allen Wittenauer (JIRA)


 [ 
https://issues.apache.org/jira/browse/HADOOP-9902?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Allen Wittenauer updated HADOOP-9902:
-

Attachment: HADOOP-9902-16.patch

-16: re-deprecate the previously not deprecated but documented hadoop nfs3 and 
hadoop portmap subcommands

 Shell script rewrite
 

 Key: HADOOP-9902
 URL: https://issues.apache.org/jira/browse/HADOOP-9902
 Project: Hadoop Common
  Issue Type: Improvement
  Components: scripts
Affects Versions: 3.0.0
Reporter: Allen Wittenauer
Assignee: Allen Wittenauer
  Labels: releasenotes
 Fix For: 3.0.0

 Attachments: HADOOP-9902-10.patch, HADOOP-9902-11.patch, 
 HADOOP-9902-12.patch, HADOOP-9902-13-branch-2.patch, HADOOP-9902-13.patch, 
 HADOOP-9902-14.patch, HADOOP-9902-15.patch, HADOOP-9902-16.patch, 
 HADOOP-9902-2.patch, HADOOP-9902-3.patch, HADOOP-9902-4.patch, 
 HADOOP-9902-5.patch, HADOOP-9902-6.patch, HADOOP-9902-7.patch, 
 HADOOP-9902-8.patch, HADOOP-9902-9.patch, HADOOP-9902.patch, HADOOP-9902.txt, 
 hadoop-9902-1.patch, more-info.txt


 Umbrella JIRA for shell script rewrite.  See more-info.txt for more details.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

[jira] [Resolved] (HADOOP-9870) Mixed configurations for JVM -Xmx in hadoop command

2014-08-18 Thread Allen Wittenauer (JIRA)


 [ 
https://issues.apache.org/jira/browse/HADOOP-9870?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Allen Wittenauer resolved HADOOP-9870.
--

Resolution: Duplicate

Closing this as HADOOP-9902 contains a fix for this issue.

 Mixed configurations for JVM -Xmx in hadoop command
 ---

 Key: HADOOP-9870
 URL: https://issues.apache.org/jira/browse/HADOOP-9870
 Project: Hadoop Common
  Issue Type: Bug
Reporter: Wei Yan
 Attachments: HADOOP-9870.patch, HADOOP-9870.patch, HADOOP-9870.patch


 When we use hadoop command to launch a class, there are two places setting 
 the -Xmx configuration.
 *1*. The first place is located in file 
 {{hadoop-common-project/hadoop-common/src/main/bin/hadoop}}.
 {code}
 exec $JAVA $JAVA_HEAP_MAX $HADOOP_OPTS $CLASS $@
 {code}
 Here $JAVA_HEAP_MAX is configured in hadoop-config.sh 
 ({{hadoop-common-project/hadoop-common/src/main/bin/hadoop-config.sh}}). The 
 default value is -Xmx1000m.
 *2*. The second place is set with $HADOOP_OPTS in file 
 {{hadoop-common-project/hadoop-common/src/main/bin/hadoop}}.
 {code}
 HADOOP_OPTS=$HADOOP_OPTS $HADOOP_CLIENT_OPTS
 {code}
 Here $HADOOP_CLIENT_OPTS is set in hadoop-env.sh 
 ({{hadoop-common-project/hadoop-common/src/main/conf/hadoop-env.sh}})
 {code}
 export HADOOP_CLIENT_OPTS=-Xmx512m $HADOOP_CLIENT_OPTS
 {code}
 Currently the final default java command looks like:
 {code}java -Xmx1000m  -Xmx512m CLASS_NAME ARGUMENTS{code}
 And if users also specify the -Xmx in the $HADOOP_CLIENT_OPTS, there will be 
 three -Xmx configurations. 
 The hadoop setup tutorial only discusses hadoop-env.sh, and it looks that 
 users should not make any change in hadoop-config.sh.
 We should let hadoop smart to choose the right one before launching the java 
 command, instead of leaving for jvm to make the decision.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

[jira] [Updated] (HADOOP-9109) Support remote shell comands other than ssh in startup scripts

2014-08-18 Thread Allen Wittenauer (JIRA)


 [ 
https://issues.apache.org/jira/browse/HADOOP-9109?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Allen Wittenauer updated HADOOP-9109:
-

Resolution: Duplicate
Status: Resolved  (was: Patch Available)

Closing this as a dupe of HADOOP-9902.

 Support remote shell comands other than ssh in startup scripts
 --

 Key: HADOOP-9109
 URL: https://issues.apache.org/jira/browse/HADOOP-9109
 Project: Hadoop Common
  Issue Type: Improvement
  Components: scripts
Reporter: Albert Chu
Priority: Minor
 Attachments: HADOOP-9109-tag-release-1.1.0.patch, HADOOP-9109.patch


 Some organizations have remote shell launching mechanisms other than ssh.  It 
 would be nice to be able to use these other remote shell commands without 
 hand modifying the startup scripts.
 I've attached a locally developed patch for this against Hadoop 1.1.0.  It 
 modifies conf/hadoop-env.sh, bin/slaves.sh, and bin/hadoop-daemon.sh and 
 supports a HADOOP_SSH_CMD environment variable to allow users to set whatever 
 remote shell command they want to use.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

[jira] [Commented] (HADOOP-9902) Shell script rewrite

2014-08-19 Thread Allen Wittenauer (JIRA)


[ 
https://issues.apache.org/jira/browse/HADOOP-9902?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14102154#comment-14102154
 ] 

Allen Wittenauer commented on HADOOP-9902:
--

Jenkins appears to be pretty horked.  Patch clearly applies, there are no tests 
associated with the shell code, and previous versions applied with no 
issues so I'm just going to commit -16.

Thanks all!

 Shell script rewrite
 

 Key: HADOOP-9902
 URL: https://issues.apache.org/jira/browse/HADOOP-9902
 Project: Hadoop Common
  Issue Type: Improvement
  Components: scripts
Affects Versions: 3.0.0
Reporter: Allen Wittenauer
Assignee: Allen Wittenauer
  Labels: releasenotes
 Fix For: 3.0.0

 Attachments: HADOOP-9902-10.patch, HADOOP-9902-11.patch, 
 HADOOP-9902-12.patch, HADOOP-9902-13-branch-2.patch, HADOOP-9902-13.patch, 
 HADOOP-9902-14.patch, HADOOP-9902-15.patch, HADOOP-9902-16.patch, 
 HADOOP-9902-2.patch, HADOOP-9902-3.patch, HADOOP-9902-4.patch, 
 HADOOP-9902-5.patch, HADOOP-9902-6.patch, HADOOP-9902-7.patch, 
 HADOOP-9902-8.patch, HADOOP-9902-9.patch, HADOOP-9902.patch, HADOOP-9902.txt, 
 hadoop-9902-1.patch, more-info.txt


 Umbrella JIRA for shell script rewrite.  See more-info.txt for more details.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

[jira] [Updated] (HADOOP-9902) Shell script rewrite

2014-08-19 Thread Allen Wittenauer (JIRA)


 [ 
https://issues.apache.org/jira/browse/HADOOP-9902?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Allen Wittenauer updated HADOOP-9902:
-

Resolution: Fixed
Status: Resolved  (was: Patch Available)

Commit to trunk svn rev 1618847.  Closing.

 Shell script rewrite
 

 Key: HADOOP-9902
 URL: https://issues.apache.org/jira/browse/HADOOP-9902
 Project: Hadoop Common
  Issue Type: Improvement
  Components: scripts
Affects Versions: 3.0.0
Reporter: Allen Wittenauer
Assignee: Allen Wittenauer
  Labels: releasenotes
 Fix For: 3.0.0

 Attachments: HADOOP-9902-10.patch, HADOOP-9902-11.patch, 
 HADOOP-9902-12.patch, HADOOP-9902-13-branch-2.patch, HADOOP-9902-13.patch, 
 HADOOP-9902-14.patch, HADOOP-9902-15.patch, HADOOP-9902-16.patch, 
 HADOOP-9902-2.patch, HADOOP-9902-3.patch, HADOOP-9902-4.patch, 
 HADOOP-9902-5.patch, HADOOP-9902-6.patch, HADOOP-9902-7.patch, 
 HADOOP-9902-8.patch, HADOOP-9902-9.patch, HADOOP-9902.patch, HADOOP-9902.txt, 
 hadoop-9902-1.patch, more-info.txt


 Umbrella JIRA for shell script rewrite.  See more-info.txt for more details.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

[jira] [Commented] (HADOOP-10530) Make hadoop trunk build on Java7+ only

2014-08-19 Thread Allen Wittenauer (JIRA)


[ 
https://issues.apache.org/jira/browse/HADOOP-10530?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14102185#comment-14102185
 ] 

Allen Wittenauer commented on HADOOP-10530:
---

At this point, shouldn't trunk really be jdk 1.8?

 Make hadoop trunk build on Java7+ only
 --

 Key: HADOOP-10530
 URL: https://issues.apache.org/jira/browse/HADOOP-10530
 Project: Hadoop Common
  Issue Type: Improvement
  Components: build
Affects Versions: 3.0.0, 2.4.0
 Environment: Java 1.7+
Reporter: Steve Loughran
Assignee: Steve Loughran
 Attachments: HADOOP-10530-001.patch, HADOOP-10530-002.patch


 As discussed on hadoop-common, hadoop 3 is envisaged to be Java7+ *only* 
 -this JIRA covers switching the build for this
 # maven enforcer plugin to set Java version = {{[1.7)}}
 # compiler to set language to java 1.7



--
This message was sent by Atlassian JIRA
(v6.2#6252)

[jira] [Commented] (HADOOP-10978) HADOOP_IDENT_STRING is overriden in hadoop-env.sh

2014-08-19 Thread Allen Wittenauer (JIRA)


[ 
https://issues.apache.org/jira/browse/HADOOP-10978?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14102196#comment-14102196
 ] 

Allen Wittenauer commented on HADOOP-10978:
---

This problem has already been fixed in trunk as part of HADOOP-9902.

 HADOOP_IDENT_STRING is overriden in hadoop-env.sh
 -

 Key: HADOOP-10978
 URL: https://issues.apache.org/jira/browse/HADOOP-10978
 Project: Hadoop Common
  Issue Type: Bug
Reporter: Mathias Herberts
Priority: Minor

 hadoop-env.sh forces HADOOP_IDENT_STRING to $USER possibly overriding a 
 previously set value.
 Instead hadoop-env.sh should set HADOOP_IDENT_STRING to:
 export HADOOP_IDENT_STRING=${HADOOP_IDENT_STRING:-$USER}



--
This message was sent by Atlassian JIRA
(v6.2#6252)

[jira] [Updated] (HADOOP-10908) [post-HADOOP-9902] Cluster Node Setup needs updating

2014-08-19 Thread Allen Wittenauer (JIRA)


 [ 
https://issues.apache.org/jira/browse/HADOOP-10908?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Allen Wittenauer updated HADOOP-10908:
--

Summary: [post-HADOOP-9902] Cluster Node Setup needs updating  (was: 
Cluster Node Setup needs updating post-HADOOP-9902)

 [post-HADOOP-9902] Cluster Node Setup needs updating
 

 Key: HADOOP-10908
 URL: https://issues.apache.org/jira/browse/HADOOP-10908
 Project: Hadoop Common
  Issue Type: Bug
Reporter: Allen Wittenauer
Assignee: Allen Wittenauer

 A lot of the instructions in the cluster node setup are not good practices 
 post-9902.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

[jira] [Updated] (HADOOP-10787) [post-HADOOP-9902] Rename/remove DEFAULT_LIBEXEC_DIR from the shell scripts

2014-08-19 Thread Allen Wittenauer (JIRA)


 [ 
https://issues.apache.org/jira/browse/HADOOP-10787?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Allen Wittenauer updated HADOOP-10787:
--

Summary: [post-HADOOP-9902] Rename/remove DEFAULT_LIBEXEC_DIR from the 
shell scripts  (was: Rename DEFAULT_LIBEXEC_DIR from the shell scripts)

 [post-HADOOP-9902] Rename/remove DEFAULT_LIBEXEC_DIR from the shell scripts
 ---

 Key: HADOOP-10787
 URL: https://issues.apache.org/jira/browse/HADOOP-10787
 Project: Hadoop Common
  Issue Type: Improvement
  Components: scripts
Affects Versions: 3.0.0
Reporter: Allen Wittenauer

 DEFAULT_LIBEXEC_DIR pollutes the shell name space.  It should be renamed to 
 HADOOP_DEFAULT_LIBEXEC_DIR.  Unfortunately, this touches every single shell 
 script.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

[jira] [Updated] (HADOOP-10879) [post-HADOOP-9902] Rename -env.sh in the tree to -env.sh.example

2014-08-19 Thread Allen Wittenauer (JIRA)


 [ 
https://issues.apache.org/jira/browse/HADOOP-10879?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Allen Wittenauer updated HADOOP-10879:
--

Summary: [post-HADOOP-9902] Rename *-env.sh in the tree to *-env.sh.example 
 (was: Rename *-env.sh in the tree to *-env.sh.example)

 [post-HADOOP-9902] Rename *-env.sh in the tree to *-env.sh.example
 --

 Key: HADOOP-10879
 URL: https://issues.apache.org/jira/browse/HADOOP-10879
 Project: Hadoop Common
  Issue Type: Improvement
Reporter: Allen Wittenauer

 With HADOOP-9902 in place, we don't have to ship *-env.sh called as such and 
 only provide examples.  This goes a long way with being able to upgrade the 
 binaries in place since we would no longer overwrite those files upon 
 extraction.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

[jira] [Updated] (HADOOP-10854) [post-HADOOP-9902] unit tests for the shell scripts

2014-08-19 Thread Allen Wittenauer (JIRA)


 [ 
https://issues.apache.org/jira/browse/HADOOP-10854?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Allen Wittenauer updated HADOOP-10854:
--

Summary: [post-HADOOP-9902] unit tests for the shell scripts  (was: unit 
tests for the shell scripts)

 [post-HADOOP-9902] unit tests for the shell scripts
 ---

 Key: HADOOP-10854
 URL: https://issues.apache.org/jira/browse/HADOOP-10854
 Project: Hadoop Common
  Issue Type: Test
Reporter: Allen Wittenauer

 With HADOOP-9902 moving a lot of functionality to functions, we should build 
 some unit tests for them.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

[jira] [Updated] (HADOOP-10854) [post-HADOOP-9902] unit tests for the shell scripts

2014-08-19 Thread Allen Wittenauer (JIRA)


 [ 
https://issues.apache.org/jira/browse/HADOOP-10854?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Allen Wittenauer updated HADOOP-10854:
--

Description: With HADOOP-9902 moving a lot of the core functionality to 
functions, we should build some unit tests for them.  (was: With HADOOP-9902 
moving a lot of functionality to functions, we should build some unit tests for 
them.)

 [post-HADOOP-9902] unit tests for the shell scripts
 ---

 Key: HADOOP-10854
 URL: https://issues.apache.org/jira/browse/HADOOP-10854
 Project: Hadoop Common
  Issue Type: Test
Reporter: Allen Wittenauer

 With HADOOP-9902 moving a lot of the core functionality to functions, we 
 should build some unit tests for them.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

[jira] [Created] (HADOOP-10979) [post-HADOOP-9902] Auto-entries in hadoop_usage

2014-08-19 Thread Allen Wittenauer (JIRA)

Allen Wittenauer created HADOOP-10979:
-

 Summary: [post-HADOOP-9902] Auto-entries in hadoop_usage
 Key: HADOOP-10979
 URL: https://issues.apache.org/jira/browse/HADOOP-10979
 Project: Hadoop Common
  Issue Type: Improvement
Reporter: Allen Wittenauer
Priority: Minor


It would make adding common options to hadoop_usage output easier if some 
entries were auto-populated.  This is similar to what happens in FsShell and 
other parts of the Java code.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

[jira] [Commented] (HADOOP-10979) [post-HADOOP-9902] Auto-entries in hadoop_usage

2014-08-19 Thread Allen Wittenauer (JIRA)


[ 
https://issues.apache.org/jira/browse/HADOOP-10979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14102209#comment-14102209
 ] 

Allen Wittenauer commented on HADOOP-10979:
---

In particular:

* --config

*  --daemon options are standardized across all of the subsystems

* help

* version?



 [post-HADOOP-9902] Auto-entries in hadoop_usage
 ---

 Key: HADOOP-10979
 URL: https://issues.apache.org/jira/browse/HADOOP-10979
 Project: Hadoop Common
  Issue Type: Improvement
Reporter: Allen Wittenauer
Priority: Minor

 It would make adding common options to hadoop_usage output easier if some 
 entries were auto-populated.  This is similar to what happens in FsShell and 
 other parts of the Java code.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

[jira] [Updated] (HADOOP-10901) [post-HADOOP-9902] provide un-camelCased versions of shell commands

2014-08-19 Thread Allen Wittenauer (JIRA)


 [ 
https://issues.apache.org/jira/browse/HADOOP-10901?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Allen Wittenauer updated HADOOP-10901:
--

Summary: [post-HADOOP-9902] provide un-camelCased versions of shell 
commands  (was: provide un-camelCased versions of shell commands)

 [post-HADOOP-9902] provide un-camelCased versions of shell commands
 ---

 Key: HADOOP-10901
 URL: https://issues.apache.org/jira/browse/HADOOP-10901
 Project: Hadoop Common
  Issue Type: Improvement
Reporter: Allen Wittenauer

 There is a heavy disposition to do camelCase subcommands because it reflects 
 what is in the Java code.  However, it is very counter to the shell.  We 
 should replace the case options to accept both the camelCase and the fully 
 lowercase options.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

[jira] [Updated] (HADOOP-10978) HADOOP_IDENT_STRING is overriden in hadoop-env.sh

2014-08-19 Thread Allen Wittenauer (JIRA)


 [ 
https://issues.apache.org/jira/browse/HADOOP-10978?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Allen Wittenauer updated HADOOP-10978:
--

Affects Version/s: 2.6.0

 HADOOP_IDENT_STRING is overriden in hadoop-env.sh
 -

 Key: HADOOP-10978
 URL: https://issues.apache.org/jira/browse/HADOOP-10978
 Project: Hadoop Common
  Issue Type: Bug
Affects Versions: 2.6.0
Reporter: Mathias Herberts
Priority: Minor

 hadoop-env.sh forces HADOOP_IDENT_STRING to $USER possibly overriding a 
 previously set value.
 Instead hadoop-env.sh should set HADOOP_IDENT_STRING to:
 export HADOOP_IDENT_STRING=${HADOOP_IDENT_STRING:-$USER}



--
This message was sent by Atlassian JIRA
(v6.2#6252)

[jira] [Updated] (HADOOP-10950) rework heap management vars

2014-08-19 Thread Allen Wittenauer (JIRA)


 [ 
https://issues.apache.org/jira/browse/HADOOP-10950?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Allen Wittenauer updated HADOOP-10950:
--

Hadoop Flags: Incompatible change

 rework  heap management  vars
 -

 Key: HADOOP-10950
 URL: https://issues.apache.org/jira/browse/HADOOP-10950
 Project: Hadoop Common
  Issue Type: Improvement
Reporter: Allen Wittenauer

 Post-HADOOP-9902, we need to rework how heap is configured for small 
 footprint machines, deprecate some options, introduce new ones for greater 
 flexibility.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

[jira] [Commented] (HADOOP-10893) isolated classloader on the client side

2014-08-19 Thread Allen Wittenauer (JIRA)

[
https://issues.apache.org/jira/browse/HADOOP-10893?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14103194#comment-14103194
]

Allen Wittenauer commented on HADOOP-10893:
---

{code}
+# If HADOOP_USE_CLIENT_CLASSLOADER is set, user classes and their dependencies
+# as defined by HADOOP_CLASSPATH and the jar as the hadoop jar argument are
+# loaded by a separate classloader. It should not be mixed with
+# HADOOP_USER_CLASSPATH_FIRST. If it is set, HADOOP_USER_CLASSPATH_FIRST is
+# ignored. Can be defined by doing
+# export HADOOP_USE_CLIENT_CLASSLOADER=true
+
+# HADOOP_CLIENT_CLASSLOADER_SYSTEM_CLASSES overrides the default definition of
+# system classes for the client classloader. The system classes are a
+# comma-separated list of classes that should be loaded from the system
+# classpath, not the user-supplied JARs, when HADOOP_USE_CLIENT_CLASSLOADER is
+# enabled. Names ending in '.' (period) are treated as package names, and names
+# starting with a '-' are treated as negative matches.
+
{code}

I'm not a fan of this wall of text sitting in hadoop-env.sh. Ideally, this
should really be in documentation with a very light description here; that
second paragraph seems too much. Additionally, burying the variable in the
middle of the description is confusing. It should be the last thing in the
section so that it is clear that's what one needs to change. In other words,
follow the pattern established elsewhere.

The change to hadoop_add_to_classpath_userpath looks fine, based upon my
understanding of what this patch is doing.

isolated classloader on the client side
---

Key: HADOOP-10893
URL: https://issues.apache.org/jira/browse/HADOOP-10893
Project: Hadoop Common
Issue Type: New Feature
Components: util
Affects Versions: 2.4.0
Reporter: Sangjin Lee
Assignee: Sangjin Lee
Attachments: HADOOP-10893.patch, HADOOP-10893.patch,
HADOOP-10893.patch, HADOOP-10893.patch, HADOOP-10893.patch,
HADOOP-10893.patch, classloader-test.tar.gz

We have the job classloader on the mapreduce tasks that run on the cluster.
It has a benefit of being able to isolate class space for user code and avoid
version clashes.
Although it occurs less often, version clashes do occur on the client JVM. It
would be good to introduce an isolated classloader on the client side as well
to address this. A natural point to introduce this may be through RunJar, as
that's how most of hadoop jobs are run.

--
This message was sent by Atlassian JIRA
(v6.2#6252)

[jira] [Commented] (HADOOP-10893) isolated classloader on the client side

2014-08-19 Thread Allen Wittenauer (JIRA)


[ 
https://issues.apache.org/jira/browse/HADOOP-10893?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14103196#comment-14103196
 ] 

Allen Wittenauer commented on HADOOP-10893:
---

OK, I see the mistake I made.  There is no example export line for 
HADOOP_CLIENT_CLASSLOADER_SYSTEM_CLASSES so I thought it was still describing 
the first one. So yeah, add that instead. ;)

 isolated classloader on the client side
 ---

 Key: HADOOP-10893
 URL: https://issues.apache.org/jira/browse/HADOOP-10893
 Project: Hadoop Common
  Issue Type: New Feature
  Components: util
Affects Versions: 2.4.0
Reporter: Sangjin Lee
Assignee: Sangjin Lee
 Attachments: HADOOP-10893.patch, HADOOP-10893.patch, 
 HADOOP-10893.patch, HADOOP-10893.patch, HADOOP-10893.patch, 
 HADOOP-10893.patch, classloader-test.tar.gz


 We have the job classloader on the mapreduce tasks that run on the cluster. 
 It has a benefit of being able to isolate class space for user code and avoid 
 version clashes.
 Although it occurs less often, version clashes do occur on the client JVM. It 
 would be good to introduce an isolated classloader on the client side as well 
 to address this. A natural point to introduce this may be through RunJar, as 
 that's how most of hadoop jobs are run.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

[jira] [Commented] (HADOOP-10893) isolated classloader on the client side

2014-08-20 Thread Allen Wittenauer (JIRA)


[ 
https://issues.apache.org/jira/browse/HADOOP-10893?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14103886#comment-14103886
 ] 

Allen Wittenauer commented on HADOOP-10893:
---

Yeah, the export line examples make a huge difference.  Thanks.

 isolated classloader on the client side
 ---

 Key: HADOOP-10893
 URL: https://issues.apache.org/jira/browse/HADOOP-10893
 Project: Hadoop Common
  Issue Type: New Feature
  Components: util
Affects Versions: 2.4.0
Reporter: Sangjin Lee
Assignee: Sangjin Lee
 Attachments: HADOOP-10893-branch-2.patch, HADOOP-10893.patch, 
 HADOOP-10893.patch, HADOOP-10893.patch, HADOOP-10893.patch, 
 HADOOP-10893.patch, HADOOP-10893.patch, HADOOP-10893.patch, 
 classloader-test.tar.gz


 We have the job classloader on the mapreduce tasks that run on the cluster. 
 It has a benefit of being able to isolate class space for user code and avoid 
 version clashes.
 Although it occurs less often, version clashes do occur on the client JVM. It 
 would be good to introduce an isolated classloader on the client side as well 
 to address this. A natural point to introduce this may be through RunJar, as 
 that's how most of hadoop jobs are run.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

[jira] [Updated] (HADOOP-10979) [post-HADOOP-9902] Auto-entries in hadoop_usage

2014-08-20 Thread Allen Wittenauer (JIRA)


 [ 
https://issues.apache.org/jira/browse/HADOOP-10979?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Allen Wittenauer updated HADOOP-10979:
--

Component/s: scripts

 [post-HADOOP-9902] Auto-entries in hadoop_usage
 ---

 Key: HADOOP-10979
 URL: https://issues.apache.org/jira/browse/HADOOP-10979
 Project: Hadoop Common
  Issue Type: Improvement
  Components: scripts
Reporter: Allen Wittenauer
Priority: Minor

 It would make adding common options to hadoop_usage output easier if some 
 entries were auto-populated.  This is similar to what happens in FsShell and 
 other parts of the Java code.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

[jira] [Updated] (HADOOP-10950) rework heap management vars

2014-08-20 Thread Allen Wittenauer (JIRA)


 [ 
https://issues.apache.org/jira/browse/HADOOP-10950?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Allen Wittenauer updated HADOOP-10950:
--

Component/s: scripts

 rework  heap management  vars
 -

 Key: HADOOP-10950
 URL: https://issues.apache.org/jira/browse/HADOOP-10950
 Project: Hadoop Common
  Issue Type: Improvement
  Components: scripts
Reporter: Allen Wittenauer
  Labels: scripts

 Post-HADOOP-9902, we need to rework how heap is configured for small 
 footprint machines, deprecate some options, introduce new ones for greater 
 flexibility.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

[jira] [Updated] (HADOOP-10879) [post-HADOOP-9902] Rename -env.sh in the tree to -env.sh.example

2014-08-20 Thread Allen Wittenauer (JIRA)


 [ 
https://issues.apache.org/jira/browse/HADOOP-10879?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Allen Wittenauer updated HADOOP-10879:
--

Component/s: scripts

 [post-HADOOP-9902] Rename *-env.sh in the tree to *-env.sh.example
 --

 Key: HADOOP-10879
 URL: https://issues.apache.org/jira/browse/HADOOP-10879
 Project: Hadoop Common
  Issue Type: Improvement
  Components: scripts
Reporter: Allen Wittenauer
  Labels: scripts

 With HADOOP-9902 in place, we don't have to ship *-env.sh called as such and 
 only provide examples.  This goes a long way with being able to upgrade the 
 binaries in place since we would no longer overwrite those files upon 
 extraction.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

[jira] [Updated] (HADOOP-10901) [post-HADOOP-9902] provide un-camelCased versions of shell commands

2014-08-20 Thread Allen Wittenauer (JIRA)


 [ 
https://issues.apache.org/jira/browse/HADOOP-10901?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Allen Wittenauer updated HADOOP-10901:
--

Component/s: scripts

 [post-HADOOP-9902] provide un-camelCased versions of shell commands
 ---

 Key: HADOOP-10901
 URL: https://issues.apache.org/jira/browse/HADOOP-10901
 Project: Hadoop Common
  Issue Type: Improvement
  Components: scripts
Reporter: Allen Wittenauer
  Labels: scripts

 There is a heavy disposition to do camelCase subcommands because it reflects 
 what is in the Java code.  However, it is very counter to the shell.  We 
 should replace the case options to accept both the camelCase and the fully 
 lowercase options.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

[jira] [Updated] (HADOOP-10854) [post-HADOOP-9902] unit tests for the shell scripts

2014-08-20 Thread Allen Wittenauer (JIRA)


 [ 
https://issues.apache.org/jira/browse/HADOOP-10854?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Allen Wittenauer updated HADOOP-10854:
--

Component/s: scripts

 [post-HADOOP-9902] unit tests for the shell scripts
 ---

 Key: HADOOP-10854
 URL: https://issues.apache.org/jira/browse/HADOOP-10854
 Project: Hadoop Common
  Issue Type: Test
  Components: scripts
Reporter: Allen Wittenauer
  Labels: scripts

 With HADOOP-9902 moving a lot of the core functionality to functions, we 
 should build some unit tests for them.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

[jira] [Updated] (HADOOP-10950) rework heap management vars

2014-08-20 Thread Allen Wittenauer (JIRA)


 [ 
https://issues.apache.org/jira/browse/HADOOP-10950?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Allen Wittenauer updated HADOOP-10950:
--

Release Note: The default heap sizes have been removed.  This will allow 
for the JVM to use auto-tuning based upon the memory size of the host.  To 
re-enable the old default, configure HADOOP_HEAPSIZE_MAX=1g in hadoop-env.sh.

 rework  heap management  vars
 -

 Key: HADOOP-10950
 URL: https://issues.apache.org/jira/browse/HADOOP-10950
 Project: Hadoop Common
  Issue Type: Improvement
  Components: scripts
Reporter: Allen Wittenauer
  Labels: scripts

 Post-HADOOP-9902, we need to rework how heap is configured for small 
 footprint machines, deprecate some options, introduce new ones for greater 
 flexibility.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

[jira] [Updated] (HADOOP-10950) rework heap management vars

2014-08-20 Thread Allen Wittenauer (JIRA)

[
https://issues.apache.org/jira/browse/HADOOP-10950?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Allen Wittenauer updated HADOOP-10950:
--

Release Note:
* The default heap sizes have been removed and the HADOOP_HEAPSIZE variable has
been deprecated (It will still be honored if set, but expect it to go away in
the future). This will allow for the JVM to use auto-tuning based upon the
memory size of the host. To re-enable the old default, configure
HADOOP_HEAPSIZE_MAX=1g in hadoop-env.sh.

* All heap size variables now support units. If the variable is only a number,
the size is assumed to be in megabytes.

was:
* The default heap sizes have been removed and the HADOOP_HEAPSIZE variable has
been deprecated (It will still be honored if set, but expect it to go away in
the future). This will allow for the JVM to use auto-tuning based upon the
memory size of the host. To re-enable the old default, configure
HADOOP_HEAPSIZE_MAX=1g in hadoop-env.sh.

* All heap size variables now support units. If the variable is only a number,
the size assumed to be in megabytes.

rework heap management vars
-

Key: HADOOP-10950
URL: https://issues.apache.org/jira/browse/HADOOP-10950
Project: Hadoop Common
Issue Type: Improvement
Components: scripts
Reporter: Allen Wittenauer
Labels: scripts

Post-HADOOP-9902, we need to rework how heap is configured for small
footprint machines, deprecate some options, introduce new ones for greater
flexibility.

--
This message was sent by Atlassian JIRA
(v6.2#6252)

[jira] [Updated] (HADOOP-10950) rework heap management vars

2014-08-20 Thread Allen Wittenauer (JIRA)


 [ 
https://issues.apache.org/jira/browse/HADOOP-10950?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Allen Wittenauer updated HADOOP-10950:
--

Release Note: 
* The default heap sizes have been removed and the HADOOP_HEAPSIZE variable has 
been deprecated  (It will still be honored if set, but expect it to go away in 
the future).  This will allow for the JVM to use auto-tuning based upon the 
memory size of the host.  To re-enable the old default, configure 
HADOOP_HEAPSIZE_MAX=1g in hadoop-env.sh.

* All heap size variables now support units.  If the variable is only a number, 
the size assumed to be in megabytes.

  was:The default heap sizes have been removed.  This will allow for the JVM to 
use auto-tuning based upon the memory size of the host.  To re-enable the old 
default, configure HADOOP_HEAPSIZE_MAX=1g in hadoop-env.sh.


 rework  heap management  vars
 -

 Key: HADOOP-10950
 URL: https://issues.apache.org/jira/browse/HADOOP-10950
 Project: Hadoop Common
  Issue Type: Improvement
  Components: scripts
Reporter: Allen Wittenauer
  Labels: scripts

 Post-HADOOP-9902, we need to rework how heap is configured for small 
 footprint machines, deprecate some options, introduce new ones for greater 
 flexibility.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

[jira] [Commented] (HADOOP-10788) [post-HADOOP-9902] Rewrite httpfs, kms, sls, and other stragglers

2014-08-20 Thread Allen Wittenauer (JIRA)


[ 
https://issues.apache.org/jira/browse/HADOOP-10788?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14104082#comment-14104082
 ] 

Allen Wittenauer commented on HADOOP-10788:
---

I started to take a look at this one.  This is going to be a bit more work than 
expected.  Most of the work for httpfs and kms is being done by the tomcat 
script.  So this means that there is not only excessive verbosity coming from 
our code but also from tomcat.  

Additionally, the hadoop_daemon_* functions were built from the perspective 
that hadoop-common would be launching Java.  This is clearly not the case with 
these two daemons.

In order to kill two birds with one stone, I'm thinking the best approach would 
be to make Yet Another Set of non-java daemon functions that could be used 
generically.  We'd wrap all of the tomcat stuff such that instead of going to 
the screen it'd go to .out instead.

 [post-HADOOP-9902] Rewrite httpfs, kms, sls, and other stragglers
 -

 Key: HADOOP-10788
 URL: https://issues.apache.org/jira/browse/HADOOP-10788
 Project: Hadoop Common
  Issue Type: Improvement
  Components: scripts
Affects Versions: 3.0.0
Reporter: Allen Wittenauer
  Labels: scripts

 There are some stragglers not targeted by HADOOP-9902.  These should also get 
 rewritten to use the new hadoop-functions.sh framework. 



--
This message was sent by Atlassian JIRA
(v6.2#6252)

[jira] [Commented] (HADOOP-10788) [post-HADOOP-9902] Rewrite httpfs, kms, sls, and other stragglers

2014-08-20 Thread Allen Wittenauer (JIRA)


[ 
https://issues.apache.org/jira/browse/HADOOP-10788?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14104091#comment-14104091
 ] 

Allen Wittenauer commented on HADOOP-10788:
---

Another approach might be:

{code}

HADOOP_USE_JAVA=${HADOOP_USE_JAVA:-true}
...

if [[ ${HADOOP_USE_JAVA} eq true ]]; then
  exec ${JAVA} blah
else
  exec blah
fi
{code}

That might be less code overall.  Hmm.

 [post-HADOOP-9902] Rewrite httpfs, kms, sls, and other stragglers
 -

 Key: HADOOP-10788
 URL: https://issues.apache.org/jira/browse/HADOOP-10788
 Project: Hadoop Common
  Issue Type: Improvement
  Components: scripts
Affects Versions: 3.0.0
Reporter: Allen Wittenauer
  Labels: scripts

 There are some stragglers not targeted by HADOOP-9902.  These should also get 
 rewritten to use the new hadoop-functions.sh framework. 



--
This message was sent by Atlassian JIRA
(v6.2#6252)

[jira] [Updated] (HADOOP-10787) [post-HADOOP-9902] Rename/remove non-HADOOP_, non-YARN_ from the shell scripts

2014-08-20 Thread Allen Wittenauer (JIRA)


 [ 
https://issues.apache.org/jira/browse/HADOOP-10787?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Allen Wittenauer updated HADOOP-10787:
--

Description: We should make an effort to clean up the shell env var name 
space by removing unsafe variables.  See comments for list.  (was: We should 
make an effort to clean up the shell env var name space)

 [post-HADOOP-9902] Rename/remove non-HADOOP_*, non-YARN_* from the shell 
 scripts
 

 Key: HADOOP-10787
 URL: https://issues.apache.org/jira/browse/HADOOP-10787
 Project: Hadoop Common
  Issue Type: Improvement
  Components: scripts
Affects Versions: 3.0.0
Reporter: Allen Wittenauer
  Labels: scripts

 We should make an effort to clean up the shell env var name space by removing 
 unsafe variables.  See comments for list.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

[jira] [Updated] (HADOOP-10787) [post-HADOOP-9902] Rename/remove non-HADOOP_, non-YARN_ from the shell scripts

2014-08-20 Thread Allen Wittenauer (JIRA)


 [ 
https://issues.apache.org/jira/browse/HADOOP-10787?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Allen Wittenauer updated HADOOP-10787:
--

Summary: [post-HADOOP-9902] Rename/remove non-HADOOP_*, non-YARN_* from the 
shell scripts  (was: [post-HADOOP-9902] Rename/remove DEFAULT_LIBEXEC_DIR from 
the shell scripts)

 [post-HADOOP-9902] Rename/remove non-HADOOP_*, non-YARN_* from the shell 
 scripts
 

 Key: HADOOP-10787
 URL: https://issues.apache.org/jira/browse/HADOOP-10787
 Project: Hadoop Common
  Issue Type: Improvement
  Components: scripts
Affects Versions: 3.0.0
Reporter: Allen Wittenauer
  Labels: scripts

 DEFAULT_LIBEXEC_DIR pollutes the shell name space.  It should be renamed to 
 HADOOP_DEFAULT_LIBEXEC_DIR.  Unfortunately, this touches every single shell 
 script.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

[jira] [Updated] (HADOOP-10787) [post-HADOOP-9902] Rename/remove non-HADOOP_, non-YARN_ from the shell scripts

2014-08-20 Thread Allen Wittenauer (JIRA)


 [ 
https://issues.apache.org/jira/browse/HADOOP-10787?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Allen Wittenauer updated HADOOP-10787:
--

Description: We should make an effort to clean up the shell env var name 
space  (was: DEFAULT_LIBEXEC_DIR pollutes the shell name space.  It should be 
renamed to HADOOP_DEFAULT_LIBEXEC_DIR.  Unfortunately, this touches every 
single shell script.)

 [post-HADOOP-9902] Rename/remove non-HADOOP_*, non-YARN_* from the shell 
 scripts
 

 Key: HADOOP-10787
 URL: https://issues.apache.org/jira/browse/HADOOP-10787
 Project: Hadoop Common
  Issue Type: Improvement
  Components: scripts
Affects Versions: 3.0.0
Reporter: Allen Wittenauer
  Labels: scripts

 We should make an effort to clean up the shell env var name space



--
This message was sent by Atlassian JIRA
(v6.2#6252)

[jira] [Commented] (HADOOP-10787) [post-HADOOP-9902] Rename/remove non-HADOOP_, non-YARN_ from the shell scripts

2014-08-20 Thread Allen Wittenauer (JIRA)


[ 
https://issues.apache.org/jira/browse/HADOOP-10787?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14104099#comment-14104099
 ] 

Allen Wittenauer commented on HADOOP-10787:
---


List:
* DEFAULT_LIBEXEC_DIR
* TOOL_PATH
* JAVA_HEAP_MAX (handled by HADOOP-10950)
* SLAVE_NAMES? (just need to be made local?)
* SLAVE_FILE? (just need to be made local?)

Not to be removed:
* HADOOP_*
* YARN_*
* MAPRED_*

Non-Hadoop vars intentionally set/used:
* JAVA_HOME
* PDSH_SSH_ARGS_APPEND
* LD_LIBRARY_PATH (for people who don't know how to use ld -R...)
* JAVA_LIBRARY_PATH (ditto)
* MALLOC_ARENA_MAX
* EUID
* JAVA (although maybe this should get changed to HADOOP_JAVA?)
* JSVC_HOME
* CLASSPATH
* BASH_VERSINFO

NOTE: list currently ignores the craziness that is httpfs, kms, etc.

 [post-HADOOP-9902] Rename/remove non-HADOOP_*, non-YARN_* from the shell 
 scripts
 

 Key: HADOOP-10787
 URL: https://issues.apache.org/jira/browse/HADOOP-10787
 Project: Hadoop Common
  Issue Type: Improvement
  Components: scripts
Affects Versions: 3.0.0
Reporter: Allen Wittenauer
  Labels: scripts

 We should make an effort to clean up the shell env var name space by removing 
 unsafe variables.  See comments for list.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

[jira] [Updated] (HADOOP-10787) [post-HADOOP-9902] Rename/remove non-HADOOP_*, etc from the shell scripts

2014-08-20 Thread Allen Wittenauer (JIRA)


 [ 
https://issues.apache.org/jira/browse/HADOOP-10787?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Allen Wittenauer updated HADOOP-10787:
--

Summary: [post-HADOOP-9902] Rename/remove non-HADOOP_*, etc from the shell 
scripts  (was: [post-HADOOP-9902] Rename/remove non-HADOOP_*, non-YARN_* from 
the shell scripts)

 [post-HADOOP-9902] Rename/remove non-HADOOP_*, etc from the shell scripts
 -

 Key: HADOOP-10787
 URL: https://issues.apache.org/jira/browse/HADOOP-10787
 Project: Hadoop Common
  Issue Type: Improvement
  Components: scripts
Affects Versions: 3.0.0
Reporter: Allen Wittenauer
  Labels: scripts

 We should make an effort to clean up the shell env var name space by removing 
 unsafe variables.  See comments for list.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

[jira] [Updated] (HADOOP-10950) rework heap management vars

2014-08-20 Thread Allen Wittenauer (JIRA)


 [ 
https://issues.apache.org/jira/browse/HADOOP-10950?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Allen Wittenauer updated HADOOP-10950:
--

Attachment: HADOOP-10950.patch

 rework  heap management  vars
 -

 Key: HADOOP-10950
 URL: https://issues.apache.org/jira/browse/HADOOP-10950
 Project: Hadoop Common
  Issue Type: Improvement
  Components: scripts
Reporter: Allen Wittenauer
  Labels: scripts
 Attachments: HADOOP-10950.patch


 Post-HADOOP-9902, we need to rework how heap is configured for small 
 footprint machines, deprecate some options, introduce new ones for greater 
 flexibility.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

[jira] [Updated] (HADOOP-10950) rework heap management vars

2014-08-20 Thread Allen Wittenauer (JIRA)


 [ 
https://issues.apache.org/jira/browse/HADOOP-10950?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Allen Wittenauer updated HADOOP-10950:
--

Assignee: Allen Wittenauer
  Status: Patch Available  (was: Open)

 rework  heap management  vars
 -

 Key: HADOOP-10950
 URL: https://issues.apache.org/jira/browse/HADOOP-10950
 Project: Hadoop Common
  Issue Type: Improvement
  Components: scripts
Reporter: Allen Wittenauer
Assignee: Allen Wittenauer
  Labels: scripts
 Attachments: HADOOP-10950.patch


 Post-HADOOP-9902, we need to rework how heap is configured for small 
 footprint machines, deprecate some options, introduce new ones for greater 
 flexibility.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

[jira] [Commented] (HADOOP-10950) rework heap management vars

2014-08-20 Thread Allen Wittenauer (JIRA)


[ 
https://issues.apache.org/jira/browse/HADOOP-10950?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14104357#comment-14104357
 ] 

Allen Wittenauer commented on HADOOP-10950:
---

This patch also fixes some documentation issues from HADOOP-9902 wrt heap sizes.

 rework  heap management  vars
 -

 Key: HADOOP-10950
 URL: https://issues.apache.org/jira/browse/HADOOP-10950
 Project: Hadoop Common
  Issue Type: Improvement
  Components: scripts
Reporter: Allen Wittenauer
Assignee: Allen Wittenauer
  Labels: scripts
 Attachments: HADOOP-10950.patch


 Post-HADOOP-9902, we need to rework how heap is configured for small 
 footprint machines, deprecate some options, introduce new ones for greater 
 flexibility.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

[jira] [Updated] (HADOOP-10950) rework heap management vars

2014-08-20 Thread Allen Wittenauer (JIRA)

[
https://issues.apache.org/jira/browse/HADOOP-10950?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Allen Wittenauer updated HADOOP-10950:
--

* All global and daemon-specific heap size variables now support units. If the
variable is only a number, the size is assumed to be in megabytes.

* All heap size variables now support units. If the variable is only a number,
the size is assumed to be in megabytes.

rework heap management vars
-

Key: HADOOP-10950
URL: https://issues.apache.org/jira/browse/HADOOP-10950
Project: Hadoop Common
Issue Type: Improvement
Components: scripts
Reporter: Allen Wittenauer
Assignee: Allen Wittenauer
Labels: scripts
Attachments: HADOOP-10950.patch

Post-HADOOP-9902, we need to rework how heap is configured for small
footprint machines, deprecate some options, introduce new ones for greater
flexibility.

--
This message was sent by Atlassian JIRA
(v6.2#6252)

[jira] [Updated] (HADOOP-10946) Fix a bunch of typos in log messages

2014-08-20 Thread Allen Wittenauer (JIRA)


 [ 
https://issues.apache.org/jira/browse/HADOOP-10946?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Allen Wittenauer updated HADOOP-10946:
--

Status: Open  (was: Patch Available)

 Fix a bunch of typos in log messages
 

 Key: HADOOP-10946
 URL: https://issues.apache.org/jira/browse/HADOOP-10946
 Project: Hadoop Common
  Issue Type: Bug
Affects Versions: 2.4.1
Reporter: Ray Chiang
Priority: Trivial
  Labels: newbie
 Attachments: HADOOP-10946-04.patch, HADOOP10946-01.patch, 
 HADOOP10946-02.patch, HADOOP10946-03.patch


 There are a bunch of typos in various log messages.  These need cleaning up.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

[jira] [Updated] (HADOOP-10946) Fix a bunch of typos in log messages

2014-08-20 Thread Allen Wittenauer (JIRA)


 [ 
https://issues.apache.org/jira/browse/HADOOP-10946?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Allen Wittenauer updated HADOOP-10946:
--

Status: Patch Available  (was: Open)

 Fix a bunch of typos in log messages
 

 Key: HADOOP-10946
 URL: https://issues.apache.org/jira/browse/HADOOP-10946
 Project: Hadoop Common
  Issue Type: Bug
Affects Versions: 2.4.1
Reporter: Ray Chiang
Priority: Trivial
  Labels: newbie
 Attachments: HADOOP-10946-04.patch, HADOOP10946-01.patch, 
 HADOOP10946-02.patch, HADOOP10946-03.patch


 There are a bunch of typos in various log messages.  These need cleaning up.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

[jira] [Updated] (HADOOP-10946) Fix a bunch of typos in log messages

2014-08-20 Thread Allen Wittenauer (JIRA)


 [ 
https://issues.apache.org/jira/browse/HADOOP-10946?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Allen Wittenauer updated HADOOP-10946:
--

Assignee: Ray Chiang

 Fix a bunch of typos in log messages
 

 Key: HADOOP-10946
 URL: https://issues.apache.org/jira/browse/HADOOP-10946
 Project: Hadoop Common
  Issue Type: Bug
Affects Versions: 2.4.1
Reporter: Ray Chiang
Assignee: Ray Chiang
Priority: Trivial
  Labels: newbie
 Attachments: HADOOP-10946-04.patch, HADOOP10946-01.patch, 
 HADOOP10946-02.patch, HADOOP10946-03.patch


 There are a bunch of typos in various log messages.  These need cleaning up.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

[jira] [Updated] (HADOOP-8896) Javadoc points to Wrong Reader and Writer classes in SequenceFile

2014-08-20 Thread Allen Wittenauer (JIRA)


 [ 
https://issues.apache.org/jira/browse/HADOOP-8896?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Allen Wittenauer updated HADOOP-8896:
-

Assignee: Ray Chiang

 Javadoc points to Wrong Reader and Writer classes in SequenceFile
 -

 Key: HADOOP-8896
 URL: https://issues.apache.org/jira/browse/HADOOP-8896
 Project: Hadoop Common
  Issue Type: Improvement
  Components: documentation, io
Affects Versions: 2.0.1-alpha
Reporter: Timothy Mann
Assignee: Ray Chiang
Priority: Trivial
  Labels: sequence-file
 Attachments: HADOOP-8896-02.patch, HADOOP-8896-03.patch, 
 HADOOP8896-01.patch

   Original Estimate: 1m
  Remaining Estimate: 1m

 Line 56 of org.apache.hadoop.io.SequenceFile refers to {@link Writer}, {@link 
 Reader} in the javadoc comment describing the class SequenceFile. When the 
 javadoc is built Reader and Writer link to java.io.Reader and java.io.Writer, 
 respectively. However, they should instead refer to {@link 
 SequenceFile.Reader} and {@link SequenceFile.Writer}.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

[jira] [Commented] (HADOOP-9902) Shell script rewrite

2014-08-21 Thread Allen Wittenauer (JIRA)

[
https://issues.apache.org/jira/browse/HADOOP-9902?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14105351#comment-14105351
]

Allen Wittenauer commented on HADOOP-9902:
--

bq. yarn command usage info seems broken. e.g. yarn application command
earlier was printing command usage info. Now, it's throwing exception.

Looks like I missed this command line stack manipulation for ApplicationCLI:

{code}
elif [ $COMMAND = application ] ||
[ $COMMAND = applicationattempt ] ||
[ $COMMAND = container ]; then
CLASS=org.apache.hadoop.yarn.client.cli.ApplicationCLI
YARN_OPTS=$YARN_OPTS $YARN_CLIENT_OPTS
set -- $COMMAND $@
{code}

... probably because it is a very oddball thing to do. I'll file a JIRA for
that.

bq. Starting/stopping yarn daemon doesn't print anything any more. Earlier it
was printing something like starting resource manager.., which I think is
useful.

Putting it inside yarn-daemon.sh or anywhere else breaks the init.d script
experience for ops teams. So, if anything, this should get changed in
yarn-daemons.sh and make it more of an analog to hadoop-daemons.sh.

bq. we should open separate jiras in YARN/MR to track YARN/MR side changes, so
that it draws enough attention in YARN/MR community as well.

It's an interesting data point to note that the follow-up JIRAs for this one to
fix bugs, add a few more features, etc, are surprisingly light with watchers,
if they even have any. Probably hints as to another reason why this part of
the code base never gets fixes. It was decided early on (see above) to do this
as one big JIRA. That was still, IMO, the correct decision based upon history
and the current state.

While this was a sweeping change across all of the subprojects, all of these
individual communities should be paying attention to what is happening in
common due to the dependency structure.

Shell script rewrite

Key: HADOOP-9902
URL: https://issues.apache.org/jira/browse/HADOOP-9902
Project: Hadoop Common
Issue Type: Improvement
Components: scripts
Affects Versions: 3.0.0
Reporter: Allen Wittenauer
Assignee: Allen Wittenauer
Labels: releasenotes
Fix For: 3.0.0

Attachments: HADOOP-9902-10.patch, HADOOP-9902-11.patch,
HADOOP-9902-12.patch, HADOOP-9902-13-branch-2.patch, HADOOP-9902-13.patch,
HADOOP-9902-14.patch, HADOOP-9902-15.patch, HADOOP-9902-16.patch,
HADOOP-9902-2.patch, HADOOP-9902-3.patch, HADOOP-9902-4.patch,
HADOOP-9902-5.patch, HADOOP-9902-6.patch, HADOOP-9902-7.patch,
HADOOP-9902-8.patch, HADOOP-9902-9.patch, HADOOP-9902.patch, HADOOP-9902.txt,
hadoop-9902-1.patch, more-info.txt

Umbrella JIRA for shell script rewrite. See more-info.txt for more details.

--
This message was sent by Atlassian JIRA
(v6.2#6252)

[jira] [Commented] (HADOOP-9902) Shell script rewrite

2014-08-21 Thread Allen Wittenauer (JIRA)


[ 
https://issues.apache.org/jira/browse/HADOOP-9902?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14105355#comment-14105355
 ] 

Allen Wittenauer commented on HADOOP-9902:
--

I've file YARN-2436 and YARN-2437 (under the new script component I added 
yesterday...) for those two issues.

 Shell script rewrite
 

 Key: HADOOP-9902
 URL: https://issues.apache.org/jira/browse/HADOOP-9902
 Project: Hadoop Common
  Issue Type: Improvement
  Components: scripts
Affects Versions: 3.0.0
Reporter: Allen Wittenauer
Assignee: Allen Wittenauer
  Labels: releasenotes
 Fix For: 3.0.0

 Attachments: HADOOP-9902-10.patch, HADOOP-9902-11.patch, 
 HADOOP-9902-12.patch, HADOOP-9902-13-branch-2.patch, HADOOP-9902-13.patch, 
 HADOOP-9902-14.patch, HADOOP-9902-15.patch, HADOOP-9902-16.patch, 
 HADOOP-9902-2.patch, HADOOP-9902-3.patch, HADOOP-9902-4.patch, 
 HADOOP-9902-5.patch, HADOOP-9902-6.patch, HADOOP-9902-7.patch, 
 HADOOP-9902-8.patch, HADOOP-9902-9.patch, HADOOP-9902.patch, HADOOP-9902.txt, 
 hadoop-9902-1.patch, more-info.txt


 Umbrella JIRA for shell script rewrite.  See more-info.txt for more details.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

[jira] [Comment Edited] (HADOOP-9902) Shell script rewrite

2014-08-21 Thread Allen Wittenauer (JIRA)


[ 
https://issues.apache.org/jira/browse/HADOOP-9902?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14105355#comment-14105355
 ] 

Allen Wittenauer edited comment on HADOOP-9902 at 8/21/14 1:10 PM:
---

I've filed YARN-2436 and YARN-2437 (under the new script component I added 
yesterday...) for those two issues.


was (Author: aw):
I've file YARN-2436 and YARN-2437 (under the new script component I added 
yesterday...) for those two issues.

 Shell script rewrite
 

 Key: HADOOP-9902
 URL: https://issues.apache.org/jira/browse/HADOOP-9902
 Project: Hadoop Common
  Issue Type: Improvement
  Components: scripts
Affects Versions: 3.0.0
Reporter: Allen Wittenauer
Assignee: Allen Wittenauer
  Labels: releasenotes
 Fix For: 3.0.0

 Attachments: HADOOP-9902-10.patch, HADOOP-9902-11.patch, 
 HADOOP-9902-12.patch, HADOOP-9902-13-branch-2.patch, HADOOP-9902-13.patch, 
 HADOOP-9902-14.patch, HADOOP-9902-15.patch, HADOOP-9902-16.patch, 
 HADOOP-9902-2.patch, HADOOP-9902-3.patch, HADOOP-9902-4.patch, 
 HADOOP-9902-5.patch, HADOOP-9902-6.patch, HADOOP-9902-7.patch, 
 HADOOP-9902-8.patch, HADOOP-9902-9.patch, HADOOP-9902.patch, HADOOP-9902.txt, 
 hadoop-9902-1.patch, more-info.txt


 Umbrella JIRA for shell script rewrite.  See more-info.txt for more details.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

[jira] [Resolved] (HADOOP-10991) 'hadoop namenode -format' fails if user hadoop homedir is not under /home

2014-08-21 Thread Allen Wittenauer (JIRA)


 [ 
https://issues.apache.org/jira/browse/HADOOP-10991?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Allen Wittenauer resolved HADOOP-10991.
---

Resolution: Cannot Reproduce

This is either:

a) permissions issue in /home/hadoop
b) misconfiguration in hdfs-site.xml

I've run namenode format in many many directories, including /home and not 
/home.

It should be noted that changing HADOOP_PREFIX has zero influence on where the 
NN fsimage gets created.

Closing as cannot reproduce.

 'hadoop namenode -format' fails if user hadoop homedir is not under /home
 -

 Key: HADOOP-10991
 URL: https://issues.apache.org/jira/browse/HADOOP-10991
 Project: Hadoop Common
  Issue Type: Bug
  Components: fs
Affects Versions: 2.5.0
 Environment: CentOS 6.5
Reporter: Yaniv Kaul
Priority: Minor
  Labels: namenode

 Since my homedir is on shared NFS, I couldn't create a user for hadoop on 
 /home. Therefore, I've used:
 useradd hadoop --home /hadoop
 which worked well. I've adjusted HADOOP_HOME and friends to match correctly. 
 Running hdfs namenode -format failed:
 {noformat}
 14/08/21 13:57:35 INFO namenode.NNConf: XAttrs enabled? true
 14/08/21 13:57:35 INFO namenode.NNConf: Maximum size of an xattr: 16384
 14/08/21 13:57:35 INFO namenode.FSImage: Allocated new BlockPoolId: 
 BP-1696511243-10.103.234.197-1408618655940
 14/08/21 13:57:35 WARN namenode.NameNode: Encountered exception during 
 format: 
 java.io.IOException: Cannot create directory 
 /home/hadoop/hadoopdata/hdfs/namenode/current
   at 
 org.apache.hadoop.hdfs.server.common.Storage$StorageDirectory.clearDirectory(Storage.java:334)
   at 
 org.apache.hadoop.hdfs.server.namenode.NNStorage.format(NNStorage.java:546)
   at 
 org.apache.hadoop.hdfs.server.namenode.NNStorage.format(NNStorage.java:567)
   at 
 org.apache.hadoop.hdfs.server.namenode.FSImage.format(FSImage.java:148)
   at 
 org.apache.hadoop.hdfs.server.namenode.NameNode.format(NameNode.java:926)
   at 
 org.apache.hadoop.hdfs.server.namenode.NameNode.createNameNode(NameNode.java:1354)
   at 
 org.apache.hadoop.hdfs.server.namenode.NameNode.main(NameNode.java:1473)
 14/08/21 13:57:35 FATAL namenode.NameNode: Exception in namenode join
 java.io.IOException: Cannot create directory 
 /home/hadoop/hadoopdata/hdfs/namenode/current
   at 
 org.apache.hadoop.hdfs.server.common.Storage$StorageDirectory.clearDirectory(Storage.java:334)
   at 
 org.apache.hadoop.hdfs.server.namenode.NNStorage.format(NNStorage.java:546)
   at 
 org.apache.hadoop.hdfs.server.namenode.NNStorage.format(NNStorage.java:567)
   at 
 org.apache.hadoop.hdfs.server.namenode.FSImage.format(FSImage.java:148)
   at 
 org.apache.hadoop.hdfs.server.namenode.NameNode.format(NameNode.java:926)
   at 
 org.apache.hadoop.hdfs.server.namenode.NameNode.createNameNode(NameNode.java:1354)
   at 
 org.apache.hadoop.hdfs.server.namenode.NameNode.main(NameNode.java:1473)
 14/08/21 13:57:35 INFO util.ExitUtil: Exiting with status 1
 14/08/21 13:57:35 INFO namenode.NameNode: SHUTDOWN_MSG: 
 /
 SHUTDOWN_MSG: Shutting down NameNode at 
 lgdrm432.xiodrm.lab.emc.com/10.103.234.197
 {noformat}
 The error seems to be clear:
 {noformat}
 java.io.IOException: Cannot create directory 
 /home/hadoop/hadoopdata/hdfs/namenode/current
 {noformat}



--
This message was sent by Atlassian JIRA
(v6.2#6252)

[jira] [Updated] (HADOOP-10950) rework heap management vars

2014-08-21 Thread Allen Wittenauer (JIRA)


 [ 
https://issues.apache.org/jira/browse/HADOOP-10950?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Allen Wittenauer updated HADOOP-10950:
--

Attachment: HADOOP-10950-01.patch

-01: fix some typos, better desc of _MAX and _MIN.  Rebased.

 rework  heap management  vars
 -

 Key: HADOOP-10950
 URL: https://issues.apache.org/jira/browse/HADOOP-10950
 Project: Hadoop Common
  Issue Type: Improvement
  Components: scripts
Reporter: Allen Wittenauer
Assignee: Allen Wittenauer
  Labels: scripts
 Attachments: HADOOP-10950-01.patch, HADOOP-10950.patch


 Post-HADOOP-9902, we need to rework how heap is configured for small 
 footprint machines, deprecate some options, introduce new ones for greater 
 flexibility.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

[jira] [Resolved] (HADOOP-10899) Hadoop CommandsManual.vm documentation gives deprecated information

2014-08-21 Thread Allen Wittenauer (JIRA)


 [ 
https://issues.apache.org/jira/browse/HADOOP-10899?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Allen Wittenauer resolved HADOOP-10899.
---

Resolution: Fixed

yes, probably time to close this one. Thanks for working on them!

 Hadoop CommandsManual.vm documentation gives deprecated information
 ---

 Key: HADOOP-10899
 URL: https://issues.apache.org/jira/browse/HADOOP-10899
 Project: Hadoop Common
  Issue Type: Bug
  Components: documentation
Reporter: Allen Wittenauer
Assignee: Akira AJISAKA
  Labels: newbie

 This is a rollup of several old JIRAs.
 The CommandsManual lists very old information about running HDFS and 
 MapReduce subcommands from the 'hadoop' shell CLI.  These are deprecated and 
 should be removed.  If necessary, the commands should be added to the 
 relevant subproject's documentation.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

[jira] [Updated] (HADOOP-10946) Fix a bunch of typos in log messages

2014-08-21 Thread Allen Wittenauer (JIRA)


 [ 
https://issues.apache.org/jira/browse/HADOOP-10946?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Allen Wittenauer updated HADOOP-10946:
--

Affects Version/s: 3.0.0

 Fix a bunch of typos in log messages
 

 Key: HADOOP-10946
 URL: https://issues.apache.org/jira/browse/HADOOP-10946
 Project: Hadoop Common
  Issue Type: Bug
Affects Versions: 3.0.0, 2.4.1
Reporter: Ray Chiang
Assignee: Ray Chiang
Priority: Trivial
  Labels: newbie
 Attachments: HADOOP-10946-04.patch, HADOOP10946-01.patch, 
 HADOOP10946-02.patch, HADOOP10946-03.patch


 There are a bunch of typos in various log messages.  These need cleaning up.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

[jira] [Commented] (HADOOP-10946) Fix a bunch of typos in log messages

2014-08-21 Thread Allen Wittenauer (JIRA)


[ 
https://issues.apache.org/jira/browse/HADOOP-10946?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14105486#comment-14105486
 ] 

Allen Wittenauer commented on HADOOP-10946:
---

I've triggered a manual jenkins test. (I think.) So weird.

 Fix a bunch of typos in log messages
 

 Key: HADOOP-10946
 URL: https://issues.apache.org/jira/browse/HADOOP-10946
 Project: Hadoop Common
  Issue Type: Bug
Affects Versions: 3.0.0, 2.4.1
Reporter: Ray Chiang
Assignee: Ray Chiang
Priority: Trivial
  Labels: newbie
 Attachments: HADOOP-10946-04.patch, HADOOP10946-01.patch, 
 HADOOP10946-02.patch, HADOOP10946-03.patch


 There are a bunch of typos in various log messages.  These need cleaning up.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

[jira] [Commented] (HADOOP-10893) isolated classloader on the client side

2014-08-21 Thread Allen Wittenauer (JIRA)


[ 
https://issues.apache.org/jira/browse/HADOOP-10893?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14105819#comment-14105819
 ] 

Allen Wittenauer commented on HADOOP-10893:
---

tl;dr: [~sjlee0]'s changes here are probably the correct ones. 

From a pure patch perspective, it does look weird.  But from a stylistic 
perspective as a part of a total work (namely, hadoop 2.x), the changes and 
lack of documentation in hadoop-env.sh, etc, to branch-2 make a lot of sense. 
One of the key points of HADOOP-9902 was to highlight to end users what things 
they could set.  Hiding that in hadoop-config.sh, which users are never 
directed to documentation-wise, didn't really work. So I pulled those out and 
popped them into hadoop-env.sh, which users definitely see.

This change just got caught in the crossfire.

 isolated classloader on the client side
 ---

 Key: HADOOP-10893
 URL: https://issues.apache.org/jira/browse/HADOOP-10893
 Project: Hadoop Common
  Issue Type: New Feature
  Components: util
Affects Versions: 2.4.0
Reporter: Sangjin Lee
Assignee: Sangjin Lee
 Attachments: HADOOP-10893-branch-2.patch, 
 HADOOP-10893-branch-2.patch, HADOOP-10893.patch, HADOOP-10893.patch, 
 HADOOP-10893.patch, HADOOP-10893.patch, HADOOP-10893.patch, 
 HADOOP-10893.patch, HADOOP-10893.patch, classloader-test.tar.gz


 We have the job classloader on the mapreduce tasks that run on the cluster. 
 It has a benefit of being able to isolate class space for user code and avoid 
 version clashes.
 Although it occurs less often, version clashes do occur on the client JVM. It 
 would be good to introduce an isolated classloader on the client side as well 
 to address this. A natural point to introduce this may be through RunJar, as 
 that's how most of hadoop jobs are run.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

[jira] [Commented] (HADOOP-9902) Shell script rewrite

2014-08-21 Thread Allen Wittenauer (JIRA)


[ 
https://issues.apache.org/jira/browse/HADOOP-9902?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14105924#comment-14105924
 ] 

Allen Wittenauer commented on HADOOP-9902:
--

bq. Found one more problem that RM and NM daemon logs are now inside *.out 
file, instead of *.log file.

As mentioned in the release notes, YARN did a bunch of heinous stuff when it 
came to log4j settings, counter to the rest of Hadoop, and to much frustration 
with ops teams.  This has been made consistent, so there is good chance you 
were relying upon that behavior.  It could be any number of things: NM and RM 
_OPT settings, dependence upon the nodemanger/log4j.settings file or 
resourcemanager/log4j.settings file, yarn-env.sh settings, etc.  You can always 
do 'bash -x yarn --daemon start resourcemanager'. The out file should contain 
the java command line.

With the shipping *-env.sh files, you should see something similar to:
{code}
java -Dproc_resourcemanager -Djava.net.preferIPv4Stack=true 
-Djava.security.krb5.realm= -Djava.security.krb5.kdc= 
-Djava.security.krb5.conf= 
-Dyarn.log.dir=/Users/aw/HADOOP/hadoop-3.0.0-SNAPSHOT/logs 
-Dyarn.log.file=hadoop-aw-resourcemanager-aw-mbp-work.local.log 
-Dyarn.home.dir=/Users/aw/HADOOP/hadoop-3.0.0-SNAPSHOT 
-Dyarn.root.logger=INFO,RFA -Xmx1g 
-Dhadoop.log.dir=/Users/aw/HADOOP/hadoop-3.0.0-SNAPSHOT/logs 
-Dhadoop.log.file=hadoop-aw-resourcemanager-aw-mbp-work.local.log 
-Dhadoop.home.dir=/Users/aw/HADOOP/hadoop-3.0.0-SNAPSHOT -Dhadoop.id.str=aw 
-Dhadoop.root.logger=INFO,RFA -Dhadoop.policy.file=hadoop-policy.xml 
-Dhadoop.security.logger=INFO,NullAppender 
org.apache.hadoop.yarn.server.resourcemanager.ResourceManager
{code}

bq. Also yarn daemon log file name earlier was starting with yarn, now it's 
changed to hadoop, not sure if this is an intentional change.

Intentional.  This is to make YARN consistent with the rest of the system.  
(Noticing a theme?)

 Shell script rewrite
 

 Key: HADOOP-9902
 URL: https://issues.apache.org/jira/browse/HADOOP-9902
 Project: Hadoop Common
  Issue Type: Improvement
  Components: scripts
Affects Versions: 3.0.0
Reporter: Allen Wittenauer
Assignee: Allen Wittenauer
  Labels: releasenotes
 Fix For: 3.0.0

 Attachments: HADOOP-9902-10.patch, HADOOP-9902-11.patch, 
 HADOOP-9902-12.patch, HADOOP-9902-13-branch-2.patch, HADOOP-9902-13.patch, 
 HADOOP-9902-14.patch, HADOOP-9902-15.patch, HADOOP-9902-16.patch, 
 HADOOP-9902-2.patch, HADOOP-9902-3.patch, HADOOP-9902-4.patch, 
 HADOOP-9902-5.patch, HADOOP-9902-6.patch, HADOOP-9902-7.patch, 
 HADOOP-9902-8.patch, HADOOP-9902-9.patch, HADOOP-9902.patch, HADOOP-9902.txt, 
 hadoop-9902-1.patch, more-info.txt


 Umbrella JIRA for shell script rewrite.  See more-info.txt for more details.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

[jira] [Commented] (HADOOP-9902) Shell script rewrite

2014-08-21 Thread Allen Wittenauer (JIRA)


[ 
https://issues.apache.org/jira/browse/HADOOP-9902?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14105960#comment-14105960
 ] 

Allen Wittenauer commented on HADOOP-9902:
--

... and, just to answer the question before it gets asked... ;)

Want to override what the RM uses for logging?  Just put this in the 
yarn-env.sh:

{code}
export YARN_RESOURCEMANAGER_OPTS=-Dhadoop.root.logger=INFO,DRFA 
-Dyarn.root.logger=INFO,DRFA
{code}

The rest of the system will fill in the blanks.

 Shell script rewrite
 

 Key: HADOOP-9902
 URL: https://issues.apache.org/jira/browse/HADOOP-9902
 Project: Hadoop Common
  Issue Type: Improvement
  Components: scripts
Affects Versions: 3.0.0
Reporter: Allen Wittenauer
Assignee: Allen Wittenauer
  Labels: releasenotes
 Fix For: 3.0.0

 Attachments: HADOOP-9902-10.patch, HADOOP-9902-11.patch, 
 HADOOP-9902-12.patch, HADOOP-9902-13-branch-2.patch, HADOOP-9902-13.patch, 
 HADOOP-9902-14.patch, HADOOP-9902-15.patch, HADOOP-9902-16.patch, 
 HADOOP-9902-2.patch, HADOOP-9902-3.patch, HADOOP-9902-4.patch, 
 HADOOP-9902-5.patch, HADOOP-9902-6.patch, HADOOP-9902-7.patch, 
 HADOOP-9902-8.patch, HADOOP-9902-9.patch, HADOOP-9902.patch, HADOOP-9902.txt, 
 hadoop-9902-1.patch, more-info.txt


 Umbrella JIRA for shell script rewrite.  See more-info.txt for more details.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

[jira] [Updated] (HADOOP-10950) rework heap management vars

2014-08-21 Thread Allen Wittenauer (JIRA)


 [ 
https://issues.apache.org/jira/browse/HADOOP-10950?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Allen Wittenauer updated HADOOP-10950:
--

Status: Patch Available  (was: Open)

 rework  heap management  vars
 -

 Key: HADOOP-10950
 URL: https://issues.apache.org/jira/browse/HADOOP-10950
 Project: Hadoop Common
  Issue Type: Improvement
  Components: scripts
Reporter: Allen Wittenauer
Assignee: Allen Wittenauer
  Labels: scripts
 Attachments: HADOOP-10950-01.patch, HADOOP-10950.patch


 Post-HADOOP-9902, we need to rework how heap is configured for small 
 footprint machines, deprecate some options, introduce new ones for greater 
 flexibility.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

[jira] [Updated] (HADOOP-10950) rework heap management vars

2014-08-21 Thread Allen Wittenauer (JIRA)


 [ 
https://issues.apache.org/jira/browse/HADOOP-10950?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Allen Wittenauer updated HADOOP-10950:
--

Status: Open  (was: Patch Available)

 rework  heap management  vars
 -

 Key: HADOOP-10950
 URL: https://issues.apache.org/jira/browse/HADOOP-10950
 Project: Hadoop Common
  Issue Type: Improvement
  Components: scripts
Reporter: Allen Wittenauer
Assignee: Allen Wittenauer
  Labels: scripts
 Attachments: HADOOP-10950-01.patch, HADOOP-10950.patch


 Post-HADOOP-9902, we need to rework how heap is configured for small 
 footprint machines, deprecate some options, introduce new ones for greater 
 flexibility.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

[jira] [Commented] (HADOOP-10950) rework heap management vars

2014-08-21 Thread Allen Wittenauer (JIRA)


[ 
https://issues.apache.org/jira/browse/HADOOP-10950?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14105971#comment-14105971
 ] 

Allen Wittenauer commented on HADOOP-10950:
---

Jenkins tests, of course, are fairly meaningless for this patch.

 rework  heap management  vars
 -

 Key: HADOOP-10950
 URL: https://issues.apache.org/jira/browse/HADOOP-10950
 Project: Hadoop Common
  Issue Type: Improvement
  Components: scripts
Reporter: Allen Wittenauer
Assignee: Allen Wittenauer
  Labels: scripts
 Attachments: HADOOP-10950-01.patch, HADOOP-10950.patch


 Post-HADOOP-9902, we need to rework how heap is configured for small 
 footprint machines, deprecate some options, introduce new ones for greater 
 flexibility.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

[jira] [Created] (HADOOP-10993) Dump java command line to *.out file

2014-08-21 Thread Allen Wittenauer (JIRA)

Allen Wittenauer created HADOOP-10993:
-

 Summary: Dump java command line to *.out file
 Key: HADOOP-10993
 URL: https://issues.apache.org/jira/browse/HADOOP-10993
 Project: Hadoop Common
  Issue Type: Improvement
Reporter: Allen Wittenauer
Priority: Minor


It might be a nice enhancement to print the contents of the java command line 
to the out file during daemon startup to help with debugging.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

[jira] [Updated] (HADOOP-8896) Javadoc points to Wrong Reader and Writer classes in SequenceFile

2014-08-21 Thread Allen Wittenauer (JIRA)


 [ 
https://issues.apache.org/jira/browse/HADOOP-8896?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Allen Wittenauer updated HADOOP-8896:
-

   Resolution: Fixed
Fix Version/s: 2.6.0
   3.0.0
   Status: Resolved  (was: Patch Available)

+1. Committing to trunk and branch-2.

Thanks!

 Javadoc points to Wrong Reader and Writer classes in SequenceFile
 -

 Key: HADOOP-8896
 URL: https://issues.apache.org/jira/browse/HADOOP-8896
 Project: Hadoop Common
  Issue Type: Improvement
  Components: documentation, io
Affects Versions: 2.0.1-alpha
Reporter: Timothy Mann
Assignee: Ray Chiang
Priority: Trivial
  Labels: sequence-file
 Fix For: 3.0.0, 2.6.0

 Attachments: HADOOP-8896-02.patch, HADOOP-8896-03.patch, 
 HADOOP8896-01.patch

   Original Estimate: 1m
  Remaining Estimate: 1m

 Line 56 of org.apache.hadoop.io.SequenceFile refers to {@link Writer}, {@link 
 Reader} in the javadoc comment describing the class SequenceFile. When the 
 javadoc is built Reader and Writer link to java.io.Reader and java.io.Writer, 
 respectively. However, they should instead refer to {@link 
 SequenceFile.Reader} and {@link SequenceFile.Writer}.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

[jira] [Commented] (HADOOP-9902) Shell script rewrite

2014-08-21 Thread Allen Wittenauer (JIRA)


[ 
https://issues.apache.org/jira/browse/HADOOP-9902?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14106294#comment-14106294
 ] 

Allen Wittenauer commented on HADOOP-9902:
--

Hmm it's basically looking for a working libexec.  You need a dir with 
hadoop-functions.sh, hadoop-config.sh, and  hdfs-config.sh (for hdfs). So you 
could construct that manually and point HADOOP_LIBEXEC_DIR to it and I think 
all would work.

But I definitely see a bug and/or feature here, depending upon ones world view. 
;)  I've filed a separate jira to cover this case (HADOOP-10996 ), as it's not 
an insignificant amount of work.

 Shell script rewrite
 

 Key: HADOOP-9902
 URL: https://issues.apache.org/jira/browse/HADOOP-9902
 Project: Hadoop Common
  Issue Type: Improvement
  Components: scripts
Affects Versions: 3.0.0
Reporter: Allen Wittenauer
Assignee: Allen Wittenauer
  Labels: releasenotes
 Fix For: 3.0.0

 Attachments: HADOOP-9902-10.patch, HADOOP-9902-11.patch, 
 HADOOP-9902-12.patch, HADOOP-9902-13-branch-2.patch, HADOOP-9902-13.patch, 
 HADOOP-9902-14.patch, HADOOP-9902-15.patch, HADOOP-9902-16.patch, 
 HADOOP-9902-2.patch, HADOOP-9902-3.patch, HADOOP-9902-4.patch, 
 HADOOP-9902-5.patch, HADOOP-9902-6.patch, HADOOP-9902-7.patch, 
 HADOOP-9902-8.patch, HADOOP-9902-9.patch, HADOOP-9902.patch, HADOOP-9902.txt, 
 hadoop-9902-1.patch, more-info.txt


 Umbrella JIRA for shell script rewrite.  See more-info.txt for more details.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

[jira] [Created] (HADOOP-10996) run hdfs, yarn, mapred, etc from build tree

2014-08-21 Thread Allen Wittenauer (JIRA)

Allen Wittenauer created HADOOP-10996:
-

 Summary: run hdfs, yarn, mapred, etc from build tree
 Key: HADOOP-10996
 URL: https://issues.apache.org/jira/browse/HADOOP-10996
 Project: Hadoop Common
  Issue Type: Improvement
  Components: scripts
Affects Versions: 3.0.0
Reporter: Allen Wittenauer


There is a developer use case for running the shell scripts from the build 
tree.  What would it take to make it work?



--
This message was sent by Atlassian JIRA
(v6.2#6252)

[jira] [Commented] (HADOOP-10996) run hdfs, yarn, mapred, etc from build tree

2014-08-21 Thread Allen Wittenauer (JIRA)

[
https://issues.apache.org/jira/browse/HADOOP-10996?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14106305#comment-14106305
]

Allen Wittenauer commented on HADOOP-10996:
---

Given this:

{code}
$ export HADOOP_COMMON_HOME=$(pwd)/$(ls -d
hadoop-common-project/hadoop-common/target/hadoop-common-*/)
$ export HADOOP_HDFS_HOME=$(pwd)/$(ls -d
hadoop-hdfs-project/hadoop-hdfs/target/hadoop-hdfs-*/)
$ export PATH=$HADOOP_COMMON_HOME/bin:$HADOOP_HDFS_HOME/bin:$PATH
$ hdfs
ERROR: Unable to exec
(path)target/hadoop-hdfs-3.0.0-SNAPSHOT/bin/../libexec/hadoop-functions.sh.
{code}

How do we make hdfs work properly?

First, what is happening?

The code tries to find where to look for hdfs-config.sh is located. It does
this by looking for ../libexec, where it finds it. It now makes the (false)
assumption that this must be the libexec dir. So it now tries to fire up
hadoop-config.sh and hadoop-functions.sh which fail.

There are a couple of different ways to solve this:

* Look to see if HADOOP_COMMON_HOME is defined and look for
hadoop-config.sh/hadoop-functions.sh is there as well.
* Throw caution to the wind and see if this stuff is in our current path.
* Do the full gamut of checks for HADOOP_HDFS_HOME, etc, for hdfs-config.sh +
the stuff above.

One sticking point is what happens if hadoop-layout.sh redefines the directory
structure? The code is sort of in a catch-22.

run hdfs, yarn, mapred, etc from build tree
---

Key: HADOOP-10996
URL: https://issues.apache.org/jira/browse/HADOOP-10996
Project: Hadoop Common
Issue Type: Improvement
Components: scripts
Affects Versions: 3.0.0
Reporter: Allen Wittenauer

There is a developer use case for running the shell scripts from the build
tree. What would it take to make it work?

--
This message was sent by Atlassian JIRA
(v6.2#6252)

[jira] [Comment Edited] (HADOOP-10996) run hdfs, yarn, mapred, etc from build tree

2014-08-21 Thread Allen Wittenauer (JIRA)

[
https://issues.apache.org/jira/browse/HADOOP-10996?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14106305#comment-14106305
]

Allen Wittenauer edited comment on HADOOP-10996 at 8/22/14 1:30 AM:

Given this:

How do we make hdfs work properly?

First, what is happening?

The code tries to find where to look for hdfs-config.sh is located. It does
this by looking for ../libexec, where it finds it. It now makes the (false)
assumption that this must be the one, true libexec dir. So it now tries to
fire up hadoop-config.sh and hadoop-functions.sh which fail.

There are a couple of different ways to solve this:

One sticking point is what happens if hadoop-layout.sh redefines the directory
structure? The code is sort of in a catch-22.

was (Author: aw):
Given this:

How do we make hdfs work properly?

First, what is happening?

There are a couple of different ways to solve this:

One sticking point is what happens if hadoop-layout.sh redefines the directory
structure? The code is sort of in a catch-22.

run hdfs, yarn, mapred, etc from build tree
---

Key: HADOOP-10996
URL: https://issues.apache.org/jira/browse/HADOOP-10996
Project: Hadoop Common
Issue Type: Improvement
Components: scripts
Affects Versions: 3.0.0
Reporter: Allen Wittenauer

There is a developer use case for running the shell scripts from the build
tree. What would it take to make it work?

--
This message was sent by Atlassian JIRA
(v6.2#6252)

[jira] [Comment Edited] (HADOOP-10996) run hdfs, yarn, mapred, etc from build tree

2014-08-21 Thread Allen Wittenauer (JIRA)


[ 
https://issues.apache.org/jira/browse/HADOOP-10996?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14106305#comment-14106305
 ] 

Allen Wittenauer edited comment on HADOOP-10996 at 8/22/14 1:31 AM:


(Source: 
https://issues.apache.org/jira/secure/EditComment!default.jspa?id=12665400commentId=14106278)

Given this:

{code}
$ export HADOOP_COMMON_HOME=$(pwd)/$(ls -d 
hadoop-common-project/hadoop-common/target/hadoop-common-*/)
$ export HADOOP_HDFS_HOME=$(pwd)/$(ls -d 
hadoop-hdfs-project/hadoop-hdfs/target/hadoop-hdfs-*/)
$ export PATH=$HADOOP_COMMON_HOME/bin:$HADOOP_HDFS_HOME/bin:$PATH
$ hdfs
ERROR: Unable to exec 
(path)target/hadoop-hdfs-3.0.0-SNAPSHOT/bin/../libexec/hadoop-functions.sh.
{code}

How do we make hdfs work properly?

First, what is happening?

The code tries to find where to look for hdfs-config.sh is located.  It does 
this by looking for ../libexec, where it finds it.  It now makes the (false) 
assumption that this must be the one, true libexec dir.  So it now tries to 
fire up hadoop-config.sh and hadoop-functions.sh which fail.

There are a couple of different ways to solve this:

* Look to see if HADOOP_COMMON_HOME is defined and look for 
hadoop-config.sh/hadoop-functions.sh is there as well.
* Throw caution to the wind and see if this stuff is in our current path.
* Do the full gamut of checks for HADOOP_HDFS_HOME, etc, for hdfs-config.sh + 
the stuff above.

One sticking point is what happens if hadoop-layout.sh redefines the directory 
structure?  The code is sort of in a catch-22.


was (Author: aw):
Given this:

{code}
$ export HADOOP_COMMON_HOME=$(pwd)/$(ls -d 
hadoop-common-project/hadoop-common/target/hadoop-common-*/)
$ export HADOOP_HDFS_HOME=$(pwd)/$(ls -d 
hadoop-hdfs-project/hadoop-hdfs/target/hadoop-hdfs-*/)
$ export PATH=$HADOOP_COMMON_HOME/bin:$HADOOP_HDFS_HOME/bin:$PATH
$ hdfs
ERROR: Unable to exec 
(path)target/hadoop-hdfs-3.0.0-SNAPSHOT/bin/../libexec/hadoop-functions.sh.
{code}

How do we make hdfs work properly?

First, what is happening?

The code tries to find where to look for hdfs-config.sh is located.  It does 
this by looking for ../libexec, where it finds it.  It now makes the (false) 
assumption that this must be the one, true libexec dir.  So it now tries to 
fire up hadoop-config.sh and hadoop-functions.sh which fail.

There are a couple of different ways to solve this:

* Look to see if HADOOP_COMMON_HOME is defined and look for 
hadoop-config.sh/hadoop-functions.sh is there as well.
* Throw caution to the wind and see if this stuff is in our current path.
* Do the full gamut of checks for HADOOP_HDFS_HOME, etc, for hdfs-config.sh + 
the stuff above.

One sticking point is what happens if hadoop-layout.sh redefines the directory 
structure?  The code is sort of in a catch-22.

 run hdfs, yarn, mapred, etc from build tree
 ---

 Key: HADOOP-10996
 URL: https://issues.apache.org/jira/browse/HADOOP-10996
 Project: Hadoop Common
  Issue Type: Improvement
  Components: scripts
Affects Versions: 3.0.0
Reporter: Allen Wittenauer

 There is a developer use case for running the shell scripts from the build 
 tree.  What would it take to make it work?



--
This message was sent by Atlassian JIRA
(v6.2#6252)

[jira] [Updated] (HADOOP-10996) run hdfs, yarn, mapred, etc from target tree

2014-08-21 Thread Allen Wittenauer (JIRA)


 [ 
https://issues.apache.org/jira/browse/HADOOP-10996?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Allen Wittenauer updated HADOOP-10996:
--

Summary: run hdfs, yarn, mapred, etc from target tree  (was: run hdfs, 
yarn, mapred, etc from build tree)

 run hdfs, yarn, mapred, etc from target tree
 

 Key: HADOOP-10996
 URL: https://issues.apache.org/jira/browse/HADOOP-10996
 Project: Hadoop Common
  Issue Type: Improvement
  Components: scripts
Affects Versions: 3.0.0
Reporter: Allen Wittenauer

 There is a developer use case for running the shell scripts from the build 
 tree.  What would it take to make it work?



--
This message was sent by Atlassian JIRA
(v6.2#6252)

[jira] [Comment Edited] (HADOOP-10996) run hdfs, yarn, mapred, etc from target tree

2014-08-21 Thread Allen Wittenauer (JIRA)


[ 
https://issues.apache.org/jira/browse/HADOOP-10996?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14106305#comment-14106305
 ] 

Allen Wittenauer edited comment on HADOOP-10996 at 8/22/14 1:33 AM:


(Source: 
https://issues.apache.org/jira/secure/EditComment!default.jspa?id=12665400commentId=14106278)

Given this:

{code}
$ export HADOOP_COMMON_HOME=$(pwd)/$(ls -d 
hadoop-common-project/hadoop-common/target/hadoop-common-*/)
$ export HADOOP_HDFS_HOME=$(pwd)/$(ls -d 
hadoop-hdfs-project/hadoop-hdfs/target/hadoop-hdfs-*/)
$ export PATH=$HADOOP_COMMON_HOME/bin:$HADOOP_HDFS_HOME/bin:$PATH
$ hdfs
ERROR: Unable to exec 
(path)target/hadoop-hdfs-3.0.0-SNAPSHOT/bin/../libexec/hadoop-functions.sh.
{code}

How do we make hdfs work properly?

First, what is happening?

The code tries to find where to look for hdfs-config.sh is located.  It does 
this by looking for ../libexec, where it finds it.  It now makes the (false) 
assumption that this must be the one, true libexec dir.  So it now tries to 
fire up hadoop-config.sh and hadoop-functions.sh which fail.

There are a couple of different ways to solve this:

* Look to see if HADOOP_COMMON_HOME is defined and look for 
hadoop-config.sh/hadoop-functions.sh is there as well.
* Throw caution to the wind and see if this stuff is in our current path.
* Recalculate HADOOP_LIBEXEC_DIR in hadoop-config.sh might work too, since 
clearly hdfs found it.
* Do the full gamut of checks for HADOOP_HDFS_HOME, etc, for hdfs-config.sh + 
some of the stuff above.

One sticking point is what happens if hadoop-layout.sh redefines the directory 
structure?  The code is sort of in a catch-22.


was (Author: aw):
(Source: 
https://issues.apache.org/jira/secure/EditComment!default.jspa?id=12665400commentId=14106278)

Given this:

{code}
$ export HADOOP_COMMON_HOME=$(pwd)/$(ls -d 
hadoop-common-project/hadoop-common/target/hadoop-common-*/)
$ export HADOOP_HDFS_HOME=$(pwd)/$(ls -d 
hadoop-hdfs-project/hadoop-hdfs/target/hadoop-hdfs-*/)
$ export PATH=$HADOOP_COMMON_HOME/bin:$HADOOP_HDFS_HOME/bin:$PATH
$ hdfs
ERROR: Unable to exec 
(path)target/hadoop-hdfs-3.0.0-SNAPSHOT/bin/../libexec/hadoop-functions.sh.
{code}

How do we make hdfs work properly?

First, what is happening?

The code tries to find where to look for hdfs-config.sh is located.  It does 
this by looking for ../libexec, where it finds it.  It now makes the (false) 
assumption that this must be the one, true libexec dir.  So it now tries to 
fire up hadoop-config.sh and hadoop-functions.sh which fail.

There are a couple of different ways to solve this:

* Look to see if HADOOP_COMMON_HOME is defined and look for 
hadoop-config.sh/hadoop-functions.sh is there as well.
* Throw caution to the wind and see if this stuff is in our current path.
* Do the full gamut of checks for HADOOP_HDFS_HOME, etc, for hdfs-config.sh + 
the stuff above.

One sticking point is what happens if hadoop-layout.sh redefines the directory 
structure?  The code is sort of in a catch-22.

 run hdfs, yarn, mapred, etc from target tree
 

 Key: HADOOP-10996
 URL: https://issues.apache.org/jira/browse/HADOOP-10996
 Project: Hadoop Common
  Issue Type: Improvement
  Components: scripts
Affects Versions: 3.0.0
Reporter: Allen Wittenauer

 There is a developer use case for running the shell scripts from the build 
 tree.  What would it take to make it work?



--
This message was sent by Atlassian JIRA
(v6.2#6252)

[jira] [Comment Edited] (HADOOP-10996) run hdfs, yarn, mapred, etc from target tree

2014-08-21 Thread Allen Wittenauer (JIRA)


[ 
https://issues.apache.org/jira/browse/HADOOP-10996?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14106321#comment-14106321
 ] 

Allen Wittenauer edited comment on HADOOP-10996 at 8/22/14 1:41 AM:


OK, hdfs-config.sh does to the right thing (although it could be argued the 
order should be reversed):

{code}
if [ -e ${HADOOP_LIBEXEC_DIR}/hadoop-config.sh ]; then
  . ${HADOOP_LIBEXEC_DIR}/hadoop-config.sh
elif [ -e ${HADOOP_COMMON_HOME}/libexec/hadoop-config.sh ]; then
  . ${HADOOP_COMMON_HOME}/libexec/hadoop-config.sh
elif [ -e ${HADOOP_HOME}/libexec/hadoop-config.sh ]; then
  . ${HADOOP_HOME}/libexec/hadoop-config.sh
else
  echo ERROR: Hadoop common not found. 21
  exit 1
fi
{code}

So it's really hadoop-config.sh that's broken here:

{code}
# get our functions defined for usage later
if [[ -f ${HADOOP_LIBEXEC_DIR}/hadoop-functions.sh ]]; then
  . ${HADOOP_LIBEXEC_DIR}/hadoop-functions.sh
else
  echo ERROR: Unable to exec ${HADOOP_LIBEXEC_DIR}/hadoop-functions.sh. 12
  exit 1
fi

# allow overrides of the above and pre-defines of the below
if [[ -f ${HADOOP_LIBEXEC_DIR}/hadoop-layout.sh ]]; then
  . ${HADOOP_LIBEXEC_DIR}/hadoop-layout.sh
fi
{code}

This is going to be a relatively easy fix, I think.  We just need to add checks 
for HADOOP_COMMON_HOME prior to using HADOOP_LIBEXEC_DIR.


was (Author: aw):
OK, hdfs-config.sh does to the right thing:

{code}
if [ -e ${HADOOP_LIBEXEC_DIR}/hadoop-config.sh ]; then
  . ${HADOOP_LIBEXEC_DIR}/hadoop-config.sh
elif [ -e ${HADOOP_COMMON_HOME}/libexec/hadoop-config.sh ]; then
  . ${HADOOP_COMMON_HOME}/libexec/hadoop-config.sh
elif [ -e ${HADOOP_HOME}/libexec/hadoop-config.sh ]; then
  . ${HADOOP_HOME}/libexec/hadoop-config.sh
else
  echo ERROR: Hadoop common not found. 21
  exit 1
fi
{code}

So it's really hadoop-config.sh that's broken here:

{code}
# get our functions defined for usage later
if [[ -f ${HADOOP_LIBEXEC_DIR}/hadoop-functions.sh ]]; then
  . ${HADOOP_LIBEXEC_DIR}/hadoop-functions.sh
else
  echo ERROR: Unable to exec ${HADOOP_LIBEXEC_DIR}/hadoop-functions.sh. 12
  exit 1
fi

# allow overrides of the above and pre-defines of the below
if [[ -f ${HADOOP_LIBEXEC_DIR}/hadoop-layout.sh ]]; then
  . ${HADOOP_LIBEXEC_DIR}/hadoop-layout.sh
fi
{code}

This is going to be a relatively easy fix, I think.  We just need to add checks 
for HADOOP_COMMON_HOME prior to using HADOOP_LIBEXEC_DIR.

 run hdfs, yarn, mapred, etc from target tree
 

 Key: HADOOP-10996
 URL: https://issues.apache.org/jira/browse/HADOOP-10996
 Project: Hadoop Common
  Issue Type: Improvement
  Components: scripts
Affects Versions: 3.0.0
Reporter: Allen Wittenauer

 There is a developer use case for running the shell scripts from the build 
 tree.  What would it take to make it work?



--
This message was sent by Atlassian JIRA
(v6.2#6252)

[jira] [Updated] (HADOOP-10996) run hdfs, yarn, mapred, etc from target tree

2014-08-21 Thread Allen Wittenauer (JIRA)


 [ 
https://issues.apache.org/jira/browse/HADOOP-10996?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Allen Wittenauer updated HADOOP-10996:
--

Attachment: HADOOP-10996.patch

Patch that fixes hadoop-config.sh to use HADOOP_COMMON_HOME/libexec if it can't 
find it in HADOOP_LIBEXEC_DIR as well as fixes two bugs in HADOOP_HDFS_HOME and 
HADOOP_MAPRED_HOME definitions when they aren't defined.

 run hdfs, yarn, mapred, etc from target tree
 

 Key: HADOOP-10996
 URL: https://issues.apache.org/jira/browse/HADOOP-10996
 Project: Hadoop Common
  Issue Type: Improvement
  Components: scripts
Affects Versions: 3.0.0
Reporter: Allen Wittenauer
 Attachments: HADOOP-10996.patch


 There is a developer use case for running the shell scripts from the build 
 tree.  What would it take to make it work?



--
This message was sent by Atlassian JIRA
(v6.2#6252)

[jira] [Commented] (HADOOP-10996) run hdfs, yarn, mapred, etc from target tree

2014-08-21 Thread Allen Wittenauer (JIRA)


[ 
https://issues.apache.org/jira/browse/HADOOP-10996?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14106350#comment-14106350
 ] 

Allen Wittenauer commented on HADOOP-10996:
---

FWIW, I opted to reverse the order because I remembered why I did it in the 
other code as well:  in NORMAL operating modes, HADOOP_LIBEXEC_DIR is the 
correct place.

 run hdfs, yarn, mapred, etc from target tree
 

 Key: HADOOP-10996
 URL: https://issues.apache.org/jira/browse/HADOOP-10996
 Project: Hadoop Common
  Issue Type: Improvement
  Components: scripts
Affects Versions: 3.0.0
Reporter: Allen Wittenauer
 Attachments: HADOOP-10996.patch


 There is a developer use case for running the shell scripts from the build 
 tree.  What would it take to make it work?



--
This message was sent by Atlassian JIRA
(v6.2#6252)

[jira] [Updated] (HADOOP-10996) run hdfs, yarn, mapred, etc from target tree

2014-08-21 Thread Allen Wittenauer (JIRA)


 [ 
https://issues.apache.org/jira/browse/HADOOP-10996?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Allen Wittenauer updated HADOOP-10996:
--

Attachment: HADOOP-10996-01.patch



-01:  Wait... wait... wait... We should NOT be using HADOOP_HOME for anything! 
So let's fix that too.

 run hdfs, yarn, mapred, etc from target tree
 

 Key: HADOOP-10996
 URL: https://issues.apache.org/jira/browse/HADOOP-10996
 Project: Hadoop Common
  Issue Type: Improvement
  Components: scripts
Affects Versions: 3.0.0
Reporter: Allen Wittenauer
 Attachments: HADOOP-10996-01.patch, HADOOP-10996.patch


 There is a developer use case for running the shell scripts from the build 
 tree.  What would it take to make it work?



--
This message was sent by Atlassian JIRA
(v6.2#6252)

[jira] [Assigned] (HADOOP-10996) run hdfs, yarn, mapred, etc from target tree

2014-08-21 Thread Allen Wittenauer (JIRA)


 [ 
https://issues.apache.org/jira/browse/HADOOP-10996?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Allen Wittenauer reassigned HADOOP-10996:
-

Assignee: Allen Wittenauer

 run hdfs, yarn, mapred, etc from target tree
 

 Key: HADOOP-10996
 URL: https://issues.apache.org/jira/browse/HADOOP-10996
 Project: Hadoop Common
  Issue Type: Improvement
  Components: scripts
Affects Versions: 3.0.0
Reporter: Allen Wittenauer
Assignee: Allen Wittenauer
 Attachments: HADOOP-10996-01.patch, HADOOP-10996.patch


 There is a developer use case for running the shell scripts from the build 
 tree.  What would it take to make it work?



--
This message was sent by Atlassian JIRA
(v6.2#6252)

[jira] [Updated] (HADOOP-10996) run hdfs, yarn, mapred, etc from target tree

2014-08-21 Thread Allen Wittenauer (JIRA)


 [ 
https://issues.apache.org/jira/browse/HADOOP-10996?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Allen Wittenauer updated HADOOP-10996:
--

Status: Patch Available  (was: Open)

 run hdfs, yarn, mapred, etc from target tree
 

 Key: HADOOP-10996
 URL: https://issues.apache.org/jira/browse/HADOOP-10996
 Project: Hadoop Common
  Issue Type: Improvement
  Components: scripts
Affects Versions: 3.0.0
Reporter: Allen Wittenauer
 Attachments: HADOOP-10996-01.patch, HADOOP-10996.patch


 There is a developer use case for running the shell scripts from the build 
 tree.  What would it take to make it work?



--
This message was sent by Atlassian JIRA
(v6.2#6252)

[jira] [Commented] (HADOOP-10998) Bash tab completion support for newer versions of hadoop

2014-08-22 Thread Allen Wittenauer (JIRA)


[ 
https://issues.apache.org/jira/browse/HADOOP-10998?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14106828#comment-14106828
 ] 

Allen Wittenauer commented on HADOOP-10998:
---

It'd probably be better if you took at a look at and updated HADOOP-6310, as it 
has some other fixes in it as well I think.

 Bash tab completion support for newer versions of hadoop
 

 Key: HADOOP-10998
 URL: https://issues.apache.org/jira/browse/HADOOP-10998
 Project: Hadoop Common
  Issue Type: Improvement
  Components: scripts
 Environment: Hadoop 2.0.0-cdh4.2.1
Reporter: Jim Hester
Priority: Trivial
 Attachments: HADOOP-10998.patch


 The included version of bash completion has not been updated in almost two 
 years and does not work correctly with more recent versions of Hadoop.
 The command substitutions in the script were using backticks, which were 
 interpolating variables before they were executed and breaking the script.  
 In particular the awk commands were breaking due to this interpolation.
 The attached patch changes the backticks to {{$(command)}} style for command 
 substitution and updates the script to work properly with newer versions of 
 hadoop.
 I tested the script using {{zsh 5.0.5}}, {{zsh 4.3.10]}, {{bash-4.1.2}} and 
 {{bash-3.2}}.  Hadoop versions tested were cloudera version 
 {{2.0.0-cdh4.2.1}} and briefly Apache {{2.5.0}}.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

[jira] [Updated] (HADOOP-10996) [post-HADOOP-9902] Stop the violence in the *_HOME

2014-08-22 Thread Allen Wittenauer (JIRA)


 [ 
https://issues.apache.org/jira/browse/HADOOP-10996?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Allen Wittenauer updated HADOOP-10996:
--

Summary: [post-HADOOP-9902] Stop the violence in the *_HOME  (was: run 
hdfs, yarn, mapred, etc from target tree)

 [post-HADOOP-9902] Stop the violence in the *_HOME
 --

 Key: HADOOP-10996
 URL: https://issues.apache.org/jira/browse/HADOOP-10996
 Project: Hadoop Common
  Issue Type: Improvement
  Components: scripts
Affects Versions: 3.0.0
Reporter: Allen Wittenauer
Assignee: Allen Wittenauer
 Attachments: HADOOP-10996-01.patch, HADOOP-10996.patch


 There is a developer use case for running the shell scripts from the build 
 tree.  What would it take to make it work?



--
This message was sent by Atlassian JIRA
(v6.2#6252)

[jira] [Updated] (HADOOP-10996) [post-HADOOP-9902] Stop the violence in the *_HOME

2014-08-22 Thread Allen Wittenauer (JIRA)


 [ 
https://issues.apache.org/jira/browse/HADOOP-10996?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Allen Wittenauer updated HADOOP-10996:
--

Description: 
(Updated from original description)

There are various places where the various HOME directories are missing or 
mis-defined. 

  was:There are various places where the various HOME directories are missing 
or mis-defined.


 [post-HADOOP-9902] Stop the violence in the *_HOME
 --

 Key: HADOOP-10996
 URL: https://issues.apache.org/jira/browse/HADOOP-10996
 Project: Hadoop Common
  Issue Type: Improvement
  Components: scripts
Affects Versions: 3.0.0
Reporter: Allen Wittenauer
Assignee: Allen Wittenauer
 Attachments: HADOOP-10996-01.patch, HADOOP-10996.patch


 (Updated from original description)
 There are various places where the various HOME directories are missing or 
 mis-defined. 



--
This message was sent by Atlassian JIRA
(v6.2#6252)

[jira] [Updated] (HADOOP-10996) [post-HADOOP-9902] Stop violence in the *_HOME

2014-08-22 Thread Allen Wittenauer (JIRA)


 [ 
https://issues.apache.org/jira/browse/HADOOP-10996?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Allen Wittenauer updated HADOOP-10996:
--

Summary: [post-HADOOP-9902] Stop violence in the *_HOME  (was: 
[post-HADOOP-9902] Stop the violence in the *_HOME)

 [post-HADOOP-9902] Stop violence in the *_HOME
 --

 Key: HADOOP-10996
 URL: https://issues.apache.org/jira/browse/HADOOP-10996
 Project: Hadoop Common
  Issue Type: Improvement
  Components: scripts
Affects Versions: 3.0.0
Reporter: Allen Wittenauer
Assignee: Allen Wittenauer
 Attachments: HADOOP-10996-01.patch, HADOOP-10996.patch


 (Updated from original description)
 There are various places where the various HOME directories are missing or 
 mis-defined. 



--
This message was sent by Atlassian JIRA
(v6.2#6252)

[jira] [Updated] (HADOOP-10996) [post-HADOOP-9902] Stop the violence in the *_HOME

2014-08-22 Thread Allen Wittenauer (JIRA)


 [ 
https://issues.apache.org/jira/browse/HADOOP-10996?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Allen Wittenauer updated HADOOP-10996:
--

Description: There are various places where the various HOME directories 
are missing or mis-defined.  (was: There is a developer use case for running 
the shell scripts from the build tree.  What would it take to make it work?)

 [post-HADOOP-9902] Stop the violence in the *_HOME
 --

 Key: HADOOP-10996
 URL: https://issues.apache.org/jira/browse/HADOOP-10996
 Project: Hadoop Common
  Issue Type: Improvement
  Components: scripts
Affects Versions: 3.0.0
Reporter: Allen Wittenauer
Assignee: Allen Wittenauer
 Attachments: HADOOP-10996-01.patch, HADOOP-10996.patch


 There are various places where the various HOME directories are missing or 
 mis-defined.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

[jira] [Updated] (HADOOP-10998) Bash tab completion support for newer versions of hadoop

2014-08-22 Thread Allen Wittenauer (JIRA)


 [ 
https://issues.apache.org/jira/browse/HADOOP-10998?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Allen Wittenauer updated HADOOP-10998:
--

   Resolution: Fixed
Fix Version/s: 2.6.0
   3.0.0
   Status: Resolved  (was: Patch Available)

+1, I'll commit this to branch-2 and trunk. Thanks!

For future generations:

The bash tab completion code still has a lot of problems.  Ideally, we'd fix 
them all with one patch that would be a rewrite of the current code.  (In fact, 
I wouldn't recommend people actually use this feature at all.) But at least 
this patch moves the code forward in a positive way.  There is still a lot of 
work to be done.

 Bash tab completion support for newer versions of hadoop
 

 Key: HADOOP-10998
 URL: https://issues.apache.org/jira/browse/HADOOP-10998
 Project: Hadoop Common
  Issue Type: Improvement
  Components: scripts
 Environment: Hadoop 2.0.0-cdh4.2.1
Reporter: Jim Hester
Priority: Trivial
 Fix For: 3.0.0, 2.6.0

 Attachments: HADOOP-10998.patch


 The included version of bash completion has not been updated in almost two 
 years and does not work correctly with more recent versions of Hadoop.
 The command substitutions in the script were using backticks, which were 
 interpolating variables before they were executed and breaking the script.  
 In particular the awk commands were breaking due to this interpolation.
 The attached patch changes the backticks to {{$(command)}} style for command 
 substitution and updates the script to work properly with newer versions of 
 hadoop.
 I tested the script using {{zsh 5.0.5}}, {{zsh 4.3.10]}, {{bash-4.1.2}} and 
 {{bash-3.2}}.  Hadoop versions tested were cloudera version 
 {{2.0.0-cdh4.2.1}} and briefly Apache {{2.5.0}}.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

[jira] [Updated] (HADOOP-10998) Fix bash tab completion code to work

2014-08-22 Thread Allen Wittenauer (JIRA)


 [ 
https://issues.apache.org/jira/browse/HADOOP-10998?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Allen Wittenauer updated HADOOP-10998:
--

Summary: Fix bash tab completion code to work  (was: Bash tab completion 
support for newer versions of hadoop)

 Fix bash tab completion code to work
 

 Key: HADOOP-10998
 URL: https://issues.apache.org/jira/browse/HADOOP-10998
 Project: Hadoop Common
  Issue Type: Improvement
  Components: scripts
 Environment: Hadoop 2.0.0-cdh4.2.1
Reporter: Jim Hester
Priority: Trivial
 Fix For: 3.0.0, 2.6.0

 Attachments: HADOOP-10998.patch


 The included version of bash completion has not been updated in almost two 
 years and does not work correctly with more recent versions of Hadoop.
 The command substitutions in the script were using backticks, which were 
 interpolating variables before they were executed and breaking the script.  
 In particular the awk commands were breaking due to this interpolation.
 The attached patch changes the backticks to {{$(command)}} style for command 
 substitution and updates the script to work properly with newer versions of 
 hadoop.
 I tested the script using {{zsh 5.0.5}}, {{zsh 4.3.10]}, {{bash-4.1.2}} and 
 {{bash-3.2}}.  Hadoop versions tested were cloudera version 
 {{2.0.0-cdh4.2.1}} and briefly Apache {{2.5.0}}.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

[jira] [Updated] (HADOOP-10998) Fix bash tab completion code to work

2014-08-22 Thread Allen Wittenauer (JIRA)


 [ 
https://issues.apache.org/jira/browse/HADOOP-10998?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Allen Wittenauer updated HADOOP-10998:
--

Assignee: Jim Hester

 Fix bash tab completion code to work
 

 Key: HADOOP-10998
 URL: https://issues.apache.org/jira/browse/HADOOP-10998
 Project: Hadoop Common
  Issue Type: Improvement
  Components: scripts
 Environment: Hadoop 2.0.0-cdh4.2.1
Reporter: Jim Hester
Assignee: Jim Hester
Priority: Trivial
 Fix For: 3.0.0, 2.6.0

 Attachments: HADOOP-10998.patch


 The included version of bash completion has not been updated in almost two 
 years and does not work correctly with more recent versions of Hadoop.
 The command substitutions in the script were using backticks, which were 
 interpolating variables before they were executed and breaking the script.  
 In particular the awk commands were breaking due to this interpolation.
 The attached patch changes the backticks to {{$(command)}} style for command 
 substitution and updates the script to work properly with newer versions of 
 hadoop.
 I tested the script using {{zsh 5.0.5}}, {{zsh 4.3.10]}, {{bash-4.1.2}} and 
 {{bash-3.2}}.  Hadoop versions tested were cloudera version 
 {{2.0.0-cdh4.2.1}} and briefly Apache {{2.5.0}}.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

[jira] [Resolved] (HADOOP-6310) bash completion doesn't quite work.

2014-08-22 Thread Allen Wittenauer (JIRA)


 [ 
https://issues.apache.org/jira/browse/HADOOP-6310?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Allen Wittenauer resolved HADOOP-6310.
--

Resolution: Duplicate

Closing this as a dupe of HADOOP-10998.  See commit comment there.

 bash completion doesn't quite work.
 ---

 Key: HADOOP-6310
 URL: https://issues.apache.org/jira/browse/HADOOP-6310
 Project: Hadoop Common
  Issue Type: Bug
Affects Versions: 0.18.3
Reporter: Paul Huff
Assignee: Sean Mackrory
Priority: Trivial
 Attachments: HADOOP-6310.patch, HADOOP-6310.patch.1


 The bash completion script in src/contrib/bash-tab-completion/hadoop.sh 
 doesn't quite work the way you'd expect it to against 18.3 (and I assume 
 anything afterwards, since the author claimed compatibility with 16-20).
 It doesn't complete things like you'd expect against HDFS, and it doesn't 
 have job-id completion.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

[jira] [Updated] (HADOOP-10996) [post-HADOOP-9902] Stop violence in the *_HOME

2014-08-25 Thread Allen Wittenauer (JIRA)


 [ 
https://issues.apache.org/jira/browse/HADOOP-10996?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Allen Wittenauer updated HADOOP-10996:
--

Status: Open  (was: Patch Available)

Cancelling patch -01.

After working with it, I've found some edge and not-so-edge cases that either:

a) are made worse (usage of *_HOME should be viewed as exceptions to _PREFIX, 
not as an all or nothing scenario)
b) aren't covered (etc/hadoop/*-site.xml comes from *_HOME)



 [post-HADOOP-9902] Stop violence in the *_HOME
 --

 Key: HADOOP-10996
 URL: https://issues.apache.org/jira/browse/HADOOP-10996
 Project: Hadoop Common
  Issue Type: Improvement
  Components: scripts
Affects Versions: 3.0.0
Reporter: Allen Wittenauer
Assignee: Allen Wittenauer
 Attachments: HADOOP-10996-01.patch, HADOOP-10996.patch


 (Updated from original description)
 There are various places where the various HOME directories are missing or 
 mis-defined. 



--
This message was sent by Atlassian JIRA
(v6.2#6252)

[jira] [Comment Edited] (HADOOP-10996) [post-HADOOP-9902] Stop violence in the *_HOME

2014-08-25 Thread Allen Wittenauer (JIRA)


[ 
https://issues.apache.org/jira/browse/HADOOP-10996?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14109162#comment-14109162
 ] 

Allen Wittenauer edited comment on HADOOP-10996 at 8/25/14 2:57 PM:


Cancelling patch -01.

After working with it, I've found some edge and not-so-edge cases that either:

a) are made worse (e.g., usage of *_HOME should be viewed as exceptions to 
_PREFIX, not as an all or nothing scenario)
b) aren't covered (e.g., etc/hadoop/*-site.xml comes from *_HOME)




was (Author: aw):
Cancelling patch -01.

After working with it, I've found some edge and not-so-edge cases that either:

a) are made worse (usage of *_HOME should be viewed as exceptions to _PREFIX, 
not as an all or nothing scenario)
b) aren't covered (etc/hadoop/*-site.xml comes from *_HOME)



 [post-HADOOP-9902] Stop violence in the *_HOME
 --

 Key: HADOOP-10996
 URL: https://issues.apache.org/jira/browse/HADOOP-10996
 Project: Hadoop Common
  Issue Type: Improvement
  Components: scripts
Affects Versions: 3.0.0
Reporter: Allen Wittenauer
Assignee: Allen Wittenauer
 Attachments: HADOOP-10996-01.patch, HADOOP-10996.patch


 (Updated from original description)
 There are various places where the various HOME directories are missing or 
 mis-defined. 



--
This message was sent by Atlassian JIRA
(v6.2#6252)

[jira] [Created] (HADOOP-11002) shell escapes are incompatible with previous releases

2014-08-25 Thread Allen Wittenauer (JIRA)

Allen Wittenauer created HADOOP-11002:
-

 Summary: shell escapes are incompatible with previous releases
 Key: HADOOP-11002
 URL: https://issues.apache.org/jira/browse/HADOOP-11002
 Project: Hadoop Common
  Issue Type: Bug
  Components: scripts
Reporter: Allen Wittenauer


Post-HADOOP-9902, the following in namenode opts doesn't work without being 
escaped:

{code}
-XX:HeapDumpPath=./java_pid_pid.hprof
{code}

This is a bit of surprising behavior to the users.  The breakage is directly 
result of the code that fixes spaces in directories.  Since it is much more 
likely to hit weird metacharacters in shell than have directories with spaces, 
that part of HADOOP-9902 needs to get replaced.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

[jira] [Commented] (HADOOP-9902) Shell script rewrite

2014-08-25 Thread Allen Wittenauer (JIRA)


[ 
https://issues.apache.org/jira/browse/HADOOP-9902?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14109673#comment-14109673
 ] 

Allen Wittenauer commented on HADOOP-9902:
--

FYI: HADOOP-11002 - shell escapes are incompatible with previous releases

 Shell script rewrite
 

 Key: HADOOP-9902
 URL: https://issues.apache.org/jira/browse/HADOOP-9902
 Project: Hadoop Common
  Issue Type: Improvement
  Components: scripts
Affects Versions: 3.0.0
Reporter: Allen Wittenauer
Assignee: Allen Wittenauer
  Labels: releasenotes
 Fix For: 3.0.0

 Attachments: HADOOP-9902-10.patch, HADOOP-9902-11.patch, 
 HADOOP-9902-12.patch, HADOOP-9902-13-branch-2.patch, HADOOP-9902-13.patch, 
 HADOOP-9902-14.patch, HADOOP-9902-15.patch, HADOOP-9902-16.patch, 
 HADOOP-9902-2.patch, HADOOP-9902-3.patch, HADOOP-9902-4.patch, 
 HADOOP-9902-5.patch, HADOOP-9902-6.patch, HADOOP-9902-7.patch, 
 HADOOP-9902-8.patch, HADOOP-9902-9.patch, HADOOP-9902.patch, HADOOP-9902.txt, 
 hadoop-9902-1.patch, more-info.txt


 Umbrella JIRA for shell script rewrite.  See more-info.txt for more details.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

[jira] [Updated] (HADOOP-11002) shell escapes are incompatible with previous releases

2014-08-25 Thread Allen Wittenauer (JIRA)

[
https://issues.apache.org/jira/browse/HADOOP-11002?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Allen Wittenauer updated HADOOP-11002:
--

Description:
Post-HADOOP-9902, the following in xyz_OPTS doesn't work without being escaped:

{code}
-XX:HeapDumpPath=./java_pid_pid.hprof
{code}

This is a bit of surprising behavior to the users. The breakage is directly
result of the code that fixes spaces in directories. Since it is much more
likely to hit weird metacharacters in shell than have directories with spaces,
that part of HADOOP-9902 needs to get replaced.

was:
Post-HADOOP-9902, the following in namenode opts doesn't work without being
escaped:

{code}
-XX:HeapDumpPath=./java_pid_pid.hprof
{code}

shell escapes are incompatible with previous releases
-

Key: HADOOP-11002
URL: https://issues.apache.org/jira/browse/HADOOP-11002
Project: Hadoop Common
Issue Type: Bug
Components: scripts
Reporter: Allen Wittenauer

Post-HADOOP-9902, the following in xyz_OPTS doesn't work without being
escaped:
{code}
-XX:HeapDumpPath=./java_pid_pid.hprof
{code}
This is a bit of surprising behavior to the users. The breakage is directly
result of the code that fixes spaces in directories. Since it is much more
likely to hit weird metacharacters in shell than have directories with
spaces, that part of HADOOP-9902 needs to get replaced.

--
This message was sent by Atlassian JIRA
(v6.2#6252)

[jira] [Commented] (HADOOP-11002) shell escapes are incompatible with previous releases

2014-08-25 Thread Allen Wittenauer (JIRA)


[ 
https://issues.apache.org/jira/browse/HADOOP-11002?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14110201#comment-14110201
 ] 

Allen Wittenauer commented on HADOOP-11002:
---

After playing with this for most of today, I've come to the conclusion that 
we're going to have to drop support for directories with spaces.  There's 
simply no, 100% reliable way to make them work with a global var like 
HADOOP_OPTS without doing some really bizarre tricks that break other things.  
Support for metacharacters like  is significantly more important.  


 shell escapes are incompatible with previous releases
 -

 Key: HADOOP-11002
 URL: https://issues.apache.org/jira/browse/HADOOP-11002
 Project: Hadoop Common
  Issue Type: Bug
  Components: scripts
Reporter: Allen Wittenauer
  Labels: regression

 Post-HADOOP-9902, the following in xyz_OPTS doesn't work without being 
 escaped:
 {code}
 -XX:HeapDumpPath=./java_pid_pid.hprof
 {code}
 This is a bit of surprising behavior to the users.  The breakage is directly 
 result of the code that fixes spaces in directories.  Since it is much more 
 likely to hit weird metacharacters in shell than have directories with 
 spaces, that part of HADOOP-9902 needs to get replaced.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

[jira] [Updated] (HADOOP-11002) shell escapes are incompatible with previous releases

2014-08-25 Thread Allen Wittenauer (JIRA)


 [ 
https://issues.apache.org/jira/browse/HADOOP-11002?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Allen Wittenauer updated HADOOP-11002:
--

Labels: regression  (was: )

 shell escapes are incompatible with previous releases
 -

 Key: HADOOP-11002
 URL: https://issues.apache.org/jira/browse/HADOOP-11002
 Project: Hadoop Common
  Issue Type: Bug
  Components: scripts
Reporter: Allen Wittenauer
  Labels: regression

 Post-HADOOP-9902, the following in xyz_OPTS doesn't work without being 
 escaped:
 {code}
 -XX:HeapDumpPath=./java_pid_pid.hprof
 {code}
 This is a bit of surprising behavior to the users.  The breakage is directly 
 result of the code that fixes spaces in directories.  Since it is much more 
 likely to hit weird metacharacters in shell than have directories with 
 spaces, that part of HADOOP-9902 needs to get replaced.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

[jira] [Updated] (HADOOP-9902) Shell script rewrite

2014-08-25 Thread Allen Wittenauer (JIRA)

[
https://issues.apache.org/jira/browse/HADOOP-9902?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Allen Wittenauer updated HADOOP-9902:
-

Release Note:
The Hadoop shell scripts have been rewritten to fix many long standing bugs and
include some new features. While an eye has been kept towards compatibility,
some changes may break existing installations.

INCOMPATIBLE CHANGES:

* The pid and out files for secure daemons have been renamed to include the
appropriate ${HADOOP_IDENT_STR}. This should allow, with proper configurations
in place, for multiple versions of the same secure daemon to run on a host.
Additionally, pid files are now created when daemons are run in interactive
mode. This will also prevent the accidental starting of two daemons with the
same configuration prior to launching java (i.e., fast fail without having to
wait for socket opening).
* All Hadoop shell script subsystems now execute hadoop-env.sh, which allows
for all of the environment variables to be in one location. This was not the
case previously.
* The default content of *-env.sh has been significantly alterated, with the
majority of defaults moved into more protected areas inside the code.
Additionally, these files do not auto-append anymore; setting a variable on the
command line prior to calling a shell command must contain the entire content,
not just any extra settings. This brings Hadoop more in-line with the vast
majority of other software packages.
* All HDFS_*, YARN_*, and MAPRED_* environment variables act as overrides to
their equivalent HADOOP_* environment variables when 'hdfs', 'yarn', 'mapred',
and related commands are executed. Previously, these were separated out which
meant a significant amount of duplication of common settings.
* hdfs-config.sh and hdfs-config.cmd were inadvertently duplicated into libexec
and sbin. The sbin versions have been removed.
* The log4j settings forcibly set by some *-daemon.sh commands have been
removed. These settings are now configurable in the *-env.sh files via *_OPT.
* Some formerly 'documented' entries in yarn-env.sh have been undocumented as a
simple form of deprecration in order to greatly simplify configuration and
reduce unnecessary duplication. They will still work, but those variables will
likely be removed in a future release.
* Support for various undocumentented YARN log4j.properties files has been
removed.
* Support for ${HADOOP_MASTER} and the related rsync code have been removed.
* The undocumented yarn.id.str has been removed.
* We now require bash v3 (released July 27, 2004) or better in order to take
advantage of better regex handling and ${BASH_SOURCE}. POSIX sh will not work.
* Support for --script has been removed. We now use ${HADOOP_*_PATH} or
${HADOOP_PREFIX} to find the necessary binaries. (See other note regarding
${HADOOP_PREFIX} auto discovery.)
* Non-existent classpaths, ld.so library paths, JNI library paths, etc, will be
ignored and stripped from their respective environment settings.

BUG FIXES:

* ${HADOOP_CONF_DIR} is now properly honored everywhere, without requiring
symlinking and other such tricks.
* ${HADOOP_CONF_DIR}/hadoop-layout.sh is now documented with a provided
hadoop-layout.sh.example file.
* Shell commands should now work properly when called as a relative path,
without ${HADOOP_PREFIX} being defined, and as the target of bash -x for
debugging. If ${HADOOP_PREFIX} is not set, it will be automatically determined
based upon the current location of the shell library. Note that other parts of
the extended Hadoop ecosystem may still require this environment variable to be
configured.
* Operations which trigger ssh will now limit the number of connections to run
in parallel to ${HADOOP_SSH_PARALLEL} to prevent memory and network exhaustion.
By default, this is set to 10.
* ${HADOOP_CLIENT_OPTS} support has been added to a few more commands.
* Some subcommands were not listed in the usage.
* Various options on hadoop command lines were supported inconsistently. These
have been unified into hadoop-config.sh. --config is still required to be
first, however.
* ulimit logging for secure daemons no longer assumes /bin/bash but does assume
bash is on the command line path.
* Removed references to some Yahoo! specific paths.
* Removed unused slaves.sh from YARN build tree.
* Many exit states have been changed to reflect reality.
* Shell level errors now go to STDERR. Before, many of them went incorrectly
to STDOUT.
* CDPATH with a period (.) should no longer break the scripts.
* The scripts no longer try to chown directories.

IMPROVEMENTS:

* The *.out files are now appended instead of overwritten to allow for external
log rotation.
* The style and layout of the scripts is much more consistent across
subprojects.
* More of the shell code is now commented.
* Significant amounts of redundant code have been moved

[jira] [Commented] (HADOOP-9902) Shell script rewrite

2014-08-25 Thread Allen Wittenauer (JIRA)


[ 
https://issues.apache.org/jira/browse/HADOOP-9902?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14110218#comment-14110218
 ] 

Allen Wittenauer commented on HADOOP-9902:
--

Given bug fix JIRAs are getting little-to-no traction in watchers, much less 
reviews, some advice on how to proceed on what are clearly bugs (HADOOP-10996, 
HADOOP-11002, likely more as they come in) would be appreciated...  


 Shell script rewrite
 

 Key: HADOOP-9902
 URL: https://issues.apache.org/jira/browse/HADOOP-9902
 Project: Hadoop Common
  Issue Type: Improvement
  Components: scripts
Affects Versions: 3.0.0
Reporter: Allen Wittenauer
Assignee: Allen Wittenauer
  Labels: releasenotes
 Fix For: 3.0.0

 Attachments: HADOOP-9902-10.patch, HADOOP-9902-11.patch, 
 HADOOP-9902-12.patch, HADOOP-9902-13-branch-2.patch, HADOOP-9902-13.patch, 
 HADOOP-9902-14.patch, HADOOP-9902-15.patch, HADOOP-9902-16.patch, 
 HADOOP-9902-2.patch, HADOOP-9902-3.patch, HADOOP-9902-4.patch, 
 HADOOP-9902-5.patch, HADOOP-9902-6.patch, HADOOP-9902-7.patch, 
 HADOOP-9902-8.patch, HADOOP-9902-9.patch, HADOOP-9902.patch, HADOOP-9902.txt, 
 hadoop-9902-1.patch, more-info.txt


 Umbrella JIRA for shell script rewrite.  See more-info.txt for more details.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

[jira] [Updated] (HADOOP-11002) shell escapes are incompatible with previous releases

2014-08-25 Thread Allen Wittenauer (JIRA)


 [ 
https://issues.apache.org/jira/browse/HADOOP-11002?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Allen Wittenauer updated HADOOP-11002:
--

Attachment: HADOOP-11002.patch

This removes support for spaces.

 shell escapes are incompatible with previous releases
 -

 Key: HADOOP-11002
 URL: https://issues.apache.org/jira/browse/HADOOP-11002
 Project: Hadoop Common
  Issue Type: Bug
  Components: scripts
Reporter: Allen Wittenauer
  Labels: regression
 Attachments: HADOOP-11002.patch


 Post-HADOOP-9902, the following in xyz_OPTS doesn't work without being 
 escaped:
 {code}
 -XX:HeapDumpPath=./java_pid_pid.hprof
 {code}
 This is a bit of surprising behavior to the users.  The breakage is directly 
 result of the code that fixes spaces in directories.  Since it is much more 
 likely to hit weird metacharacters in shell than have directories with 
 spaces, that part of HADOOP-9902 needs to get replaced.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

[jira] [Updated] (HADOOP-10996) [post-HADOOP-9902] Stop violence in the *_HOME

2014-08-25 Thread Allen Wittenauer (JIRA)


 [ 
https://issues.apache.org/jira/browse/HADOOP-10996?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Allen Wittenauer updated HADOOP-10996:
--

Attachment: HADOOP-10996-02.patch

-02:

Changed up the order. Reduces the amount of stat calls needed by checking if 
some of the *_HOME vars are defined.

I started to poke around at enabling *_HOME/etc/hadoop (or whatever), but 
decided it probably isn't worth it since it will likely lead to unpredictable 
results.

[~andrew.wang], please try this out and see if it fixes your specific issue. 
Thanks!

 [post-HADOOP-9902] Stop violence in the *_HOME
 --

 Key: HADOOP-10996
 URL: https://issues.apache.org/jira/browse/HADOOP-10996
 Project: Hadoop Common
  Issue Type: Improvement
  Components: scripts
Affects Versions: 3.0.0
Reporter: Allen Wittenauer
Assignee: Allen Wittenauer
 Attachments: HADOOP-10996-01.patch, HADOOP-10996-02.patch, 
 HADOOP-10996.patch


 (Updated from original description)
 There are various places where the various HOME directories are missing or 
 mis-defined. 



--
This message was sent by Atlassian JIRA
(v6.2#6252)

[jira] [Commented] (HADOOP-11002) shell escapes are incompatible with previous releases

2014-08-25 Thread Allen Wittenauer (JIRA)


[ 
https://issues.apache.org/jira/browse/HADOOP-11002?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14110255#comment-14110255
 ] 

Allen Wittenauer commented on HADOOP-11002:
---

(It's worth pointing out that directories with spaces are completely broken in 
branch-2.  HADOOP-9902 made quite a few things work, but without fixing this 
issue, not enough to be usable.)

 shell escapes are incompatible with previous releases
 -

 Key: HADOOP-11002
 URL: https://issues.apache.org/jira/browse/HADOOP-11002
 Project: Hadoop Common
  Issue Type: Bug
  Components: scripts
Reporter: Allen Wittenauer
  Labels: regression
 Attachments: HADOOP-11002.patch


 Post-HADOOP-9902, the following in xyz_OPTS doesn't work without being 
 escaped:
 {code}
 -XX:HeapDumpPath=./java_pid_pid.hprof
 {code}
 This is a bit of surprising behavior to the users.  The breakage is directly 
 result of the code that fixes spaces in directories.  Since it is much more 
 likely to hit weird metacharacters in shell than have directories with 
 spaces, that part of HADOOP-9902 needs to get replaced.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

[jira] [Resolved] (HADOOP-11002) shell escapes are incompatible with previous releases

2014-08-26 Thread Allen Wittenauer (JIRA)


 [ 
https://issues.apache.org/jira/browse/HADOOP-11002?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Allen Wittenauer resolved HADOOP-11002.
---

   Resolution: Fixed
Fix Version/s: 3.0.0

Thanks! I'll commit this as soon as git opens up!

 shell escapes are incompatible with previous releases
 -

 Key: HADOOP-11002
 URL: https://issues.apache.org/jira/browse/HADOOP-11002
 Project: Hadoop Common
  Issue Type: Bug
  Components: scripts
Reporter: Allen Wittenauer
  Labels: regression
 Fix For: 3.0.0

 Attachments: HADOOP-11002.patch


 Post-HADOOP-9902, the following in xyz_OPTS doesn't work without being 
 escaped:
 {code}
 -XX:HeapDumpPath=./java_pid_pid.hprof
 {code}
 This is a bit of surprising behavior to the users.  The breakage is directly 
 result of the code that fixes spaces in directories.  Since it is much more 
 likely to hit weird metacharacters in shell than have directories with 
 spaces, that part of HADOOP-9902 needs to get replaced.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

[jira] [Assigned] (HADOOP-11002) shell escapes are incompatible with previous releases

2014-08-26 Thread Allen Wittenauer (JIRA)


 [ 
https://issues.apache.org/jira/browse/HADOOP-11002?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Allen Wittenauer reassigned HADOOP-11002:
-

Assignee: Allen Wittenauer

 shell escapes are incompatible with previous releases
 -

 Key: HADOOP-11002
 URL: https://issues.apache.org/jira/browse/HADOOP-11002
 Project: Hadoop Common
  Issue Type: Bug
  Components: scripts
Reporter: Allen Wittenauer
Assignee: Allen Wittenauer
  Labels: regression
 Fix For: 3.0.0

 Attachments: HADOOP-11002.patch


 Post-HADOOP-9902, the following in xyz_OPTS doesn't work without being 
 escaped:
 {code}
 -XX:HeapDumpPath=./java_pid_pid.hprof
 {code}
 This is a bit of surprising behavior to the users.  The breakage is directly 
 result of the code that fixes spaces in directories.  Since it is much more 
 likely to hit weird metacharacters in shell than have directories with 
 spaces, that part of HADOOP-9902 needs to get replaced.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

[jira] [Commented] (HADOOP-10996) [post-HADOOP-9902] Stop violence in the *_HOME

2014-08-26 Thread Allen Wittenauer (JIRA)

[
https://issues.apache.org/jira/browse/HADOOP-10996?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14111776#comment-14111776
]

Allen Wittenauer commented on HADOOP-10996:
---

TL;DR: Absolute best bet is to put configs some place and assign
HADOOP_CONF_DIR to it so that you have absolute certainty on where Hadoop is
pulling settings.

Longer story:

Currently, if HADOOP_CONF_DIR isn't defined, it uses a bit of twisted logic to
locate it:

1. Figure out where HADOOP_PREFIX is at. Is HADOOP_PREFIX defined? If not, then
let's assume it's what's called us/...
2. Does HADOOP_PREFIX/conf/hadoop-env.sh exist? OK, then that must be
HADOOP_CONF_DIR
3. No? OK, then HADOOP_CONF_DIR must be HADOOP_PREFIX/etc/hadoop.

What's fun about this and what you're doing is that HADOOP_CONF_DIR will get
defined differently depending upon which bin dir you are using. :D

Fine, you say! Let's just treat all *_HOME/etc/hadoop and *_HOME/conf as
potentially valid. Now we have a very interesting problem: how do you define
HADOOP_CONF_DIR? Other stuff past Hadoop depends upon this being *one*
directory. We could pick the first one and then just shove the rest in the
classpath and none would be the wiser!

Aha! But they would. Which one takes precedence? What happens if there are
conflicts? etc, etc. It gets messy very very fast. So... ABORT! ABORT!

(BTW, this is pretty much the same logic from branch-2. It could be argued that
there should be a check to see if etc/hadoop is 'real' too and abort on it.
Here's the fun part: the shell code works perfectly fine if *-env.sh is empty
now... the NN will still crash though. That said, if HADOOP-10879 gets
finished, this will almost certainly need to get revisited. Probably better to
look for core-site.xml, honestly, since all of the sub-projects all depend upon
that. In other words, we could run through all of the *_HOME, HADOOP_PREFIX,
etc, and use the first core-site.xml we find as the 'real' HADOOP_CONF_DIR.)

[post-HADOOP-9902] Stop violence in the *_HOME
--

Key: HADOOP-10996
URL: https://issues.apache.org/jira/browse/HADOOP-10996
Project: Hadoop Common
Issue Type: Improvement
Components: scripts
Affects Versions: 3.0.0
Reporter: Allen Wittenauer
Assignee: Allen Wittenauer
Attachments: HADOOP-10996-01.patch, HADOOP-10996-02.patch,
HADOOP-10996.patch

(Updated from original description)
There are various places where the various HOME directories are missing or
mis-defined.

--
This message was sent by Atlassian JIRA
(v6.2#6252)

[jira] [Comment Edited] (HADOOP-10996) [post-HADOOP-9902] Stop violence in the *_HOME

2014-08-26 Thread Allen Wittenauer (JIRA)

[
https://issues.apache.org/jira/browse/HADOOP-10996?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14111776#comment-14111776
]

Allen Wittenauer edited comment on HADOOP-10996 at 8/27/14 3:32 AM:

TL;DR: Absolute best bet is to put configs some place and assign
HADOOP_CONF_DIR to it so that you have absolute certainty on where Hadoop is
pulling settings.

Longer story:

Currently, if HADOOP_CONF_DIR isn't defined, it uses a bit of interesting logic
to locate it:

What's fun about this and what you're doing is that HADOOP_CONF_DIR will get
defined differently depending upon which bin dir you are using. :D

Fine, you say! Let's just treat all _HOME/etc/hadoop and _HOME/conf as
potentially valid. Now we have a very interesting problem: how do you define
HADOOP_CONF_DIR? Other stuff past Hadoop depends upon this being one
directory. We could pick the first one and then just shove the rest in the
classpath and none would be the wiser!

Aha! But they would. Which one takes precedence? What happens if there are
conflicts? etc, etc. It gets messy very very fast. So... ABORT! ABORT!

(BTW, this is pretty much the same logic from branch-2. It could be argued that
there should be a check to see if etc/hadoop is 'real' too and abort on it.
Here's the fun part: the shell code works perfectly fine if -env.sh is empty
now... the NN will still crash though. That said, if HADOOP-10879 gets
finished, this will almost certainly need to get revisited. Probably better to
look for core-site.xml, honestly, since all of the sub-projects all depend upon
that. In other words, we could run through all of the _HOME, HADOOP_PREFIX,
etc, and use the first core-site.xml we find as the 'real' HADOOP_CONF_DIR.)

was (Author: aw):
TL;DR: Absolute best bet is to put configs some place and assign
HADOOP_CONF_DIR to it so that you have absolute certainty on where Hadoop is
pulling settings.

Longer story:

Currently, if HADOOP_CONF_DIR isn't defined, it uses a bit of twisted logic to
locate it:

What's fun about this and what you're doing is that HADOOP_CONF_DIR will get
defined differently depending upon which bin dir you are using. :D

Aha! But they would. Which one takes precedence? What happens if there are
conflicts? etc, etc. It gets messy very very fast. So... ABORT! ABORT!

[post-HADOOP-9902] Stop violence in the *_HOME
--

(Updated from original description)
There are various places where the various HOME directories are missing or
mis-defined.

--
This message was sent by Atlassian JIRA
(v6.2#6252)

[jira] [Resolved] (HADOOP-10996) [post-HADOOP-9902] Stop violence in the *_HOME

2014-08-26 Thread Allen Wittenauer (JIRA)


 [ 
https://issues.apache.org/jira/browse/HADOOP-10996?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Allen Wittenauer resolved HADOOP-10996.
---

   Resolution: Fixed
Fix Version/s: 3.0.0

Thanks! I'll commit this as soon as the git repo opens up!

 [post-HADOOP-9902] Stop violence in the *_HOME
 --

 Key: HADOOP-10996
 URL: https://issues.apache.org/jira/browse/HADOOP-10996
 Project: Hadoop Common
  Issue Type: Improvement
  Components: scripts
Affects Versions: 3.0.0
Reporter: Allen Wittenauer
Assignee: Allen Wittenauer
 Fix For: 3.0.0

 Attachments: HADOOP-10996-01.patch, HADOOP-10996-02.patch, 
 HADOOP-10996.patch


 (Updated from original description)
 There are various places where the various HOME directories are missing or 
 mis-defined. 



--
This message was sent by Atlassian JIRA
(v6.2#6252)

< 2 3 4 5 6 7 8 9 10 11 >

601 - 700 of 7059 matches

Mail list logo