[jira] Subscription: Oozie Patch Available

2019-01-02 Thread jira
Issue Subscription
Filter: Oozie Patch Available (96 issues)

Subscriber: ooziedaily

Key Summary
OOZIE-3407  Cleanup TestPurgeXCommand
https://issues.apache.org/jira/browse/OOZIE-3407
OOZIE-3404  The env variable of SPARK_HOME needs to be set when running pySpark
https://issues.apache.org/jira/browse/OOZIE-3404
OOZIE-3401  TestPySpark failure
https://issues.apache.org/jira/browse/OOZIE-3401
OOZIE-3375  Can't use empty  in coordinator
https://issues.apache.org/jira/browse/OOZIE-3375
OOZIE-3367  Using && in EL expressions in oozie bundle.xml files generates 
parse errors
https://issues.apache.org/jira/browse/OOZIE-3367
OOZIE-3366  Update workflow status and subworkflow status on suspend command
https://issues.apache.org/jira/browse/OOZIE-3366
OOZIE-3364  Rerunning Oozie bundle jobs starts the coordinators in 
indeterminate order
https://issues.apache.org/jira/browse/OOZIE-3364
OOZIE-3362  When killed, SSH action should kill the spawned processes on target 
host
https://issues.apache.org/jira/browse/OOZIE-3362
OOZIE-3326  Sqoop Action should support tez delegation tokens for hive-import
https://issues.apache.org/jira/browse/OOZIE-3326
OOZIE-3320  Oozie ShellAction should support absolute bash file path
https://issues.apache.org/jira/browse/OOZIE-3320
OOZIE-3319  Log SSH action callback error output
https://issues.apache.org/jira/browse/OOZIE-3319
OOZIE-3301  Update NOTICE file
https://issues.apache.org/jira/browse/OOZIE-3301
OOZIE-3274  Remove slf4j
https://issues.apache.org/jira/browse/OOZIE-3274
OOZIE-3266  Coord action rerun support RERUN_SKIP_NODES option
https://issues.apache.org/jira/browse/OOZIE-3266
OOZIE-3265  properties RERUN_FAIL_NODES and RERUN_SKIP_NODES should be able to 
appear together
https://issues.apache.org/jira/browse/OOZIE-3265
OOZIE-3256  refactor OozieCLI class
https://issues.apache.org/jira/browse/OOZIE-3256
OOZIE-3249  [tools] Instrumentation log parser
https://issues.apache.org/jira/browse/OOZIE-3249
OOZIE-3218  Oozie Sqoop action with command splits the select clause into 
multiple parts due to delimiter being space
https://issues.apache.org/jira/browse/OOZIE-3218
OOZIE-3199  Let system property restriction configurable
https://issues.apache.org/jira/browse/OOZIE-3199
OOZIE-3196  Authorization: restrict world readability by user
https://issues.apache.org/jira/browse/OOZIE-3196
OOZIE-3194  Oozie should set proper permissions to sharelib after upload
https://issues.apache.org/jira/browse/OOZIE-3194
OOZIE-3186  Oozie is unable to use configuration linked using jceks://file/...
https://issues.apache.org/jira/browse/OOZIE-3186
OOZIE-3179  Adding a configurable config-default.xml location to a workflow
https://issues.apache.org/jira/browse/OOZIE-3179
OOZIE-3170  Oozie Diagnostic Bundle tool fails with NPE due to missing service 
class
https://issues.apache.org/jira/browse/OOZIE-3170
OOZIE-3137  Add support for log4j2 in HiveMain
https://issues.apache.org/jira/browse/OOZIE-3137
OOZIE-3135  Configure log4j2 in SqoopMain
https://issues.apache.org/jira/browse/OOZIE-3135
OOZIE-3091  Oozie Sqoop Avro Import fails with "java.lang.NoClassDefFoundError: 
org/apache/avro/mapred/AvroWrapper"
https://issues.apache.org/jira/browse/OOZIE-3091
OOZIE-3071  Oozie 4.3 Spark sharelib ueses a different version of commons-lang3 
than Spark 2.2.0
https://issues.apache.org/jira/browse/OOZIE-3071
OOZIE-3063  Sanitizing variables that are part of openjpa.ConnectionProperties
https://issues.apache.org/jira/browse/OOZIE-3063
OOZIE-3062  Set HADOOP_CONF_DIR for spark action
https://issues.apache.org/jira/browse/OOZIE-3062
OOZIE-2952  Fix Findbugs warnings in oozie-sharelib-oozie
https://issues.apache.org/jira/browse/OOZIE-2952
OOZIE-2949  Escape quotes whitespaces in Sqoop  field
https://issues.apache.org/jira/browse/OOZIE-2949
OOZIE-2927  Append new line character for Hive2 query using query tag
https://issues.apache.org/jira/browse/OOZIE-2927
OOZIE-2834  ParameterVerifier logging non-useful warning for workflow definition
https://issues.apache.org/jira/browse/OOZIE-2834
OOZIE-2833  when using uber mode the regex pattern used in the 
extractHeapSizeMB method does not allow heap sizes specified in bytes.
https://issues.apache.org/jira/browse/OOZIE-2833
OOZIE-2812  SparkConfigurationService should support loading configurations 
from multiple Spark versions
https://issues.apache.org/jira/browse/OOZIE-2812
OOZIE-2795  Create lib directory or symlink for Oozie CLI during packaging
https://issues.apache.org/jira/browse/OOZIE-2795
OOZIE-2784  Include WEEK as a parameter in 

[jira] [Commented] (OOZIE-3407) Cleanup TestPurgeXCommand

2019-01-02 Thread Andras Salamon (JIRA)


[ 
https://issues.apache.org/jira/browse/OOZIE-3407?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16732171#comment-16732171
 ] 

Andras Salamon commented on OOZIE-3407:
---

The tests were already flaky: https://issues.apache.org/jira/browse/OOZIE-3252 
this problem was not introduced with this patch.

 

> Cleanup TestPurgeXCommand
> -
>
> Key: OOZIE-3407
> URL: https://issues.apache.org/jira/browse/OOZIE-3407
> Project: Oozie
>  Issue Type: Sub-task
>Affects Versions: trunk
>Reporter: Andras Salamon
>Assignee: Andras Salamon
>Priority: Major
> Fix For: 5.2.0
>
> Attachments: OOZIE-3407-01.patch, OOZIE-3407-02.patch
>
>
> [TestPurgeXCommand|https://github.com/apache/oozie/blob/master/core/src/test/java/org/apache/oozie/command/TestPurgeXCommand.java]
>  is the longest java source file of Oozie, more than 4000 rows. 
> As part of the Purge Service rewrite we need to use this file to make sure we 
> don't introduce new bugs and probably we also need to add a few more test 
> cases. Before that it would be useful to cleanup this file and extract 
> redundant code to new methods to make the code shorter and more manageable.
> The patch uploaded to OOZIE-3400 introduced three new test cases to this file 
> and shows a simple way to make it shorter.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (OOZIE-3407) Cleanup TestPurgeXCommand

2019-01-02 Thread Hadoop QA (JIRA)


[ 
https://issues.apache.org/jira/browse/OOZIE-3407?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16732161#comment-16732161
 ] 

Hadoop QA commented on OOZIE-3407:
--


Testing JIRA OOZIE-3407

Cleaning local git workspace



{color:green}+1 PATCH_APPLIES{color}
{color:green}+1 CLEAN{color}
{color:green}+1 RAW_PATCH_ANALYSIS{color}
.{color:green}+1{color} the patch does not introduce any @author tags
.{color:green}+1{color} the patch does not introduce any tabs
.{color:green}+1{color} the patch does not introduce any trailing spaces
.{color:green}+1{color} the patch does not introduce any star imports
.{color:green}+1{color} the patch does not introduce any line longer than 
132
.{color:green}+1{color} the patch adds/modifies 1 testcase(s)
{color:green}+1 RAT{color}
.{color:green}+1{color} the patch does not seem to introduce new RAT 
warnings
{color:green}+1 JAVADOC{color}
.{color:green}+1{color} Javadoc generation succeeded with the patch
.{color:green}+1{color} the patch does not seem to introduce new Javadoc 
warning(s)
.{color:orange}WARNING{color}: the current HEAD has 100 Javadoc warning(s)
{color:green}+1 COMPILE{color}
.{color:green}+1{color} HEAD compiles
.{color:green}+1{color} patch compiles
.{color:green}+1{color} the patch does not seem to introduce new javac 
warnings
{color:green}+1{color} There are no new bugs found in total.
. {color:green}+1{color} There are no new bugs found in [sharelib/hive2].
. {color:green}+1{color} There are no new bugs found in [sharelib/spark].
. {color:green}+1{color} There are no new bugs found in [sharelib/oozie].
. {color:green}+1{color} There are no new bugs found in [sharelib/pig].
. {color:green}+1{color} There are no new bugs found in [sharelib/streaming].
. {color:green}+1{color} There are no new bugs found in [sharelib/hive].
. {color:green}+1{color} There are no new bugs found in [sharelib/distcp].
. {color:green}+1{color} There are no new bugs found in [sharelib/hcatalog].
. {color:green}+1{color} There are no new bugs found in [sharelib/sqoop].
. {color:green}+1{color} There are no new bugs found in [sharelib/git].
. {color:green}+1{color} There are no new bugs found in [client].
. {color:green}+1{color} There are no new bugs found in [docs].
. {color:green}+1{color} There are no new bugs found in [tools].
. {color:green}+1{color} There are no new bugs found in 
[fluent-job/fluent-job-api].
. {color:green}+1{color} There are no new bugs found in [server].
. {color:green}+1{color} There are no new bugs found in [webapp].
. {color:green}+1{color} There are no new bugs found in [examples].
. {color:green}+1{color} There are no new bugs found in [core].
{color:green}+1 BACKWARDS_COMPATIBILITY{color}
.{color:green}+1{color} the patch does not change any JPA 
Entity/Colum/Basic/Lob/Transient annotations
.{color:green}+1{color} the patch does not modify JPA files
{color:green}+1 TESTS{color}
.Tests run: 3106
.{color:orange}Tests failed at first run:{color}
TestPurgeXCommand#testPurgeBundleWithCoordChildWithWFChildWithSubWF1
TestPurgeXCommand#testPurgeBundleWithCoordChildWithWFChildWithSubWF2
TestPurgeXCommand#testPurgeCoordWithWFChild2
.For the complete list of flaky tests, see TEST-SUMMARY-FULL files.
{color:green}+1 DISTRO{color}
.{color:green}+1{color} distro tarball builds with the patch 


{color:green}*+1 Overall result, good!, no -1s*{color}

{color:red}. There is at least one warning, please check{color}

The full output of the test-patch run is available at

. https://builds.apache.org/job/PreCommit-OOZIE-Build/956/



> Cleanup TestPurgeXCommand
> -
>
> Key: OOZIE-3407
> URL: https://issues.apache.org/jira/browse/OOZIE-3407
> Project: Oozie
>  Issue Type: Sub-task
>Affects Versions: trunk
>Reporter: Andras Salamon
>Assignee: Andras Salamon
>Priority: Major
> Fix For: 5.2.0
>
> Attachments: OOZIE-3407-01.patch, OOZIE-3407-02.patch
>
>
> [TestPurgeXCommand|https://github.com/apache/oozie/blob/master/core/src/test/java/org/apache/oozie/command/TestPurgeXCommand.java]
>  is the longest java source file of Oozie, more than 4000 rows. 
> As part of the Purge Service rewrite we need to use this file to make sure we 
> don't introduce new bugs and probably we also need to add a few more test 
> cases. Before that it would be useful to cleanup this file and extract 
> redundant code to new methods to make the code shorter and more manageable.
> The patch uploaded to OOZIE-3400 introduced three new test cases to this file 
> and shows a simple way to make it shorter.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


Failed: OOZIE-3407 PreCommit Build #956

2019-01-02 Thread Apache Jenkins Server
Jira: https://issues.apache.org/jira/browse/OOZIE-3407
Build: https://builds.apache.org/job/PreCommit-OOZIE-Build/956/

###
## LAST 100 LINES OF THE CONSOLE 
###
[...truncated 1.92 MB...]
[TRACE] New XMLLib present, calling 'xmllint --xpath' to get bug instance counts
[DEBUG] There are no new bugs found in [server].
[TRACE] New XMLLib present, calling 'xmllint --xpath' to get bug instance counts
[DEBUG] There are no new bugs found in [webapp].
[TRACE] New XMLLib present, calling 'xmllint --xpath' to get bug instance counts
[DEBUG] There are no new bugs found in [examples].
[TRACE] New XMLLib present, calling 'xmllint --xpath' to get bug instance counts
[DEBUG] There are no new bugs found in [core].
[INFO] There are no new bugs found totally].
[TRACE] FindBugs diffs checked and reports created
[TRACE] Summary file size is 2560 bytes
[TRACE] Full summary file size is 1471 bytes
[TRACE] File 
[/home/jenkins/jenkins-slave/workspace/PreCommit-OOZIE-Build/test-patch/tmp/FINDBUGS_DIFF/diff/findbugs-diff-0.1.0-all.jar]
 removed
[TRACE] File 
[/home/jenkins/jenkins-slave/workspace/PreCommit-OOZIE-Build/test-patch/tmp/FINDBUGS_DIFF/diff/findbugs-diff-0.1.0-all.jar.md5sum]
 removed
  Running test-patch task BACKWARDS_COMPATIBILITY
  Running test-patch task TESTS
  Running test-patch task DISTRO


Testing JIRA OOZIE-3407

Cleaning local git workspace



+1 PATCH_APPLIES
+1 CLEAN
+1 RAW_PATCH_ANALYSIS
+1 the patch does not introduce any @author tags
+1 the patch does not introduce any tabs
+1 the patch does not introduce any trailing spaces
+1 the patch does not introduce any star imports
+1 the patch does not introduce any line longer than 132
+1 the patch adds/modifies 1 testcase(s)
+1 RAT
+1 the patch does not seem to introduce new RAT warnings
+1 JAVADOC
+1 Javadoc generation succeeded with the patch
+1 the patch does not seem to introduce new Javadoc warning(s)
WARNING: the current HEAD has 100 Javadoc warning(s)
+1 COMPILE
+1 HEAD compiles
+1 patch compiles
+1 the patch does not seem to introduce new javac warnings
+1 There are no new bugs found in total.
 +1 There are no new bugs found in [sharelib/hive2].
 +1 There are no new bugs found in [sharelib/spark].
 +1 There are no new bugs found in [sharelib/oozie].
 +1 There are no new bugs found in [sharelib/pig].
 +1 There are no new bugs found in [sharelib/streaming].
 +1 There are no new bugs found in [sharelib/hive].
 +1 There are no new bugs found in [sharelib/distcp].
 +1 There are no new bugs found in [sharelib/hcatalog].
 +1 There are no new bugs found in [sharelib/sqoop].
 +1 There are no new bugs found in [sharelib/git].
 +1 There are no new bugs found in [client].
 +1 There are no new bugs found in [docs].
 +1 There are no new bugs found in [tools].
 +1 There are no new bugs found in [fluent-job/fluent-job-api].
 +1 There are no new bugs found in [server].
 +1 There are no new bugs found in [webapp].
 +1 There are no new bugs found in [examples].
 +1 There are no new bugs found in [core].
+1 BACKWARDS_COMPATIBILITY
+1 the patch does not change any JPA Entity/Colum/Basic/Lob/Transient 
annotations
+1 the patch does not modify JPA files
+1 TESTS
Tests run: 3106
Tests failed at first run:
TestPurgeXCommand#testPurgeBundleWithCoordChildWithWFChildWithSubWF1
TestPurgeXCommand#testPurgeBundleWithCoordChildWithWFChildWithSubWF2
TestPurgeXCommand#testPurgeCoordWithWFChild2
For the complete list of flaky tests, see TEST-SUMMARY-FULL files.
+1 DISTRO
+1 distro tarball builds with the patch 


+1 Overall result, good!, no -1s

 There is at least one warning, please check

The full output of the test-patch run is available at

 https://builds.apache.org/job/PreCommit-OOZIE-Build/956/

Adding comment to JIRA
  % Total% Received % Xferd  Average Speed   TimeTime Time  Current
 Dload  Upload   Total   SpentLeft  Speed
  0 00 00 0  0  0 --:--:-- --:--:-- --:--:-- 
0100  35770 0  100  3577  0   4441 --:--:-- --:--:-- --:--:--  
4437{"self":"https://issues.apache.org/jira/rest/api/2/issue/13205346/comment/16732161","id":"16732161","author":{"self":"https://issues.apache.org/jira/rest/api/2/user?username=hadoopqa","name":"hadoopqa","key":"hadoopqa","avatarUrls":{"48x48":"https://issues.apache.org/jira/secure/useravatar?ownerId=hadoopqa=10393","24x24":"https://issues.apache.org/jira/secure/useravatar?size=small=hadoopqa=10393","16x16":"https://issues.apache.org/jira/secure/useravatar?size=xsmall=hadoopqa=10393","32x32":"https://issues.apache.org/jira/secure/useravatar?size=medium=hadoopqa=10393"},"displayName":"Hadoop
 QA","active":true,"timeZone":"Etc/UTC"},"body":"\nTesting JIRA 
OOZIE-3407\n\nCleaning local git 

[jira] [Commented] (OOZIE-3409) Oozie Server : Possible memory leak in EL evaluation

2019-01-02 Thread Andras Salamon (JIRA)


[ 
https://issues.apache.org/jira/browse/OOZIE-3409?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16732123#comment-16732123
 ] 

Andras Salamon commented on OOZIE-3409:
---

We could create a hack solution like:
 # Extending {{ExpressionEvaluatorImpl}} and not using the cache
 # Counting EL expression evaluations and drop {{ExpressionEvaluatorImpl}} and 
start over after a while

The best solution is to replace the obsolete {{commons.el}}. I've checked our 
code, it seems to be this is the only place we use {{commons.el}} in Oozie. 
There are seveal EL evaluators listed here: 
[https://stackoverflow.com/a/17051440/21348] We might check JEXL ( 
[http://commons.apache.org/proper/commons-jexl/] )  since we already use that 
in other parts of Oozie. Of course we have to be sure it does not have the same 
bug.

> Oozie Server : Possible memory leak in EL evaluation
> 
>
> Key: OOZIE-3409
> URL: https://issues.apache.org/jira/browse/OOZIE-3409
> Project: Oozie
>  Issue Type: Bug
>Affects Versions: 5.1.0
>Reporter: Praneeth
>Assignee: Andras Salamon
>Priority: Major
>
> I have noticed that there is a potential memory leak in the Oozie server due 
> to one of its dependencies - {{commons-el:commons-el:1.0}}. Our oozie servers 
> have been running out of memory occasionally. It happens more often when we 
> have more activity. Upon analysis of the heap dump, it seems that the culprit 
> here is the {{org.apache.commons.el.ExpressionEvaluatorImpl}} from 
> {{commons-el}} which is used in the Oozie server through 
> {{org.apache.oozie.util.ELEvaluator}} [1] . {{ExpressionEvaluatorImpl}} uses 
> a static {{Map}} [2] to cache the EL evaluations. This map never gets 
> cleared, eventually taking up all the memory. So, in an environment with 
> heavy usage and long running oozie servers, this would lead to an OOM. The 
> configurations for workflow actions we have are possibly larger than normal 
> which is worsening the situation as the Map has to maintain larger strings.
>  # 
> [https://github.com/apache/oozie/blob/release-5.1.0/core/src/main/java/org/apache/oozie/util/ELEvaluator.java#L143]
>  # 
> [http://svn.apache.org/viewvc/commons/dormant/el/trunk/src/java/org/apache/commons/el/ExpressionEvaluatorImpl.java?view=markup#l94]



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Issue Comment Deleted] (OOZIE-2949) Escape quotes whitespaces in Sqoop field

2019-01-02 Thread Andras Salamon (JIRA)


 [ 
https://issues.apache.org/jira/browse/OOZIE-2949?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andras Salamon updated OOZIE-2949:
--
Comment: was deleted

(was: We could create a hack solution like:
 # Extending {{ExpressionEvaluatorImpl}} and not using the cache
 # Counting EL expression evaluations and drop {{ExpressionEvaluatorImpl}} and 
start over after a while

The best solution (just like [~gezapeti] suggested) is to replace commons el. 
I've checked our code, it seems to be this is the only place we use 
{{commons.el}} in Oozie. There are seveal EL evaluators listed here: 
[https://stackoverflow.com/a/17051440/21348] We might check JEXL ( 
[http://commons.apache.org/proper/commons-jexl/] )  since we already use that 
in other parts of Oozie.

 

 )

> Escape quotes whitespaces in Sqoop  field
> --
>
> Key: OOZIE-2949
> URL: https://issues.apache.org/jira/browse/OOZIE-2949
> Project: Oozie
>  Issue Type: Bug
>Affects Versions: 4.3.0
>Reporter: Peter Cseh
>Assignee: Andras Salamon
>Priority: Major
> Fix For: 5.2.0
>
> Attachments: OOZIE-2949-0.patch, OOZIE-2949-01.patch
>
>
> The current behavior of the Sqoop action is:
> {noformat}
> The Sqoop command can be specified either using the command element or 
> multiple arg elements.
> When using the command element, Oozie will split the command on every space 
> into multiple arguments.
> When using the arg elements, Oozie will pass each argument value as an 
> argument to Sqoop.
> {noformat}
> This prevents the user to simply copy-paste the command worked in the shell 
> into the workflow.xml.
> We should split the  field by taking quotes into account, similar to 
> what OOZIE-2391
> did for the Spark action's  field.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (OOZIE-2949) Escape quotes whitespaces in Sqoop field

2019-01-02 Thread Andras Salamon (JIRA)


[ 
https://issues.apache.org/jira/browse/OOZIE-2949?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16732113#comment-16732113
 ] 

Andras Salamon commented on OOZIE-2949:
---

We could create a hack solution like:
 # Extending {{ExpressionEvaluatorImpl}} and not using the cache
 # Counting EL expression evaluations and drop {{ExpressionEvaluatorImpl}} and 
start over after a while

The best solution (just like [~gezapeti] suggested) is to replace commons el. 
I've checked our code, it seems to be this is the only place we use 
{{commons.el}} in Oozie. There are seveal EL evaluators listed here: 
[https://stackoverflow.com/a/17051440/21348] We might check JEXL ( 
[http://commons.apache.org/proper/commons-jexl/] )  since we already use that 
in other parts of Oozie.

 

 

> Escape quotes whitespaces in Sqoop  field
> --
>
> Key: OOZIE-2949
> URL: https://issues.apache.org/jira/browse/OOZIE-2949
> Project: Oozie
>  Issue Type: Bug
>Affects Versions: 4.3.0
>Reporter: Peter Cseh
>Assignee: Andras Salamon
>Priority: Major
> Fix For: 5.2.0
>
> Attachments: OOZIE-2949-0.patch, OOZIE-2949-01.patch
>
>
> The current behavior of the Sqoop action is:
> {noformat}
> The Sqoop command can be specified either using the command element or 
> multiple arg elements.
> When using the command element, Oozie will split the command on every space 
> into multiple arguments.
> When using the arg elements, Oozie will pass each argument value as an 
> argument to Sqoop.
> {noformat}
> This prevents the user to simply copy-paste the command worked in the shell 
> into the workflow.xml.
> We should split the  field by taking quotes into account, similar to 
> what OOZIE-2391
> did for the Spark action's  field.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Comment Edited] (OOZIE-2949) Escape quotes whitespaces in Sqoop field

2019-01-02 Thread Peter Cseh (JIRA)


[ 
https://issues.apache.org/jira/browse/OOZIE-2949?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16732068#comment-16732068
 ] 

Peter Cseh edited comment on OOZIE-2949 at 1/2/19 2:06 PM:
---

The best solution would be some third party lib that handles this for us and is 
well-tested, developed. I could not find anything except for Java code 
snippets. (There are some 10+ year old things out there though)

It looks like we'll have to implement our own solution here unfortunately. 
I think something similar to [this 
gist|https://gist.github.com/raymyers/8077031] should be a good enough to start 
with. It handles embedding " and ' and escaping with "\"


was (Author: gezapeti):
The best solution would be some third party lib that handles this for us and is 
well-tested, developed. I could not find anything except for Java code 
snippets. (There are some 10+ year old things out there though)

It looks like we'll have to implement our own solution here unfortunately. 

> Escape quotes whitespaces in Sqoop  field
> --
>
> Key: OOZIE-2949
> URL: https://issues.apache.org/jira/browse/OOZIE-2949
> Project: Oozie
>  Issue Type: Bug
>Affects Versions: 4.3.0
>Reporter: Peter Cseh
>Assignee: Andras Salamon
>Priority: Major
> Fix For: 5.2.0
>
> Attachments: OOZIE-2949-0.patch, OOZIE-2949-01.patch
>
>
> The current behavior of the Sqoop action is:
> {noformat}
> The Sqoop command can be specified either using the command element or 
> multiple arg elements.
> When using the command element, Oozie will split the command on every space 
> into multiple arguments.
> When using the arg elements, Oozie will pass each argument value as an 
> argument to Sqoop.
> {noformat}
> This prevents the user to simply copy-paste the command worked in the shell 
> into the workflow.xml.
> We should split the  field by taking quotes into account, similar to 
> what OOZIE-2391
> did for the Spark action's  field.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (OOZIE-2949) Escape quotes whitespaces in Sqoop field

2019-01-02 Thread Peter Cseh (JIRA)


[ 
https://issues.apache.org/jira/browse/OOZIE-2949?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16732068#comment-16732068
 ] 

Peter Cseh commented on OOZIE-2949:
---

The best solution would be some third party lib that handles this for us and is 
well-tested, developed. I could not find anything except for Java code 
snippets. (There are some 10+ year old things out there though)

It looks like we'll have to implement our own solution here unfortunately. 

> Escape quotes whitespaces in Sqoop  field
> --
>
> Key: OOZIE-2949
> URL: https://issues.apache.org/jira/browse/OOZIE-2949
> Project: Oozie
>  Issue Type: Bug
>Affects Versions: 4.3.0
>Reporter: Peter Cseh
>Assignee: Andras Salamon
>Priority: Major
> Fix For: 5.2.0
>
> Attachments: OOZIE-2949-0.patch, OOZIE-2949-01.patch
>
>
> The current behavior of the Sqoop action is:
> {noformat}
> The Sqoop command can be specified either using the command element or 
> multiple arg elements.
> When using the command element, Oozie will split the command on every space 
> into multiple arguments.
> When using the arg elements, Oozie will pass each argument value as an 
> argument to Sqoop.
> {noformat}
> This prevents the user to simply copy-paste the command worked in the shell 
> into the workflow.xml.
> We should split the  field by taking quotes into account, similar to 
> what OOZIE-2391
> did for the Spark action's  field.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (OOZIE-3407) Cleanup TestPurgeXCommand

2019-01-02 Thread Andras Salamon (JIRA)


[ 
https://issues.apache.org/jira/browse/OOZIE-3407?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16732065#comment-16732065
 ] 

Andras Salamon commented on OOZIE-3407:
---

I cannot open a ReviewBoard request: 
https://issues.apache.org/jira/browse/INFRA-17525


> Cleanup TestPurgeXCommand
> -
>
> Key: OOZIE-3407
> URL: https://issues.apache.org/jira/browse/OOZIE-3407
> Project: Oozie
>  Issue Type: Sub-task
>Affects Versions: trunk
>Reporter: Andras Salamon
>Assignee: Andras Salamon
>Priority: Major
> Fix For: 5.2.0
>
> Attachments: OOZIE-3407-01.patch, OOZIE-3407-02.patch
>
>
> [TestPurgeXCommand|https://github.com/apache/oozie/blob/master/core/src/test/java/org/apache/oozie/command/TestPurgeXCommand.java]
>  is the longest java source file of Oozie, more than 4000 rows. 
> As part of the Purge Service rewrite we need to use this file to make sure we 
> don't introduce new bugs and probably we also need to add a few more test 
> cases. Before that it would be useful to cleanup this file and extract 
> redundant code to new methods to make the code shorter and more manageable.
> The patch uploaded to OOZIE-3400 introduced three new test cases to this file 
> and shows a simple way to make it shorter.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (OOZIE-3407) Cleanup TestPurgeXCommand

2019-01-02 Thread Andras Salamon (JIRA)


[ 
https://issues.apache.org/jira/browse/OOZIE-3407?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16732060#comment-16732060
 ] 

Andras Salamon commented on OOZIE-3407:
---

Reuploaded the patch using {{--full-index}} to make the ReviewBoard happy. 

> Cleanup TestPurgeXCommand
> -
>
> Key: OOZIE-3407
> URL: https://issues.apache.org/jira/browse/OOZIE-3407
> Project: Oozie
>  Issue Type: Sub-task
>Affects Versions: trunk
>Reporter: Andras Salamon
>Assignee: Andras Salamon
>Priority: Major
> Fix For: 5.2.0
>
> Attachments: OOZIE-3407-01.patch, OOZIE-3407-02.patch
>
>
> [TestPurgeXCommand|https://github.com/apache/oozie/blob/master/core/src/test/java/org/apache/oozie/command/TestPurgeXCommand.java]
>  is the longest java source file of Oozie, more than 4000 rows. 
> As part of the Purge Service rewrite we need to use this file to make sure we 
> don't introduce new bugs and probably we also need to add a few more test 
> cases. Before that it would be useful to cleanup this file and extract 
> redundant code to new methods to make the code shorter and more manageable.
> The patch uploaded to OOZIE-3400 introduced three new test cases to this file 
> and shows a simple way to make it shorter.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (OOZIE-3404) The env variable of SPARK_HOME needs to be set when running pySpark

2019-01-02 Thread Hadoop QA (JIRA)


[ 
https://issues.apache.org/jira/browse/OOZIE-3404?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16732031#comment-16732031
 ] 

Hadoop QA commented on OOZIE-3404:
--


Testing JIRA OOZIE-3404

Cleaning local git workspace



{color:green}+1 PATCH_APPLIES{color}
{color:green}+1 CLEAN{color}
{color:red}-1 RAW_PATCH_ANALYSIS{color}
.{color:green}+1{color} the patch does not introduce any @author tags
.{color:green}+1{color} the patch does not introduce any tabs
.{color:green}+1{color} the patch does not introduce any trailing spaces
.{color:green}+1{color} the patch does not introduce any star imports
.{color:green}+1{color} the patch does not introduce any line longer than 
132
.{color:red}-1{color} the patch does not add/modify any testcase
{color:green}+1 RAT{color}
.{color:green}+1{color} the patch does not seem to introduce new RAT 
warnings
{color:green}+1 JAVADOC{color}
.{color:green}+1{color} Javadoc generation succeeded with the patch
.{color:green}+1{color} the patch does not seem to introduce new Javadoc 
warning(s)
.{color:orange}WARNING{color}: the current HEAD has 100 Javadoc warning(s)
{color:green}+1 COMPILE{color}
.{color:green}+1{color} HEAD compiles
.{color:green}+1{color} patch compiles
.{color:green}+1{color} the patch does not seem to introduce new javac 
warnings
{color:green}+1{color} There are no new bugs found in total.
. {color:green}+1{color} There are no new bugs found in [sharelib/hive2].
. {color:green}+1{color} There are no new bugs found in [sharelib/spark].
. {color:green}+1{color} There are no new bugs found in [sharelib/oozie].
. {color:green}+1{color} There are no new bugs found in [sharelib/pig].
. {color:green}+1{color} There are no new bugs found in [sharelib/streaming].
. {color:green}+1{color} There are no new bugs found in [sharelib/hive].
. {color:green}+1{color} There are no new bugs found in [sharelib/distcp].
. {color:green}+1{color} There are no new bugs found in [sharelib/hcatalog].
. {color:green}+1{color} There are no new bugs found in [sharelib/sqoop].
. {color:green}+1{color} There are no new bugs found in [sharelib/git].
. {color:green}+1{color} There are no new bugs found in [client].
. {color:green}+1{color} There are no new bugs found in [docs].
. {color:green}+1{color} There are no new bugs found in [tools].
. {color:green}+1{color} There are no new bugs found in 
[fluent-job/fluent-job-api].
. {color:green}+1{color} There are no new bugs found in [server].
. {color:green}+1{color} There are no new bugs found in [webapp].
. {color:green}+1{color} There are no new bugs found in [examples].
. {color:green}+1{color} There are no new bugs found in [core].
{color:green}+1 BACKWARDS_COMPATIBILITY{color}
.{color:green}+1{color} the patch does not change any JPA 
Entity/Colum/Basic/Lob/Transient annotations
.{color:green}+1{color} the patch does not modify JPA files
{color:green}+1 TESTS{color}
.Tests run: 3106
.{color:orange}Tests failed at first run:{color}
TestPurgeXCommand#testPurgeBundleWithCoordChildWithWFChildWithSubWF2
TestPurgeXCommand#testPurgeCoordWithWFChild1
.For the complete list of flaky tests, see TEST-SUMMARY-FULL files.
{color:green}+1 DISTRO{color}
.{color:green}+1{color} distro tarball builds with the patch 


{color:red}*-1 Overall result, please check the reported -1(s)*{color}

{color:red}. There is at least one warning, please check{color}

The full output of the test-patch run is available at

. https://builds.apache.org/job/PreCommit-OOZIE-Build/955/



> The env variable of SPARK_HOME needs to be set when running pySpark
> ---
>
> Key: OOZIE-3404
> URL: https://issues.apache.org/jira/browse/OOZIE-3404
> Project: Oozie
>  Issue Type: Bug
>Affects Versions: 5.1.0
>Reporter: Junfan Zhang
>Assignee: Junfan Zhang
>Priority: Major
> Attachments: oozie-3404-1.patch
>
>
> When we run spark in a cluster, we rely on the spark jars on hdfs. We don't 
> deploy Spark on the cluster server. So running pySpark according to the Oozie 
> documentation is not successful.
>  
> I found that when Hadoop is a 2.0+ version, although Oozie sets the 
> {{SPARK_HOME}}  variable in {{mapred.child.env}} , the {{mapreduce.map.env}} 
> variable is read first in Hadoop ([source 
> code|https://github.com/apache/hadoop/blob/f95b390df2ca7d599f0ad82cf6e8d980469e7abb/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-app/src/main/java/org/apache/hadoop/mapred/MapReduceChildJVM.java#L45])
>  . So when we don't set {{SPARK_HOME}} env in {{mapreduce.map.env}} , pySpark 
> doesn't work.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


Failed: OOZIE-3404 PreCommit Build #955

2019-01-02 Thread Apache Jenkins Server
Jira: https://issues.apache.org/jira/browse/OOZIE-3404
Build: https://builds.apache.org/job/PreCommit-OOZIE-Build/955/

###
## LAST 100 LINES OF THE CONSOLE 
###
[...truncated 1.91 MB...]
[TRACE] New XMLLib present, calling 'xmllint --xpath' to get bug instance counts
[DEBUG] There are no new bugs found in [server].
[TRACE] New XMLLib present, calling 'xmllint --xpath' to get bug instance counts
[DEBUG] There are no new bugs found in [webapp].
[TRACE] New XMLLib present, calling 'xmllint --xpath' to get bug instance counts
[DEBUG] There are no new bugs found in [examples].
[TRACE] New XMLLib present, calling 'xmllint --xpath' to get bug instance counts
[DEBUG] There are no new bugs found in [core].
[INFO] There are no new bugs found totally].
[TRACE] FindBugs diffs checked and reports created
[TRACE] Summary file size is 2561 bytes
[TRACE] Full summary file size is 1471 bytes
[TRACE] File 
[/home/jenkins/jenkins-slave/workspace/PreCommit-OOZIE-Build/test-patch/tmp/FINDBUGS_DIFF/diff/findbugs-diff-0.1.0-all.jar]
 removed
[TRACE] File 
[/home/jenkins/jenkins-slave/workspace/PreCommit-OOZIE-Build/test-patch/tmp/FINDBUGS_DIFF/diff/findbugs-diff-0.1.0-all.jar.md5sum]
 removed
  Running test-patch task BACKWARDS_COMPATIBILITY
  Running test-patch task TESTS
  Running test-patch task DISTRO


Testing JIRA OOZIE-3404

Cleaning local git workspace



+1 PATCH_APPLIES
+1 CLEAN
-1 RAW_PATCH_ANALYSIS
+1 the patch does not introduce any @author tags
+1 the patch does not introduce any tabs
+1 the patch does not introduce any trailing spaces
+1 the patch does not introduce any star imports
+1 the patch does not introduce any line longer than 132
-1 the patch does not add/modify any testcase
+1 RAT
+1 the patch does not seem to introduce new RAT warnings
+1 JAVADOC
+1 Javadoc generation succeeded with the patch
+1 the patch does not seem to introduce new Javadoc warning(s)
WARNING: the current HEAD has 100 Javadoc warning(s)
+1 COMPILE
+1 HEAD compiles
+1 patch compiles
+1 the patch does not seem to introduce new javac warnings
+1 There are no new bugs found in total.
 +1 There are no new bugs found in [sharelib/hive2].
 +1 There are no new bugs found in [sharelib/spark].
 +1 There are no new bugs found in [sharelib/oozie].
 +1 There are no new bugs found in [sharelib/pig].
 +1 There are no new bugs found in [sharelib/streaming].
 +1 There are no new bugs found in [sharelib/hive].
 +1 There are no new bugs found in [sharelib/distcp].
 +1 There are no new bugs found in [sharelib/hcatalog].
 +1 There are no new bugs found in [sharelib/sqoop].
 +1 There are no new bugs found in [sharelib/git].
 +1 There are no new bugs found in [client].
 +1 There are no new bugs found in [docs].
 +1 There are no new bugs found in [tools].
 +1 There are no new bugs found in [fluent-job/fluent-job-api].
 +1 There are no new bugs found in [server].
 +1 There are no new bugs found in [webapp].
 +1 There are no new bugs found in [examples].
 +1 There are no new bugs found in [core].
+1 BACKWARDS_COMPATIBILITY
+1 the patch does not change any JPA Entity/Colum/Basic/Lob/Transient 
annotations
+1 the patch does not modify JPA files
+1 TESTS
Tests run: 3106
Tests failed at first run:
TestPurgeXCommand#testPurgeBundleWithCoordChildWithWFChildWithSubWF2
TestPurgeXCommand#testPurgeCoordWithWFChild1
For the complete list of flaky tests, see TEST-SUMMARY-FULL files.
+1 DISTRO
+1 distro tarball builds with the patch 


-1 Overall result, please check the reported -1(s)

 There is at least one warning, please check

The full output of the test-patch run is available at

 https://builds.apache.org/job/PreCommit-OOZIE-Build/955/

Adding comment to JIRA
  % Total% Received % Xferd  Average Speed   TimeTime Time  Current
 Dload  Upload   Total   SpentLeft  Speed
  0 00 00 0  0  0 --:--:-- --:--:-- --:--:-- 0  
0  35240 00 0  0  0 --:--:-- --:--:-- --:--:-- 
0{"self":"https://issues.apache.org/jira/rest/api/2/issue/13205107/comment/16732031","id":"16732031","author":{"self":"https://issues.apache.org/jira/rest/api/2/user?username=hadoopqa","name":"hadoopqa","key":"hadoopqa","avatarUrls":{"48x48":"https://issues.apache.org/jira/secure/useravatar?ownerId=hadoopqa=10393","24x24":"https://issues.apache.org/jira/secure/useravatar?size=small=hadoopqa=10393","16x16":"https://issues.apache.org/jira/secure/useravatar?size=xsmall=hadoopqa=10393","32x32":"https://issues.apache.org/jira/secure/useravatar?size=medium=hadoopqa=10393"},"displayName":"Hadoop
 QA","active":true,"timeZone":"Etc/UTC"},"body":"\nTesting JIRA 
OOZIE-3404\n\nCleaning local git 

[jira] [Commented] (OOZIE-2949) Escape quotes whitespaces in Sqoop field

2019-01-02 Thread Andras Salamon (JIRA)


[ 
https://issues.apache.org/jira/browse/OOZIE-2949?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16731989#comment-16731989
 ] 

Andras Salamon commented on OOZIE-2949:
---

I've checked the current version of the patch in more detail.
 
The plan was to handle commands like {{--query 'SELECT * FROM employee'}} and 
use two tokens in this case ({{"--query"}} and {{"SELECT * FROM employee"}}). 
The current patch works in this case.
 
It fails if single and double quotes are mixed like ' ... "  "  ' 
because it thinks that the first double quote closes the single quote. It would 
be easy to fix the patch and recognise the whole command like one token. 
 
I'm not sure if this is a good enough solution.
 
It will still fail if we want to use a single quote inside the query (e.g. 
search for O'Briens).

I've also talked to sqoop experts. The confirmed that sqoop command parsing is 
a rather complex task. If we want to be able to block-copy from shell to oozie, 
the best approach is probably mimicking the shell parsing. We would still need 
real-life examples to create lots of test cases.

> Escape quotes whitespaces in Sqoop  field
> --
>
> Key: OOZIE-2949
> URL: https://issues.apache.org/jira/browse/OOZIE-2949
> Project: Oozie
>  Issue Type: Bug
>Affects Versions: 4.3.0
>Reporter: Peter Cseh
>Assignee: Andras Salamon
>Priority: Major
> Fix For: 5.2.0
>
> Attachments: OOZIE-2949-0.patch, OOZIE-2949-01.patch
>
>
> The current behavior of the Sqoop action is:
> {noformat}
> The Sqoop command can be specified either using the command element or 
> multiple arg elements.
> When using the command element, Oozie will split the command on every space 
> into multiple arguments.
> When using the arg elements, Oozie will pass each argument value as an 
> argument to Sqoop.
> {noformat}
> This prevents the user to simply copy-paste the command worked in the shell 
> into the workflow.xml.
> We should split the  field by taking quotes into account, similar to 
> what OOZIE-2391
> did for the Spark action's  field.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (OOZIE-3404) The env variable of SPARK_HOME needs to be set when running pySpark

2019-01-02 Thread Andras Piros (JIRA)


[ 
https://issues.apache.org/jira/browse/OOZIE-3404?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16731971#comment-16731971
 ] 

Andras Piros commented on OOZIE-3404:
-

Thanks for the contribution so far [~zuston].

A couple of questions and remarks:
 * new unit tests missing
 * are you sure you need exactly these MapReduce settings for all Hadoop 
versions 2.0+? This means all the Hadoop versions Oozie currently supports as 
this is in the range [2.6.0, 3.1.1]
 * please provide tested Oozie and Hadoop versions, as well as relevant pieces 
of the PySpark workflow definition / job properties file / Spark version / 
PySpark sources. As [~asalamon74] already pointed out, PySpark works under a 
wide variety of circumstances
 * I don't understand why setting exactly these MapReduce properties would help 
the PySpark jobs (and only those) to succeed. Can you please explain?

> The env variable of SPARK_HOME needs to be set when running pySpark
> ---
>
> Key: OOZIE-3404
> URL: https://issues.apache.org/jira/browse/OOZIE-3404
> Project: Oozie
>  Issue Type: Bug
>Affects Versions: 5.1.0
>Reporter: Junfan Zhang
>Assignee: Junfan Zhang
>Priority: Major
> Attachments: oozie-3404-1.patch
>
>
> When we run spark in a cluster, we rely on the spark jars on hdfs. We don't 
> deploy Spark on the cluster server. So running pySpark according to the Oozie 
> documentation is not successful.
>  
> I found that when Hadoop is a 2.0+ version, although Oozie sets the 
> {{SPARK_HOME}}  variable in {{mapred.child.env}} , the {{mapreduce.map.env}} 
> variable is read first in Hadoop ([source 
> code|https://github.com/apache/hadoop/blob/f95b390df2ca7d599f0ad82cf6e8d980469e7abb/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-app/src/main/java/org/apache/hadoop/mapred/MapReduceChildJVM.java#L45])
>  . So when we don't set {{SPARK_HOME}} env in {{mapreduce.map.env}} , pySpark 
> doesn't work.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (OOZIE-3404) The env variable of SPARK_HOME needs to be set when running pySpark

2019-01-02 Thread Hadoop QA (JIRA)


[ 
https://issues.apache.org/jira/browse/OOZIE-3404?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16731948#comment-16731948
 ] 

Hadoop QA commented on OOZIE-3404:
--

PreCommit-OOZIE-Build started


> The env variable of SPARK_HOME needs to be set when running pySpark
> ---
>
> Key: OOZIE-3404
> URL: https://issues.apache.org/jira/browse/OOZIE-3404
> Project: Oozie
>  Issue Type: Bug
>Affects Versions: 5.1.0
>Reporter: Junfan Zhang
>Assignee: Junfan Zhang
>Priority: Major
> Attachments: oozie-3404-1.patch
>
>
> When we run spark in a cluster, we rely on the spark jars on hdfs. We don't 
> deploy Spark on the cluster server. So running pySpark according to the Oozie 
> documentation is not successful.
>  
> I found that when Hadoop is a 2.0+ version, although Oozie sets the 
> {{SPARK_HOME}}  variable in {{mapred.child.env}} , the {{mapreduce.map.env}} 
> variable is read first in Hadoop ([source 
> code|https://github.com/apache/hadoop/blob/f95b390df2ca7d599f0ad82cf6e8d980469e7abb/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-app/src/main/java/org/apache/hadoop/mapred/MapReduceChildJVM.java#L45])
>  . So when we don't set {{SPARK_HOME}} env in {{mapreduce.map.env}} , pySpark 
> doesn't work.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (OOZIE-3341) [docs] Fix difference between command line help and documentation

2019-01-02 Thread Andras Piros (JIRA)


 [ 
https://issues.apache.org/jira/browse/OOZIE-3341?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andras Piros updated OOZIE-3341:

Component/s: docs

> [docs] Fix difference between command line help and documentation
> -
>
> Key: OOZIE-3341
> URL: https://issues.apache.org/jira/browse/OOZIE-3341
> Project: Oozie
>  Issue Type: Improvement
>  Components: docs
>Affects Versions: trunk, 5.1.0
>Reporter: Andras Salamon
>Assignee: Andras Salamon
>Priority: Major
> Fix For: 5.2.0
>
> Attachments: OOZIE-3341-01.patch
>
>
> The {{help}} command of the oozie CLI prints out usage information. It is 
> also possible to print out only for the specified command, for instance:
> {noformat}
> oozie help job{noformat}
> This information is also available in the 
> [Markdown|https://github.com/apache/oozie/blob/master/docs/src/site/markdown/DG_CommandLineTool.md]
>  documentation and in the generated HTML.
> There are several differences, the Markdown (HTML) looks outdated. For 
> instance for {{job}} command lists the fluent job options as {{apijarcheck}}, 
> {{apijarsubmit}}, and {{apijarrun}} but these options has been renamed to 
> {{validatejar}}, {{submitjar}}, and {{runjar}}. The CLI help shows the 
> correct names. Note that Fluent Job related differences are handled by 
> OOZIE-3358.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (OOZIE-3341) [docs] Fix difference between command line help and documentation

2019-01-02 Thread Andras Piros (JIRA)


 [ 
https://issues.apache.org/jira/browse/OOZIE-3341?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andras Piros updated OOZIE-3341:

Summary: [docs] Fix difference between command line help and documentation  
(was: [docs] Fix difference between command line help and command line 
documentation)

> [docs] Fix difference between command line help and documentation
> -
>
> Key: OOZIE-3341
> URL: https://issues.apache.org/jira/browse/OOZIE-3341
> Project: Oozie
>  Issue Type: Improvement
>Affects Versions: trunk, 5.1.0
>Reporter: Andras Salamon
>Assignee: Andras Salamon
>Priority: Major
> Fix For: 5.2.0
>
> Attachments: OOZIE-3341-01.patch
>
>
> The {{help}} command of the oozie CLI prints out usage information. It is 
> also possible to print out only for the specified command, for instance:
> {noformat}
> oozie help job{noformat}
> This information is also available in the 
> [Markdown|https://github.com/apache/oozie/blob/master/docs/src/site/markdown/DG_CommandLineTool.md]
>  documentation and in the generated HTML.
> There are several differences, the Markdown (HTML) looks outdated. For 
> instance for {{job}} command lists the fluent job options as {{apijarcheck}}, 
> {{apijarsubmit}}, and {{apijarrun}} but these options has been renamed to 
> {{validatejar}}, {{submitjar}}, and {{runjar}}. The CLI help shows the 
> correct names. Note that Fluent Job related differences are handled by 
> OOZIE-3358.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (OOZIE-3341) [docs] Fix difference between command line help and command line documentation

2019-01-02 Thread Andras Piros (JIRA)


 [ 
https://issues.apache.org/jira/browse/OOZIE-3341?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andras Piros updated OOZIE-3341:

Summary: [docs] Fix difference between command line help and command line 
documentation  (was: Fix difference between command line help and command line 
documentation)

> [docs] Fix difference between command line help and command line documentation
> --
>
> Key: OOZIE-3341
> URL: https://issues.apache.org/jira/browse/OOZIE-3341
> Project: Oozie
>  Issue Type: Improvement
>Affects Versions: trunk, 5.1.0
>Reporter: Andras Salamon
>Assignee: Andras Salamon
>Priority: Major
> Fix For: 5.2.0
>
> Attachments: OOZIE-3341-01.patch
>
>
> The {{help}} command of the oozie CLI prints out usage information. It is 
> also possible to print out only for the specified command, for instance:
> {noformat}
> oozie help job{noformat}
> This information is also available in the 
> [Markdown|https://github.com/apache/oozie/blob/master/docs/src/site/markdown/DG_CommandLineTool.md]
>  documentation and in the generated HTML.
> There are several differences, the Markdown (HTML) looks outdated. For 
> instance for {{job}} command lists the fluent job options as {{apijarcheck}}, 
> {{apijarsubmit}}, and {{apijarrun}} but these options has been renamed to 
> {{validatejar}}, {{submitjar}}, and {{runjar}}. The CLI help shows the 
> correct names. Note that Fluent Job related differences are handled by 
> OOZIE-3358.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (OOZIE-3341) Fix difference between command line help and command line documentation

2019-01-02 Thread Andras Piros (JIRA)


[ 
https://issues.apache.org/jira/browse/OOZIE-3341?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16731941#comment-16731941
 ] 

Andras Piros commented on OOZIE-3341:
-

Thanks for the contribution [~asalamon74]! +1

> Fix difference between command line help and command line documentation
> ---
>
> Key: OOZIE-3341
> URL: https://issues.apache.org/jira/browse/OOZIE-3341
> Project: Oozie
>  Issue Type: Improvement
>Affects Versions: trunk, 5.1.0
>Reporter: Andras Salamon
>Assignee: Andras Salamon
>Priority: Major
> Fix For: 5.2.0
>
> Attachments: OOZIE-3341-01.patch
>
>
> The {{help}} command of the oozie CLI prints out usage information. It is 
> also possible to print out only for the specified command, for instance:
> {noformat}
> oozie help job{noformat}
> This information is also available in the 
> [Markdown|https://github.com/apache/oozie/blob/master/docs/src/site/markdown/DG_CommandLineTool.md]
>  documentation and in the generated HTML.
> There are several differences, the Markdown (HTML) looks outdated. For 
> instance for {{job}} command lists the fluent job options as {{apijarcheck}}, 
> {{apijarsubmit}}, and {{apijarrun}} but these options has been renamed to 
> {{validatejar}}, {{submitjar}}, and {{runjar}}. The CLI help shows the 
> correct names. Note that Fluent Job related differences are handled by 
> OOZIE-3358.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (OOZIE-3341) Fix difference between command line help and command line documentation

2019-01-02 Thread Andras Piros (JIRA)


 [ 
https://issues.apache.org/jira/browse/OOZIE-3341?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andras Piros updated OOZIE-3341:

Description: 
The {{help}} command of the oozie CLI prints out usage information. It is also 
possible to print out only for the specified command, for instance:
{noformat}
oozie help job{noformat}
This information is also available in the 
[Markdown|https://github.com/apache/oozie/blob/master/docs/src/site/markdown/DG_CommandLineTool.md]
 documentation and in the generated HTML.

There are several differences, the Markdown (HTML) looks outdated. For instance 
for {{job}} command lists the fluent job options as {{apijarcheck}}, 
{{apijarsubmit}}, and {{apijarrun}} but these options has been renamed to 
{{validatejar}}, {{submitjar}}, and {{runjar}}. The CLI help shows the correct 
names. Note that Fluent Job related differences are handled by OOZIE-3358.

  was:
The {{help}} command of the oozie CLI prints out usage information. It is also 
possible to print out only for the specified command, for instance:
{noformat}
oozie help job{noformat}
This information is also available in the 
[twiki|https://github.com/apache/oozie/blob/master/docs/src/site/twiki/DG_CommandLineTool.twiki]
 documentation and in the generated HTML.

There are several differences, the twiki (HTML) looks outdated. For instance 
for {{job}} command lists the fluent job options as {{apijarcheck}}, 
{{apijarsubmit}}, and {{apijarrun}} but these options has been renamed to 
{{validatejar}}, {{submitjar}}, and {{runjar}}. The CLI help shows the correct 
names. Note that Fluent Job related differences are handled by OOZIE-3358.


> Fix difference between command line help and command line documentation
> ---
>
> Key: OOZIE-3341
> URL: https://issues.apache.org/jira/browse/OOZIE-3341
> Project: Oozie
>  Issue Type: Improvement
>Affects Versions: trunk, 5.1.0
>Reporter: Andras Salamon
>Assignee: Andras Salamon
>Priority: Major
> Fix For: 5.2.0
>
> Attachments: OOZIE-3341-01.patch
>
>
> The {{help}} command of the oozie CLI prints out usage information. It is 
> also possible to print out only for the specified command, for instance:
> {noformat}
> oozie help job{noformat}
> This information is also available in the 
> [Markdown|https://github.com/apache/oozie/blob/master/docs/src/site/markdown/DG_CommandLineTool.md]
>  documentation and in the generated HTML.
> There are several differences, the Markdown (HTML) looks outdated. For 
> instance for {{job}} command lists the fluent job options as {{apijarcheck}}, 
> {{apijarsubmit}}, and {{apijarrun}} but these options has been renamed to 
> {{validatejar}}, {{submitjar}}, and {{runjar}}. The CLI help shows the 
> correct names. Note that Fluent Job related differences are handled by 
> OOZIE-3358.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (OOZIE-3400) [core] Fix PurgeService sub-sub-workflow checking

2019-01-02 Thread Andras Piros (JIRA)


 [ 
https://issues.apache.org/jira/browse/OOZIE-3400?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andras Piros updated OOZIE-3400:

Summary: [core] Fix PurgeService sub-sub-workflow checking  (was: Fix 
PurgeService sub-sub-workflow checking)

> [core] Fix PurgeService sub-sub-workflow checking
> -
>
> Key: OOZIE-3400
> URL: https://issues.apache.org/jira/browse/OOZIE-3400
> Project: Oozie
>  Issue Type: Sub-task
>  Components: core
>Affects Versions: trunk
>Reporter: Andras Salamon
>Assignee: Andras Salamon
>Priority: Major
> Fix For: 5.2.0
>
> Attachments: OOZIE-3400-00.patch, OOZIE-3400-01.patch, 
> OOZIE-3400-02.patch, OOZIE-3400-03.patch
>
>
> Purge service 
> [checks|https://github.com/apache/oozie/blob/master/core/src/main/java/org/apache/oozie/command/PurgeXCommand.java#L222-L223]
>  sub-workflows before it deletes a workflow. If some of the sub-workflows is 
> not yet ready to be purged it does not delete the workflow. Several unit 
> tests also checks this, e. g. 
> [testPurgeWFWithSubWF1|https://github.com/apache/oozie/blob/master/core/src/test/java/org/apache/oozie/command/TestPurgeXCommand.java#L2271].
> This check is recursive, it also checks sub-sub-workflows, 
> sub-sub-sub-workflows
> The recursive check is buggy. Let's assume that we have a workflow (A) with a 
> single sub-workflow (B), which also has a sub-sub-workflow (C). If A is ready 
> to be purged, B is ready to be purged, and C is not ready to be purged then 
> the current implementation correctly recognizes that C and B is not 
> purgeable, but it purges A. Instead of this, it should also recognize that 
> it's not possible to purge A.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (OOZIE-3341) Fix difference between command line help and command line documentation

2019-01-02 Thread Andras Piros (JIRA)


 [ 
https://issues.apache.org/jira/browse/OOZIE-3341?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andras Piros updated OOZIE-3341:

Affects Version/s: 5.1.0

> Fix difference between command line help and command line documentation
> ---
>
> Key: OOZIE-3341
> URL: https://issues.apache.org/jira/browse/OOZIE-3341
> Project: Oozie
>  Issue Type: Improvement
>Affects Versions: trunk, 5.1.0
>Reporter: Andras Salamon
>Assignee: Andras Salamon
>Priority: Major
> Fix For: 5.2.0
>
> Attachments: OOZIE-3341-01.patch
>
>
> The {{help}} command of the oozie CLI prints out usage information. It is 
> also possible to print out only for the specified command, for instance:
> {noformat}
> oozie help job{noformat}
> This information is also available in the 
> [twiki|https://github.com/apache/oozie/blob/master/docs/src/site/twiki/DG_CommandLineTool.twiki]
>  documentation and in the generated HTML.
> There are several differences, the twiki (HTML) looks outdated. For instance 
> for {{job}} command lists the fluent job options as {{apijarcheck}}, 
> {{apijarsubmit}}, and {{apijarrun}} but these options has been renamed to 
> {{validatejar}}, {{submitjar}}, and {{runjar}}. The CLI help shows the 
> correct names. Note that Fluent Job related differences are handled by 
> OOZIE-3358.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Assigned] (OOZIE-3409) Oozie Server : Possible memory leak in EL evaluation

2019-01-02 Thread Andras Salamon (JIRA)


 [ 
https://issues.apache.org/jira/browse/OOZIE-3409?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andras Salamon reassigned OOZIE-3409:
-

Assignee: Andras Salamon

> Oozie Server : Possible memory leak in EL evaluation
> 
>
> Key: OOZIE-3409
> URL: https://issues.apache.org/jira/browse/OOZIE-3409
> Project: Oozie
>  Issue Type: Bug
>Affects Versions: 5.1.0
>Reporter: Praneeth
>Assignee: Andras Salamon
>Priority: Major
>
> I have noticed that there is a potential memory leak in the Oozie server due 
> to one of its dependencies - {{commons-el:commons-el:1.0}}. Our oozie servers 
> have been running out of memory occasionally. It happens more often when we 
> have more activity. Upon analysis of the heap dump, it seems that the culprit 
> here is the {{org.apache.commons.el.ExpressionEvaluatorImpl}} from 
> {{commons-el}} which is used in the Oozie server through 
> {{org.apache.oozie.util.ELEvaluator}} [1] . {{ExpressionEvaluatorImpl}} uses 
> a static {{Map}} [2] to cache the EL evaluations. This map never gets 
> cleared, eventually taking up all the memory. So, in an environment with 
> heavy usage and long running oozie servers, this would lead to an OOM. The 
> configurations for workflow actions we have are possibly larger than normal 
> which is worsening the situation as the Map has to maintain larger strings.
>  # 
> [https://github.com/apache/oozie/blob/release-5.1.0/core/src/main/java/org/apache/oozie/util/ELEvaluator.java#L143]
>  # 
> [http://svn.apache.org/viewvc/commons/dormant/el/trunk/src/java/org/apache/commons/el/ExpressionEvaluatorImpl.java?view=markup#l94]



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (OOZIE-3400) Fix PurgeService sub-sub-workflow checking

2019-01-02 Thread Andras Piros (JIRA)


 [ 
https://issues.apache.org/jira/browse/OOZIE-3400?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andras Piros updated OOZIE-3400:

Component/s: core

> Fix PurgeService sub-sub-workflow checking
> --
>
> Key: OOZIE-3400
> URL: https://issues.apache.org/jira/browse/OOZIE-3400
> Project: Oozie
>  Issue Type: Sub-task
>  Components: core
>Affects Versions: trunk
>Reporter: Andras Salamon
>Assignee: Andras Salamon
>Priority: Major
> Fix For: 5.2.0
>
> Attachments: OOZIE-3400-00.patch, OOZIE-3400-01.patch, 
> OOZIE-3400-02.patch, OOZIE-3400-03.patch
>
>
> Purge service 
> [checks|https://github.com/apache/oozie/blob/master/core/src/main/java/org/apache/oozie/command/PurgeXCommand.java#L222-L223]
>  sub-workflows before it deletes a workflow. If some of the sub-workflows is 
> not yet ready to be purged it does not delete the workflow. Several unit 
> tests also checks this, e. g. 
> [testPurgeWFWithSubWF1|https://github.com/apache/oozie/blob/master/core/src/test/java/org/apache/oozie/command/TestPurgeXCommand.java#L2271].
> This check is recursive, it also checks sub-sub-workflows, 
> sub-sub-sub-workflows
> The recursive check is buggy. Let's assume that we have a workflow (A) with a 
> single sub-workflow (B), which also has a sub-sub-workflow (C). If A is ready 
> to be purged, B is ready to be purged, and C is not ready to be purged then 
> the current implementation correctly recognizes that C and B is not 
> purgeable, but it purges A. Instead of this, it should also recognize that 
> it's not possible to purge A.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (OOZIE-3400) Fix PurgeService sub-sub-workflow checking

2019-01-02 Thread Andras Piros (JIRA)


[ 
https://issues.apache.org/jira/browse/OOZIE-3400?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16731919#comment-16731919
 ] 

Andras Piros commented on OOZIE-3400:
-

Thanks for the contribution [~asalamon74]! +1

> Fix PurgeService sub-sub-workflow checking
> --
>
> Key: OOZIE-3400
> URL: https://issues.apache.org/jira/browse/OOZIE-3400
> Project: Oozie
>  Issue Type: Sub-task
>Affects Versions: trunk
>Reporter: Andras Salamon
>Assignee: Andras Salamon
>Priority: Major
> Fix For: 5.2.0
>
> Attachments: OOZIE-3400-00.patch, OOZIE-3400-01.patch, 
> OOZIE-3400-02.patch, OOZIE-3400-03.patch
>
>
> Purge service 
> [checks|https://github.com/apache/oozie/blob/master/core/src/main/java/org/apache/oozie/command/PurgeXCommand.java#L222-L223]
>  sub-workflows before it deletes a workflow. If some of the sub-workflows is 
> not yet ready to be purged it does not delete the workflow. Several unit 
> tests also checks this, e. g. 
> [testPurgeWFWithSubWF1|https://github.com/apache/oozie/blob/master/core/src/test/java/org/apache/oozie/command/TestPurgeXCommand.java#L2271].
> This check is recursive, it also checks sub-sub-workflows, 
> sub-sub-sub-workflows
> The recursive check is buggy. Let's assume that we have a workflow (A) with a 
> single sub-workflow (B), which also has a sub-sub-workflow (C). If A is ready 
> to be purged, B is ready to be purged, and C is not ready to be purged then 
> the current implementation correctly recognizes that C and B is not 
> purgeable, but it purges A. Instead of this, it should also recognize that 
> it's not possible to purge A.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


Re: Review Request 69594: OOZIE-3400: Fix PurgeService sub-sub-workflow checking

2019-01-02 Thread András Piros via Review Board

---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/69594/#review211601
---


Ship it!




Ship It!

- András Piros


On Dec. 27, 2018, 3:13 p.m., Andras Salamon wrote:
> 
> ---
> This is an automatically generated e-mail. To reply, visit:
> https://reviews.apache.org/r/69594/
> ---
> 
> (Updated Dec. 27, 2018, 3:13 p.m.)
> 
> 
> Review request for oozie, András Piros and Kinga Marton.
> 
> 
> Repository: oozie-git
> 
> 
> Description
> ---
> 
> OOZIE-3400: Fix PurgeService sub-sub-workflow checking
> 
> 
> Diffs
> -
> 
>   core/src/main/java/org/apache/oozie/ErrorCode.java 9cc153bb0 
>   core/src/main/java/org/apache/oozie/command/PurgeXCommand.java 42c3b28a6 
>   core/src/test/java/org/apache/oozie/command/TestPurgeXCommand.java 
> d11fcffbb 
>   core/src/test/java/org/apache/oozie/command/TestSelectorTreeTraverser.java 
> PRE-CREATION 
> 
> 
> Diff: https://reviews.apache.org/r/69594/diff/3/
> 
> 
> Testing
> ---
> 
> Run TestPurgeXCommand unit tests locally.
> 
> 
> Thanks,
> 
> Andras Salamon
> 
>



[jira] [Resolved] (OOZIE-3412) Update repository links to gitbox

2019-01-02 Thread Peter Cseh (JIRA)


 [ 
https://issues.apache.org/jira/browse/OOZIE-3412?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Peter Cseh resolved OOZIE-3412.
---
Resolution: Fixed

Thanks for the verification!

> Update repository links to gitbox
> -
>
> Key: OOZIE-3412
> URL: https://issues.apache.org/jira/browse/OOZIE-3412
> Project: Oozie
>  Issue Type: Task
>Reporter: Peter Cseh
>Assignee: Peter Cseh
>Priority: Major
>
> Once  INFRA-17506 is done, we should update the website and the pre-commit 
> job to refer the new repository.
> Things to update:
> ||Name||Status||
> |Jenkins precommit|Done|
> |[Website repo url|http://oozie.apache.org/source-repository.html]|Done|
> |[Branhes 
> site|https://cwiki.apache.org/confluence/display/OOZIE/Branches+Information]|Done|



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (OOZIE-3412) Update repository links to gitbox

2019-01-02 Thread Andras Salamon (JIRA)


[ 
https://issues.apache.org/jira/browse/OOZIE-3412?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16731827#comment-16731827
 ] 

Andras Salamon commented on OOZIE-3412:
---

[~gezapeti] Links look good to me. +1 (Non-binding).

> Update repository links to gitbox
> -
>
> Key: OOZIE-3412
> URL: https://issues.apache.org/jira/browse/OOZIE-3412
> Project: Oozie
>  Issue Type: Task
>Reporter: Peter Cseh
>Assignee: Peter Cseh
>Priority: Major
>
> Once  INFRA-17506 is done, we should update the website and the pre-commit 
> job to refer the new repository.
> Things to update:
> ||Name||Status||
> |Jenkins precommit|Done|
> |[Website repo url|http://oozie.apache.org/source-repository.html]|Done|
> |[Branhes 
> site|https://cwiki.apache.org/confluence/display/OOZIE/Branches+Information]|Done|



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)