Re: Specifying different version of pyspark.zip and py4j files on worker nodes with Spark pre-installed

2018-10-05 Thread Marcelo Vanzin
ps://github.com/apache/spark/blob/88e7e87bd5c052e10f52d4bb97a9d78f5b524128/core/src/main/scala/org/apache/spark/api/python/PythonUtils.scala#L31 >> > >> > The code shows Spark will try to find the path if SPARK_HOME is specified. >> > And on my worker node, SPARK_HOME is s

[jira] [Commented] (SPARK-25645) Add provision to disable EventLoggingListener default flush/hsync/hflush for all events

2018-10-04 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25645?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16639147#comment-16639147 ] Marcelo Vanzin commented on SPARK-25645: I think I might have suggested thi

[jira] [Created] (SPARK-25646) docker-image-tool.sh doesn't work on developer build

2018-10-04 Thread Marcelo Vanzin (JIRA)
Marcelo Vanzin created SPARK-25646: -- Summary: docker-image-tool.sh doesn't work on developer build Key: SPARK-25646 URL: https://issues.apache.org/jira/browse/SPARK-25646 Project:

[jira] [Commented] (SPARK-25645) Add provision to disable EventLoggingListener default flush/hsync/hflush for all events

2018-10-04 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25645?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16639024#comment-16639024 ] Marcelo Vanzin commented on SPARK-25645: This seems similar to SPARK-2

[jira] [Commented] (CRYPTO-142) Native library does not load if ENGINE_load_rdrand is not available

2018-10-04 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/CRYPTO-142?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16638996#comment-16638996 ] Marcelo Vanzin commented on CRYPTO-142: --- It seems that things are alr

[jira] [Created] (CRYPTO-142) Native library does not load if ENGINE_load_rdrand is not available

2018-10-04 Thread Marcelo Vanzin (JIRA)
Marcelo Vanzin created CRYPTO-142: - Summary: Native library does not load if ENGINE_load_rdrand is not available Key: CRYPTO-142 URL: https://issues.apache.org/jira/browse/CRYPTO-142 Project: Commons

Re: Specifying different version of pyspark.zip and py4j files on worker nodes with Spark pre-installed

2018-10-04 Thread Marcelo Vanzin
.bashrc , for the > pre-installed 2.2.1 path. > > I don't want to make any changes to worker node configuration, so any way to > override the order? > > Jianshi > > On Fri, Oct 5, 2018 at 12:11 AM Marcelo Vanzin wrote: >> >> Normally the version of Spark in

[jira] [Updated] (CRYPTO-60) opensslCipher support GCM mode

2018-10-04 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/CRYPTO-60?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin updated CRYPTO-60: - Fix Version/s: 1.1.0 > opensslCipher support GCM m

[jira] [Resolved] (CRYPTO-135) CryptoOutputStream is always blocking

2018-10-04 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/CRYPTO-135?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin resolved CRYPTO-135. --- Resolution: Won't Fix See discussion in the PR for this: https://github.com/apache/co

Re: Specifying different version of pyspark.zip and py4j files on worker nodes with Spark pre-installed

2018-10-04 Thread Marcelo Vanzin
Normally the version of Spark installed on the cluster does not matter, since Spark is uploaded from your gateway machine to YARN by default. You probably have some configuration (in spark-defaults.conf) that tells YARN to use a cached copy. Get rid of that configuration, and you can use whatever
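A minimal sketch of how one might look for such a setting. The message does not name the configuration keys; `spark.yarn.archive` and `spark.yarn.jars` are assumed here as the likely candidates, and the helper name `find_cached_spark_settings` is hypothetical. The parsing is simplified and only handles the common `key value` and `key=value` forms.

```python
import re

# Assumption: these are the settings that make YARN reuse a pre-staged Spark
# copy instead of the one uploaded from the gateway machine.
CACHED_SPARK_KEYS = ("spark.yarn.archive", "spark.yarn.jars")


def find_cached_spark_settings(conf_text):
    """Return {key: value} for cached-Spark settings in spark-defaults.conf text."""
    found = {}
    for raw_line in conf_text.splitlines():
        line = raw_line.strip()
        # Skip blank lines and comments.
        if not line or line.startswith("#"):
            continue
        # Split the key from the value on the first whitespace, '=' or ':'.
        parts = re.split(r"\s*[=:\s]\s*", line, maxsplit=1)
        key = parts[0]
        value = parts[1].strip() if len(parts) > 1 else ""
        if key in CACHED_SPARK_KEYS:
            found[key] = value
    return found
```

Running this over the contents of the gateway's `spark-defaults.conf` lists any entries worth removing or overriding before submitting the job.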

[jira] [Updated] (SPARK-25586) toString method of GeneralizedLinearRegressionTrainingSummary runs in infinite loop throwing StackOverflowError

2018-10-03 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25586?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin updated SPARK-25586: --- Issue Type: Bug (was: Improvement) > toString method

[jira] [Commented] (SPARK-25586) toString method of GeneralizedLinearRegressionTrainingSummary runs in infinite loop throwing StackOverflowError

2018-10-03 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25586?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16637605#comment-16637605 ] Marcelo Vanzin commented on SPARK-25586: bq. This is not a bug Actually it

[jira] [Resolved] (SPARK-25586) toString method of GeneralizedLinearRegressionTrainingSummary runs in infinite loop throwing StackOverflowError

2018-10-03 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25586?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin resolved SPARK-25586. Resolution: Fixed Fix Version/s: 3.0.0 Issue resolved by pull request 22616 [https

[jira] [Assigned] (SPARK-25586) toString method of GeneralizedLinearRegressionTrainingSummary runs in infinite loop throwing StackOverflowError

2018-10-03 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25586?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin reassigned SPARK-25586: -- Assignee: Ankur Gupta > toString method

Re: Missing board report

2018-10-02 Thread Marcelo Vanzin
Alex, could you take care of that small update? Thanks! On Tue, Oct 2, 2018 at 4:19 PM Justin Mclean wrote: > > Hi, > > > I guess we just haven't been asking anything from the mentors, so we > > > should just mention that in the report? > > That sounds good to me. For what it's worth I can see that

Re: Missing board report

2018-10-02 Thread Marcelo Vanzin
Hi Justin, Sorry, it wasn't intentional, it was probably just not in the original report I based my update on. I guess we just haven't been asking anything from the mentors, so we should just mention that in the report? On Tue, Oct 2, 2018 at 3:40 PM Justin Mclean wrote: > > HI, > > I notice yo

Re: Missing board report

2018-10-02 Thread Marcelo Vanzin
I'm pretty sure I don't have access to the wiki, but I took the last report and added a small blurb about what's been going on. Feel free to update the wiki on my behalf. === Livy is a web service that exposes a REST interface for managing long-running Apache Spark c

[jira] [Created] (SPARK-25598) Remove flume connector in Spark 3

2018-10-02 Thread Marcelo Vanzin (JIRA)
Marcelo Vanzin created SPARK-25598: -- Summary: Remove flume connector in Spark 3 Key: SPARK-25598 URL: https://issues.apache.org/jira/browse/SPARK-25598 Project: Spark Issue Type: Task

[jira] [Created] (LIVY-521) Shade netty in RSC

2018-10-02 Thread Marcelo Vanzin (JIRA)
Marcelo Vanzin created LIVY-521: --- Summary: Shade netty in RSC Key: LIVY-521 URL: https://issues.apache.org/jira/browse/LIVY-521 Project: Livy Issue Type: Task Components: RSC

[jira] [Resolved] (SPARK-22275) SparkContext doesn't clean up after itself when "fatal" errors occur

2018-10-02 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22275?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin resolved SPARK-22275. Resolution: Won't Fix Let's leave it like that until it becomes a bigg

[jira] [Updated] (SPARK-25590) kubernetes-model-2.0.0.jar masks default Spark logging config

2018-10-01 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25590?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin updated SPARK-25590: --- Description: That jar file, which is packaged when the k8s profile is enabled, has a log4j

[jira] [Created] (SPARK-25590) kubernetes-model-2.0.0.jar masks default Spark logging config

2018-10-01 Thread Marcelo Vanzin (JIRA)
Marcelo Vanzin created SPARK-25590: -- Summary: kubernetes-model-2.0.0.jar masks default Spark logging config Key: SPARK-25590 URL: https://issues.apache.org/jira/browse/SPARK-25590 Project: Spark

[jira] [Assigned] (LIVY-520) Log when a session is deleted because of timeout

2018-10-01 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/LIVY-520?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin reassigned LIVY-520: --- Assignee: Marco Gaido > Log when a session is deleted because of time

[jira] [Resolved] (LIVY-520) Log when a session is deleted because of timeout

2018-10-01 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/LIVY-520?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin resolved LIVY-520. - Resolution: Fixed Fix Version/s: 0.6.0 Issue resolved by pull request 116 [https

[jira] [Commented] (SPARK-25380) Generated plans occupy over 50% of Spark driver memory

2018-09-29 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25380?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16633145#comment-16633145 ] Marcelo Vanzin commented on SPARK-25380: We can provide ways to diminish

Re: [CRYPTO] Drop support for Java 7?

2018-09-28 Thread Marcelo Vanzin
> However it fails when trying to execute 'make'. > > I don't know if make is available on those boxes. > Needs further investigation. > > On Fri, 28 Sep 2018 at 11:27, sebb wrote: > > > > On Fri, 28 Sep 2018 at 02:24, Marcelo Vanzin > > wrote: >

[jira] [Commented] (SPARK-25380) Generated plans occupy over 50% of Spark driver memory

2018-09-27 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25380?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16631236#comment-16631236 ] Marcelo Vanzin commented on SPARK-25380: If all you want is see this live,

Re: [CRYPTO] Drop support for Java 7?

2018-09-27 Thread Marcelo Vanzin
for those - but I don't think I'll have time to play with that at the moment.) On Thu, Sep 27, 2018 at 10:25 AM Gary Gregory wrote: > > +1 to update to Java 8. > > Gary > > On Thu, Sep 27, 2018 at 11:02 AM Marcelo Vanzin > wrote: > > > Any concerns with

[jira] [Resolved] (SPARK-25533) Inconsistent message for Completed Jobs in the JobUI, when there are failed jobs, compared to spark2.2

2018-09-27 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25533?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin resolved SPARK-25533. Resolution: Fixed Fix Version/s: 2.4.0 2.3.3 > Inconsist

[jira] [Resolved] (LIVY-512) Remove support for Scala 2.10

2018-09-27 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/LIVY-512?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin resolved LIVY-512. - Resolution: Fixed Assignee: Marcelo Vanzin Fix Version/s: 0.6.0 > Remove supp

[jira] [Resolved] (LIVY-511) Bump minimum Spark version to 2.2

2018-09-27 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/LIVY-511?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin resolved LIVY-511. - Resolution: Fixed Assignee: Marcelo Vanzin Fix Version/s: 0.6.0 > Bump mini

[CRYPTO] Drop support for Java 7?

2018-09-27 Thread Marcelo Vanzin
Any concerns with dropping support for Java 7 in the next release of commons-crypto? Are there any rules about not doing that kind of thing in a "minor" release? I was trying to fix up the Travis builds to work on JDK7 but it seems that Travis's support for JDK7 is kinda broken at the moment... an

[jira] [Assigned] (SPARK-25372) Deprecate Yarn-specific configs in regards to keytab login for SparkSubmit

2018-09-26 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25372?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin reassigned SPARK-25372: -- Assignee: Ilan Filonenko > Deprecate Yarn-specific configs in regards to key

[jira] [Resolved] (SPARK-25372) Deprecate Yarn-specific configs in regards to keytab login for SparkSubmit

2018-09-26 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25372?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin resolved SPARK-25372. Resolution: Fixed Fix Version/s: 2.5.0 Issue resolved by pull request 22362 [https

[jira] [Commented] (CRYPTO-141) Errors in native code can leave Java wrappers in bad state

2018-09-26 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/CRYPTO-141?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16629523#comment-16629523 ] Marcelo Vanzin commented on CRYPTO-141: --- I opened a PR for this: h

[jira] [Commented] (SPARK-25533) Inconsistent message for Completed Jobs in the JobUI, when there are failed jobs, compared to spark2.2

2018-09-26 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25533?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16629243#comment-16629243 ] Marcelo Vanzin commented on SPARK-25533: This is merged to master.

[jira] [Assigned] (SPARK-25533) Inconsistent message for Completed Jobs in the JobUI, when there are failed jobs, compared to spark2.2

2018-09-26 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25533?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin reassigned SPARK-25533: -- Assignee: shahid > Inconsistent message for Completed Jobs in the JobUI, when th

[jira] [Issue Comment Deleted] (SPARK-25546) RDDInfo uses SparkEnv before it may have been initialized

2018-09-26 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25546?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin updated SPARK-25546: --- Comment: was deleted (was: User 'vanzin' has created a pull request for this is

[jira] [Created] (LIVY-519) Flaky test: SparkYarnApp "should kill yarn app "

2018-09-26 Thread Marcelo Vanzin (JIRA)
Marcelo Vanzin created LIVY-519: --- Summary: Flaky test: SparkYarnApp "should kill yarn app " Key: LIVY-519 URL: https://issues.apache.org/jira/browse/LIVY-519 Project: Livy Issue

[jira] [Created] (SPARK-25546) RDDInfo uses SparkEnv before it may have been initialized

2018-09-26 Thread Marcelo Vanzin (JIRA)
Marcelo Vanzin created SPARK-25546: -- Summary: RDDInfo uses SparkEnv before it may have been initialized Key: SPARK-25546 URL: https://issues.apache.org/jira/browse/SPARK-25546 Project: Spark

[jira] [Created] (LIVY-518) Add support for Spark 2.4

2018-09-26 Thread Marcelo Vanzin (JIRA)
Marcelo Vanzin created LIVY-518: --- Summary: Add support for Spark 2.4 Key: LIVY-518 URL: https://issues.apache.org/jira/browse/LIVY-518 Project: Livy Issue Type: Task Affects Versions

[jira] [Resolved] (SPARK-25318) Add exception handling when wrapping the input stream during the the fetch or stage retry in response to a corrupted block

2018-09-26 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25318?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin resolved SPARK-25318. Resolution: Fixed Fix Version/s: 2.4.0 Issue resolved by pull request 22325 [https

[jira] [Assigned] (SPARK-25318) Add exception handling when wrapping the input stream during the the fetch or stage retry in response to a corrupted block

2018-09-26 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25318?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin reassigned SPARK-25318: -- Assignee: Reza Safi > Add exception handling when wrapping the input stream dur

[jira] [Commented] (SPARK-25535) Work around bad error checking in commons-crypto

2018-09-25 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25535?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16628078#comment-16628078 ] Marcelo Vanzin commented on SPARK-25535: I filed CRYPTO-141 for the com

[jira] [Commented] (CRYPTO-141) Errors in native code can leave Java wrappers in bad state

2018-09-25 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/CRYPTO-141?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16628077#comment-16628077 ] Marcelo Vanzin commented on CRYPTO-141: --- It might be a better idea to make

[jira] [Created] (CRYPTO-141) Errors in native code can leave Java wrappers in bad state

2018-09-25 Thread Marcelo Vanzin (JIRA)
Marcelo Vanzin created CRYPTO-141: - Summary: Errors in native code can leave Java wrappers in bad state Key: CRYPTO-141 URL: https://issues.apache.org/jira/browse/CRYPTO-141 Project: Commons Crypto

[jira] [Created] (SPARK-25535) Work around bad error checking in commons-crypto

2018-09-25 Thread Marcelo Vanzin (JIRA)
Marcelo Vanzin created SPARK-25535: -- Summary: Work around bad error checking in commons-crypto Key: SPARK-25535 URL: https://issues.apache.org/jira/browse/SPARK-25535 Project: Spark Issue

[jira] [Assigned] (SPARK-25502) [Spark Job History] Empty Page when page number exceeds the reatinedTask size

2018-09-24 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25502?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin reassigned SPARK-25502: -- Assignee: shahid (was: ABHISHEK KUMAR GUPTA) > [Spark Job History] Empty Page w

[jira] [Resolved] (LIVY-494) Add thriftserver to Livy server

2018-09-24 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/LIVY-494?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin resolved LIVY-494. - Resolution: Fixed Assignee: Marco Gaido Fix Version/s: 0.6.0 > Add thriftserver

[jira] [Resolved] (SPARK-25518) Spark kafka delegation token supported

2018-09-24 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25518?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin resolved SPARK-25518. Resolution: Duplicate > Spark kafka delegation token suppor

[jira] [Assigned] (SPARK-25502) [Spark Job History] Empty Page when page number exceeds the reatinedTask size

2018-09-24 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25502?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin reassigned SPARK-25502: -- Assignee: ABHISHEK KUMAR GUPTA > [Spark Job History] Empty Page when page num

[jira] [Resolved] (SPARK-25502) [Spark Job History] Empty Page when page number exceeds the reatinedTask size

2018-09-24 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25502?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin resolved SPARK-25502. Resolution: Fixed Fix Version/s: 2.4.0 2.3.3 Issue resolved by

[jira] [Created] (LIVY-517) NPE in session manager cleanup thread

2018-09-21 Thread Marcelo Vanzin (JIRA)
Marcelo Vanzin created LIVY-517: --- Summary: NPE in session manager cleanup thread Key: LIVY-517 URL: https://issues.apache.org/jira/browse/LIVY-517 Project: Livy Issue Type: Bug

[jira] [Assigned] (SPARK-24918) Executor Plugin API

2018-09-20 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24918?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin reassigned SPARK-24918: -- Assignee: Nihar Sheth > Executor Plugin

[jira] [Resolved] (SPARK-24918) Executor Plugin API

2018-09-20 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24918?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin resolved SPARK-24918. Resolution: Fixed Fix Version/s: 2.4.0 Issue resolved by pull request 22192 [https

[jira] [Commented] (LIVY-514) No LivyClientFactory implementation was found

2018-09-18 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/LIVY-514?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16619476#comment-16619476 ] Marcelo Vanzin commented on LIVY-514: - Do you have {{livy-client-http}} in

Re: [VOTE] SPARK 2.4.0 (RC1)

2018-09-17 Thread Marcelo Vanzin
You can log in to https://repository.apache.org and see what's wrong. Just find that staging repo and look at the messages. In your case it seems related to your signature. failureMessage: No public key: Key with id: () was not able to be located on http://gpg-keyserver.de/. Upload your public k

[jira] [Created] (LIVY-513) Flaky test: SessionHeartbeatWatchdog

2018-09-14 Thread Marcelo Vanzin (JIRA)
Marcelo Vanzin created LIVY-513: --- Summary: Flaky test: SessionHeartbeatWatchdog Key: LIVY-513 URL: https://issues.apache.org/jira/browse/LIVY-513 Project: Livy Issue Type: Bug

[jira] [Updated] (LIVY-511) Bump minimum Spark version to 2.2

2018-09-14 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/LIVY-511?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin updated LIVY-511: Labels: release-notes (was: ) > Bump minimum Spark version to

[jira] [Updated] (LIVY-512) Remove support for Scala 2.10

2018-09-14 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/LIVY-512?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin updated LIVY-512: Labels: release-notes (was: ) > Remove support for Scala 2

[jira] [Updated] (LIVY-510) Remove support for jdk7

2018-09-14 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/LIVY-510?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin updated LIVY-510: Labels: release-notes (was: ) > Remove support for j

[jira] [Created] (LIVY-512) Remove support for Scala 2.10

2018-09-14 Thread Marcelo Vanzin (JIRA)
Marcelo Vanzin created LIVY-512: --- Summary: Remove support for Scala 2.10 Key: LIVY-512 URL: https://issues.apache.org/jira/browse/LIVY-512 Project: Livy Issue Type: Task Reporter

[jira] [Created] (LIVY-511) Bump minimum Spark version to 2.2

2018-09-14 Thread Marcelo Vanzin (JIRA)
Marcelo Vanzin created LIVY-511: --- Summary: Bump minimum Spark version to 2.2 Key: LIVY-511 URL: https://issues.apache.org/jira/browse/LIVY-511 Project: Livy Issue Type: Task

[jira] [Created] (LIVY-510) Remove support for jdk7

2018-09-14 Thread Marcelo Vanzin (JIRA)
Marcelo Vanzin created LIVY-510: --- Summary: Remove support for jdk7 Key: LIVY-510 URL: https://issues.apache.org/jira/browse/LIVY-510 Project: Livy Issue Type: Task Reporter

Re: [DISCUSS] Getting rid of old stuff

2018-09-13 Thread Marcelo Vanzin
sco, CA 94105 > > > < > > https://maps.google.com/?q=505+Howard+Street+San+Francisco,+CA+94105+United+States&entry=gmail&source=g > > > > > > United States > > > < > > https://maps.google.com/?q=505+Howard+Street+San+Francisco,+CA+94105+Un

[DISCUSS] Getting rid of old stuff

2018-09-13 Thread Marcelo Vanzin
Hey all, I'd like to gauge people's reaction to some proposals regarding what is supported in Livy. #1: Java versions I propose dropping support for Java 7. Even J8 is already EOL, although it's pretty obvious nobody is getting rid of it anytime soon. But I don't see a good reason to support J7.

Re: PYSPARK_GATEWAY_SECRET error when running Livy with Spark on YARN, Cluster mode and using PySpark

2018-09-13 Thread Marcelo Vanzin
That requires a Livy fix that's currently only in the master branch. Another option is to use the previous version of the Cloudera parcel (which does not have the fix for CVE-2018-1334, which introduced this incompatibility in Livy). On Thu, Sep 13, 2018 at 12:29 AM Rabe, Jens wrote: > > Hello fo

[jira] [Updated] (LIVY-503) Move RPC classes used in thrifserver in a separate module

2018-09-12 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/LIVY-503?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin updated LIVY-503: Summary: Move RPC classes used in thrifserver in a separate module (was: More RPC classes used in

[jira] [Commented] (SPARK-25380) Generated plans occupy over 50% of Spark driver memory

2018-09-12 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25380?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16612433#comment-16612433 ] Marcelo Vanzin commented on SPARK-25380: Yep. That's a 200MB plan de

[jira] [Resolved] (SPARK-25410) Spark executor on YARN does not include memoryOverhead when starting an ExecutorRunnable

2018-09-11 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25410?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin resolved SPARK-25410. Resolution: Not A Bug bq. This means that the amount of memoryOverhead will not be used

[jira] [Commented] (SPARK-25380) Generated plans occupy over 50% of Spark driver memory

2018-09-11 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25380?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16611012#comment-16611012 ] Marcelo Vanzin commented on SPARK-25380: Another bit of information that w

[jira] [Resolved] (SPARK-24889) dataset.unpersist() doesn't update storage memory stats

2018-09-11 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24889?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin resolved SPARK-24889. Resolution: Fixed Fix Version/s: 2.3.2 2.4.0 Issue resolved by

[jira] [Assigned] (SPARK-24889) dataset.unpersist() doesn't update storage memory stats

2018-09-11 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24889?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin reassigned SPARK-24889: -- Assignee: Liang-Chi Hsieh > dataset.unpersist() doesn't update storage memo

[jira] [Assigned] (SPARK-25221) [DEPLOY] Consistent trailing whitespace treatment of conf values

2018-09-11 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25221?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin reassigned SPARK-25221: -- Assignee: Gera Shegalov > [DEPLOY] Consistent trailing whitespace treatment of c

[jira] [Resolved] (SPARK-25221) [DEPLOY] Consistent trailing whitespace treatment of conf values

2018-09-11 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25221?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin resolved SPARK-25221. Resolution: Fixed Fix Version/s: 2.4.0 Issue resolved by pull request 22213 [https

Re: How to deploy generic Spark applications via Livy using the Java client

2018-09-10 Thread Marcelo Vanzin
If your "orchestrator engine" is receiving pre-built apps from others and needs to execute them in the cluster, you could just use Livy's batch API. I don't think there are Java bindings for that, you'd need to talk to the REST endpoints directly. The code you're referring to is for "interactive"
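As a rough illustration of talking to the batch REST endpoint directly, the sketch below builds (but does not send) a `POST /batches` request with Python's standard library. The host name, jar path and class name in the usage note are placeholders, and the helper `build_batch_request` is hypothetical; only the `file`, `className` and `args` body fields are used, so check the Livy REST documentation for your version before relying on it.

```python
import json
import urllib.request


def build_batch_request(livy_url, app_file, class_name=None, args=None):
    """Build a POST /batches request for a pre-built Spark application."""
    # 'file' must be a path the cluster can read, e.g. on HDFS.
    payload = {"file": app_file}
    if class_name:
        payload["className"] = class_name
    if args:
        payload["args"] = list(args)
    return urllib.request.Request(
        url=livy_url.rstrip("/") + "/batches",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
```

For example, `urllib.request.urlopen(build_batch_request("http://livy.example.com:8998", "hdfs:///apps/app.jar", "com.example.Main"))` would submit the job against a reachable Livy server; the response body is JSON describing the new batch.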

[jira] [Resolved] (SPARK-25395) Remove Spark Optional Java API

2018-09-10 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25395?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin resolved SPARK-25395. Resolution: Duplicate > Remove Spark Optional Java

[jira] [Resolved] (SPARK-25091) UNCACHE TABLE, CLEAR CACHE, rdd.unpersist() does not clean up executor memory

2018-09-10 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25091?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin resolved SPARK-25091. Resolution: Duplicate > UNCACHE TABLE, CLEAR CACHE, rdd.unpersist() does not clean

[jira] [Commented] (SPARK-23670) Memory leak of SparkPlanGraphWrapper in sparkUI

2018-09-06 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23670?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16606355#comment-16606355 ] Marcelo Vanzin commented on SPARK-23670: Yep, looks like just huge plans

[jira] [Commented] (SPARK-23670) Memory leak of SparkPlanGraphWrapper in sparkUI

2018-09-06 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23670?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16606343#comment-16606343 ] Marcelo Vanzin commented on SPARK-23670: Do you mind listing

[jira] [Commented] (SPARK-23670) Memory leak of SparkPlanGraphWrapper in sparkUI

2018-09-06 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23670?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16606151#comment-16606151 ] Marcelo Vanzin commented on SPARK-23670: How many instances of the follo

[jira] [Commented] (SPARK-24771) Upgrade AVRO version from 1.7.7 to 1.8

2018-09-05 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24771?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16604877#comment-16604877 ] Marcelo Vanzin commented on SPARK-24771: I ran a couple of our tests

[jira] [Assigned] (SPARK-24415) Stage page aggregated executor metrics wrong when failures

2018-09-05 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24415?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin reassigned SPARK-24415: -- Assignee: Ankur Gupta > Stage page aggregated executor metrics wrong when failu

[jira] [Resolved] (SPARK-24415) Stage page aggregated executor metrics wrong when failures

2018-09-05 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24415?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin resolved SPARK-24415. Resolution: Fixed Fix Version/s: 2.4.0 Issue resolved by pull request 22209 [https

Re: deploy-mode cluster. FileNotFoundException

2018-09-05 Thread Marcelo Vanzin
See SPARK-4160. Long story short: you need to upload the files and jars to some shared storage (like HDFS) manually. On Wed, Sep 5, 2018 at 2:17 AM Guillermo Ortiz Fernández wrote: > > I'm using standalone cluster and the final command I'm trying is: > spark-submit --verbose --deploy-mode cluster

Re: data source api v2 refactoring

2018-09-04 Thread Marcelo Vanzin
Same here, I don't see anything from Wenchen... just replies to him. On Sat, Sep 1, 2018 at 9:31 PM Mridul Muralidharan wrote: > > > Is it only me or are all others getting Wenchen’s mails ? (Obviously Ryan did > :-) ) > I did not see it in the mail thread I received or in archives ... [1] > Won

[jira] [Updated] (SPARK-23781) Merge YARN and Mesos token renewal code

2018-08-31 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23781?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin updated SPARK-23781: --- Description: With the fix for SPARK-23361, the code that handles delegation tokens in Mesos

Re: Nightly Builds in the docs (in spark-nightly/spark-master-bin/latest? Can't seem to find it)

2018-08-31 Thread Marcelo Vanzin
I think there still might be an active job publishing stuff. Here's a pretty recent build from master: https://dist.apache.org/repos/dist/dev/spark/2.4.0-SNAPSHOT-2018_08_31_12_02-32da87d-docs/_site/index.html But it seems only docs are being published, which makes me think it's those builds that

Re: [discuss] replacing SPIP template with Heilmeier's Catechism?

2018-08-31 Thread Marcelo Vanzin
I like the questions (aside maybe from the cost one which perhaps does not matter much here), especially since they encourage explaining things in a more plain language than generally used by specs. But I don't think we can ignore design aspects; it's been my observation that a good portion of SPI

[jira] [Commented] (SPARK-25294) Add integration test for Kerberos

2018-08-31 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25294?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16598975#comment-16598975 ] Marcelo Vanzin commented on SPARK-25294: It might be hard to cover all c

[jira] [Resolved] (SPARK-24909) Spark scheduler can hang when fetch failures, executor lost, task running on lost executor, and multiple stage attempts

2018-08-29 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24909?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin resolved SPARK-24909. Resolution: Fixed Fix Version/s: 2.4.0 Issue resolved by pull request 21976 [https

[jira] [Assigned] (SPARK-24909) Spark scheduler can hang when fetch failures, executor lost, task running on lost executor, and multiple stage attempts

2018-08-29 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24909?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin reassigned SPARK-24909: -- Assignee: Thomas Graves > Spark scheduler can hang when fetch failures, execu

[jira] [Resolved] (SPARK-25004) Add spark.executor.pyspark.memory config to set resource.RLIMIT_AS

2018-08-28 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25004?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin resolved SPARK-25004. Resolution: Fixed Fix Version/s: 2.4.0 Issue resolved by pull request 21977 [https

[jira] [Assigned] (SPARK-25004) Add spark.executor.pyspark.memory config to set resource.RLIMIT_AS

2018-08-28 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25004?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin reassigned SPARK-25004: -- Assignee: Ryan Blue > Add spark.executor.pyspark.memory config to

[jira] [Updated] (SPARK-24704) The order of stages in the DAG graph is incorrect

2018-08-28 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24704?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin updated SPARK-24704: --- Fix Version/s: 2.3.2 > The order of stages in the DAG graph is incorr

[jira] [Resolved] (SPARK-25119) stages in wrong order within job page DAG chart

2018-08-28 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25119?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin resolved SPARK-25119. Resolution: Duplicate > stages in wrong order within job page DAG ch

[jira] [Assigned] (SPARK-23679) uiWebUrl show inproper URL when running on YARN

2018-08-28 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23679?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin reassigned SPARK-23679: -- Assignee: Saisai Shao > uiWebUrl show inproper URL when running on Y

[jira] [Resolved] (SPARK-23679) uiWebUrl show inproper URL when running on YARN

2018-08-28 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23679?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin resolved SPARK-23679. Resolution: Fixed Fix Version/s: 2.4.0 Issue resolved by pull request 22164 [https

Re: [VOTE] SPIP: Executor Plugin (SPARK-24918)

2018-08-28 Thread Marcelo Vanzin
just about all your code > needs this init; I had understood the use cases to be more like "establish > some local config and init for this particular thing I'm doing for this > legacy system". > > On Tue, Aug 28, 2018 at 11:35 AM Marcelo Vanzin wrote: >> >> +1

Re: [VOTE] SPIP: Executor Plugin (SPARK-24918)

2018-08-28 Thread Marcelo Vanzin
+1 Class init is not enough because there is nowhere for you to force a random class to be initialized. This is basically adding that mechanism, instead of forcing people to add hacks using e.g. mapPartitions which don't even cover all scenarios. On Tue, Aug 28, 2018 at 7:09 AM, Sean Owen wrote:
