Update Public Documentation - SparkSession instead of SparkContext

2017-02-14 Thread Chetan Khatri
Hello Spark Dev Team, I was working with my team having most of the confusion that why your public documentation is not updated with SparkSession if SparkSession is the ongoing extension and best practice instead of creating sparkcontext. Thanks.

Re: Spark Improvement Proposals

2017-02-14 Thread Cody Koeninger
Thanks for doing that. Given that there are at least 4 different Apache voting processes, "typical Apache vote process" isn't meaningful to me. I think the intention is that in order to pass, it needs at least 3 +1 votes from PMC members *and no -1 votes from PMC members*. But the document

Re: Request for comments: Java 7 removal

2017-02-14 Thread Yuming Wang
There is a way only Spark use Java 8, Hadoop still use Java 7: spark-conf.jpg (58K) By the way, I have a way to install any spark version on CM5.4 - CM5.7 by custom CSD

Re: Request for comments: Java 7 removal

2017-02-14 Thread Koert Kuipers
what about the conversation about dropping scala 2.10? On Fri, Feb 10, 2017 at 11:47 AM, Sean Owen wrote: > As you have seen, there's a WIP PR to implement removal of Java 7 support: > https://github.com/apache/spark/pull/16871 > > I have heard several +1s at

Re: Request for comments: Java 7 removal

2017-02-14 Thread Sean Owen
Yes, that's a key concern about the Java dependency, that its update is a function of the OS packages and those who control them, which is often not the end user. I think that's why this has been delayed a while. My general position is that, of course, someone in that boat can use Spark 2.1.x.

Re: [PYTHON][DISCUSS] Moving to cloudpickle and or Py4J as a dependencies?

2017-02-14 Thread Maciej Szymkiewicz
I don't have any strong views, so just to highlight possible issues: * Based on different issues I've seen there is a substantial amount of users which depend on system wide Python installations. As far as I am aware neither Py4j nor cloudpickle are present in the standard system

Fwd: tylerchap...@yahoo-inc.com is no longer with Yahoo! (was: Dealing with missing columns in SPARK SQL in JSON)

2017-02-14 Thread Aseem Bansal
Can someone please remove tylerchap...@yahoo-inc.com from the mailing list? I was told in a spark JIRA that dev mailing list is the right place to ask for this. -- Forwarded message -- From: Yahoo! No Reply Date: Tue, Feb 14, 2017 at 8:00 PM Subject:

Fwd: Handling Skewness and Heterogeneity

2017-02-14 Thread Anis Nasir
Dear all, Can you please comment on the below mentioned use case. Thanking you in advance Regards, Anis -- Forwarded message - From: Anis Nasir Date: Tue, 14 Feb 2017 at 17:01 Subject: Handling Skewness and Heterogeneity To: Dear

Re: Cannot find checkstyle.xml

2017-02-14 Thread Jakub Dubovsky
Somebody is able to help with this? I am stuck on this in my attempt to help solve issues: SPARK-16599 sparkNB-807 Thanks On Thu, Feb 9, 2017 at 10:18 AM, Jakub Dubovsky <