Re: Spark 3.0 branch cut and code freeze on Jan 31?

2020-01-29 Thread Reynold Xin
Just a reminder - code freeze is coming this Fri ! There can always be exceptions, but those should be exceptions and discussed on a case by case basis rather than becoming the norm. On Tue, Dec 24, 2019 at 4:55 PM, Jungtaek Lim < kabhwan.opensou...@gmail.com > wrote: > > Jan 31 sounds good

Re: Spark 3.0 and ORC 1.6

2020-01-29 Thread Dongjoon Hyun
Hi, David. Thank you for sharing your opinion. I'm also a supporter for ZStandard. Apache Spark 3.0 starts to take advantage of ZStd a lot. 1) Switch the default codec for MapOutputStatus from GZip to ZStd. 2) Add spark.eventLog.compression.codec to allow ZStd. 3) Use Parquet+ZStd

Re: Spark 2.4.5 RC2 Preparation Status

2020-01-29 Thread Dongjoon Hyun
Got it. Thanks! Bests, Dongjoon. On Wed, Jan 29, 2020 at 1:40 PM Sean Owen wrote: > OK, we can wait a tick to confirm there aren't strong objections. > I suppose I'd prefer someone who knows > https://issues.apache.org/jira/browse/SPARK-28344 to confirm it was > either erroneously targeted to

Re: Spark 2.4.5 RC2 Preparation Status

2020-01-29 Thread Sean Owen
OK, we can wait a tick to confirm there aren't strong objections. I suppose I'd prefer someone who knows https://issues.apache.org/jira/browse/SPARK-28344 to confirm it was either erroneously targeted to 2.4, or else it's valid, but, not critical for the RC. Hearing nothing else shortly, I'd

Re: Spark 2.4.5 RC2 Preparation Status

2020-01-29 Thread Dongjoon Hyun
Thanks, Sean. If there is no further objection to the mailing list, could you remove the `Target Version: 2.4.5` from the followings? SPARK-28344 Fail the query if detect ambiguous self join SPARK-29578 JDK 1.8.0_232 timezone updates cause "Kwajalein" test failures again Then, after the

Re: Spark 2.4.5 RC2 Preparation Status

2020-01-29 Thread Sean Owen
I have no opinion - just figuring out the status too. I guess I'm asking first, is this the only issue in question? Does nobody object to untargeting it? -> then we are done for 2.4.5, right? If anyone does -> what's the next step to resolving it? I wasn't clear from the JIRA / PR, or whether

Re: Spark 2.4.5 RC2 Preparation Status

2020-01-29 Thread Dongjoon Hyun
Great. Sean. Then, what is your criteria to remove the targeting it from 2.4.5? It doesn't depend on `Who`, right? Bests, Dongjoon. On Wed, Jan 29, 2020 at 9:56 AM Sean Owen wrote: > OK what if anything is in question for 2.4.5? I don't see anything open > and targeted for it. > Are we

Re: Spark 2.4.5 RC2 Preparation Status

2020-01-29 Thread Sean Owen
OK what if anything is in question for 2.4.5? I don't see anything open and targeted for it. Are we talking about https://issues.apache.org/jira/browse/SPARK-28344 - targeted for 2.4.5 but not backported, and a 'correctness' issue? Simply: who argues this must hold up 2.4.5, and if so what's the

Problems during upgrade 2.2.2 -> 2.4.4

2020-01-29 Thread Behroz Sikander
I already posted the question on the users mailing list but I was suggested to post here. http://apache-spark-user-list.1001560.n3.nabble.com/Problems-during-upgrade-2-2-2-gt-2-4-4-td36792.html Note: The problems regarding the application running on top are understandable and this issue is not