Spark Issues on ORC

2017-05-26 Thread Dong Joon Hyun
Hi, All. Today, while looking over JIRA issues for Spark 2.2.0 in Apache Spark, I noticed that there are many unresolved community requests and related efforts for `Feature parity for ORC with Parquet`. Some examples I found are the following. I created SPARK-20901 to organize these

Re: Uploading PySpark 2.1.1 to PyPi

2017-05-26 Thread Xiao Li
Hi, Holden, That sounds good to me! Thanks, Xiao 2017-05-23 16:32 GMT-07:00 Holden Karau: > An account already exists, the PMC has the info for it. I think we will > need to wait for the 2.2 artifacts to do the actual PyPI upload because of > the local version string in

Re: New metrics for WindowExec with number of partitions and frames?

2017-05-26 Thread Reynold Xin
That would be useful (number of partitions). On Fri, May 26, 2017 at 3:24 PM Jacek Laskowski wrote: > Hi, > > Currently WindowExec gives no metrics in the web UI's Details for Query > page. > > What do you think about adding the number of partitions and frames? > That could

New metrics for WindowExec with number of partitions and frames?

2017-05-26 Thread Jacek Laskowski
Hi, Currently WindowExec gives no metrics in the web UI's Details for Query page. What do you think about adding the number of partitions and frames? That could certainly be super useful, but I am unsure whether that's the kind of metric Spark SQL shows in the details. Regards, Jacek Laskowski

Re: SQL TIMESTAMP semantics vs. SPARK-18350

2017-05-26 Thread Reynold Xin
That's just my point 4, isn't it? On Fri, May 26, 2017 at 1:07 AM, Ofir Manor wrote: > Reynold, > my point is that Spark should aim to follow the SQL standard instead of > rolling its own type system. > If I understand correctly, the existing implementation is similar to