[jira] [Resolved] (SPARK-21092) Wire SQLConf in logical plan and expressions

2017-06-14 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21092?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-21092. - Resolution: Fixed Fix Version/s: 2.3.0 > Wire SQLConf in logical plan and expressions > --

[jira] [Commented] (SPARK-13333) DataFrame filter + randn + unionAll has bad interaction

2017-06-15 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16050730#comment-16050730 ] Reynold Xin commented on SPARK-1: - What's left in this ticket? Didn't we fix it a

[jira] [Commented] (SPARK-13333) DataFrame filter + randn + unionAll has bad interaction

2017-06-15 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16051013#comment-16051013 ] Reynold Xin commented on SPARK-1: - But this ticket has nothing to do with SQL? >

[jira] [Commented] (SPARK-21102) Refresh command is too aggressive in parsing

2017-06-19 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21102?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16054600#comment-16054600 ] Reynold Xin commented on SPARK-21102: - Can you submit a pull request so we can discus

[jira] [Resolved] (SPARK-21103) QueryPlanConstraints should be part of LogicalPlan

2017-06-20 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21103?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-21103. - Resolution: Fixed Fix Version/s: 2.3.0 > QueryPlanConstraints should be part of LogicalPla

[jira] [Commented] (SPARK-13534) Implement Apache Arrow serializer for Spark DataFrame for use in DataFrame.toPandas

2017-06-22 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13534?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16060328#comment-16060328 ] Reynold Xin commented on SPARK-13534: - Was this done? I thought there are still other

[jira] [Updated] (SPARK-13534) Implement Apache Arrow serializer for Spark DataFrame for use in DataFrame.toPandas

2017-06-22 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13534?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-13534: Issue Type: Sub-task (was: New Feature) Parent: SPARK-21187 > Implement Apache Arrow seria

[jira] [Commented] (SPARK-21187) Complete support for remaining Spark data types in Arrow Converters

2017-06-22 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21187?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16060455#comment-16060455 ] Reynold Xin commented on SPARK-21187: - Does Pandas support array / struct / map? >

[jira] [Commented] (SPARK-14220) Build and test Spark against Scala 2.12

2017-06-22 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14220?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16060469#comment-16060469 ] Reynold Xin commented on SPARK-14220: - I just removed the target version given the am

[jira] [Updated] (SPARK-14220) Build and test Spark against Scala 2.12

2017-06-22 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14220?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-14220: Target Version/s: (was: 2.3.0) > Build and test Spark against Scala 2.12 > --

[jira] [Created] (SPARK-21190) SPIP: Vectorized UDFs for Python

2017-06-23 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-21190: --- Summary: SPIP: Vectorized UDFs for Python Key: SPARK-21190 URL: https://issues.apache.org/jira/browse/SPARK-21190 Project: Spark Issue Type: New Feature

[jira] [Updated] (SPARK-21190) SPIP: Vectorized UDFs for Python

2017-06-23 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21190?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-21190: Description: *Background and Motivation* Python is one of the most popular programming languages

[jira] [Assigned] (SPARK-21190) SPIP: Vectorized UDFs for Python

2017-06-23 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21190?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin reassigned SPARK-21190: --- Assignee: Reynold Xin > SPIP: Vectorized UDFs for Python >

[jira] [Updated] (SPARK-21190) SPIP: Vectorized UDFs in Python

2017-06-23 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21190?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-21190: Summary: SPIP: Vectorized UDFs in Python (was: SPIP: Vectorized UDFs for Python) > SPIP: Vectoriz

[jira] [Updated] (SPARK-21190) SPIP: Vectorized UDFs in Python

2017-06-23 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21190?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-21190: Description: *Background and Motivation* Python is one of the most popular programming languages am

[jira] [Commented] (SPARK-14220) Build and test Spark against Scala 2.12

2017-06-23 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14220?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16060549#comment-16060549 ] Reynold Xin commented on SPARK-14220: - Making it build isn't that much work, but gett

[jira] [Updated] (SPARK-21190) SPIP: Vectorized UDFs in Python

2017-06-23 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21190?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-21190: Description: *Background and Motivation* Python is one of the most popular programming languages am

[jira] [Closed] (SPARK-20817) Benchmark.getProcessorName() returns "Unknown processor" on ppc and 390 platforms

2017-06-23 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20817?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin closed SPARK-20817. --- Resolution: Won't Fix See github discussions. > Benchmark.getProcessorName() returns "Unknown proce

[jira] [Updated] (SPARK-21190) SPIP: Vectorized UDFs in Python

2017-06-23 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21190?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-21190: Description: *Background and Motivation* Python is one of the most popular programming languages am

[jira] [Updated] (SPARK-21190) SPIP: Vectorized UDFs in Python

2017-06-23 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21190?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-21190: Attachment: SPIPVectorizedUDFsforPython (1).pdf > SPIP: Vectorized UDFs in Python > ---

[jira] [Commented] (SPARK-18016) Code Generation: Constant Pool Past Limit for Wide/Nested Dataset

2017-06-24 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18016?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16062207#comment-16062207 ] Reynold Xin commented on SPARK-18016: - Was this merged in 2.1? If yes we should rever

[jira] [Commented] (SPARK-21190) SPIP: Vectorized UDFs in Python

2017-06-26 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21190?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16063515#comment-16063515 ] Reynold Xin commented on SPARK-21190: - [~icexelloss] Thanks. Your proposal brings up

[jira] [Commented] (SPARK-14220) Build and test Spark against Scala 2.12

2017-06-26 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14220?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16063984#comment-16063984 ] Reynold Xin commented on SPARK-14220: - If all those issues have been released than it

[jira] [Closed] (SPARK-18199) Support appending to Parquet files

2017-06-29 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18199?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin closed SPARK-18199. --- Resolution: Invalid I'm closing this as invalid. It is not a good idea to append to an existing file

[jira] [Commented] (SPARK-21190) SPIP: Vectorized UDFs in Python

2017-06-29 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21190?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16069490#comment-16069490 ] Reynold Xin commented on SPARK-21190: - That makes a lot of sense. So to design APIs s

[jira] [Resolved] (SPARK-17924) Consolidate streaming and batch write path

2017-06-30 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17924?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-17924. - Resolution: Fixed Fix Version/s: 2.3.0 > Consolidate streaming and batch write path >

[jira] [Closed] (SPARK-21270) Improvement for memory config.

2017-06-30 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21270?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin closed SPARK-21270. --- Resolution: Won't Fix While I absolutely would love to see this feature, I don't think this is reali

[jira] [Created] (SPARK-21273) Decouple stats propagation from logical plan

2017-06-30 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-21273: --- Summary: Decouple stats propagation from logical plan Key: SPARK-21273 URL: https://issues.apache.org/jira/browse/SPARK-21273 Project: Spark Issue Type: Improv

[jira] [Commented] (SPARK-21274) Implement EXCEPT ALL and INTERSECT ALL

2017-06-30 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21274?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16070890#comment-16070890 ] Reynold Xin commented on SPARK-21274: - Do you want to submit a pull request? > Impl

[jira] [Commented] (SPARK-15533) Deprecate Dataset.explode

2017-07-01 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15533?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16071325#comment-16071325 ] Reynold Xin commented on SPARK-15533: - Just use a star. On Sat, Jul 1, 2017 at 9:33

[jira] [Resolved] (SPARK-21323) Rename sql.catalyst.plans.logical.statsEstimation.Range to ValueInterval

2017-07-06 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21323?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-21323. - Resolution: Fixed Assignee: Gengliang Wang Fix Version/s: 2.3.0 > Rename sql.cata

[jira] [Updated] (SPARK-21190) SPIP: Vectorized UDFs in Python

2017-07-06 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21190?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-21190: Description: *Background and Motivation* Python is one of the most popular programming languages am

[jira] [Commented] (SPARK-21190) SPIP: Vectorized UDFs in Python

2017-07-06 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21190?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16077484#comment-16077484 ] Reynold Xin commented on SPARK-21190: - [~bryanc] Sorry I don't think it makes sense t

[jira] [Commented] (SPARK-18085) Better History Server scalability for many / large applications

2017-07-06 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18085?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16077606#comment-16077606 ] Reynold Xin commented on SPARK-18085: - [~vanzin] seems like this should have a SPIP?

[jira] [Commented] (SPARK-21349) Make TASK_SIZE_TO_WARN_KB configurable

2017-07-10 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21349?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16081128#comment-16081128 ] Reynold Xin commented on SPARK-21349: - cc [~cloud_fan] Shouldn't task metric just be

[jira] [Resolved] (SPARK-21358) Argument of repartitionandsortwithinpartitions at pyspark

2017-07-10 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21358?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-21358. - Resolution: Fixed Assignee: chie hayashida Fix Version/s: 2.3.0 > Argument of rep

[jira] [Commented] (SPARK-20641) Key-value store abstraction and implementation for storing application data

2017-07-11 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20641?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16083452#comment-16083452 ] Reynold Xin commented on SPARK-20641: - BTW why are we not using RocksDB? I saw that y

[jira] [Comment Edited] (SPARK-20641) Key-value store abstraction and implementation for storing application data

2017-07-11 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20641?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16083452#comment-16083452 ] Reynold Xin edited comment on SPARK-20641 at 7/12/17 5:05 AM: -

[jira] [Commented] (SPARK-18085) Better History Server scalability for many / large applications

2017-07-11 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18085?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16083453#comment-16083453 ] Reynold Xin commented on SPARK-18085: - This is just large enough to warrant / deserve

[jira] [Comment Edited] (SPARK-20641) Key-value store abstraction and implementation for storing application data

2017-07-11 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20641?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16083452#comment-16083452 ] Reynold Xin edited comment on SPARK-20641 at 7/12/17 5:06 AM: -

[jira] [Commented] (SPARK-18085) Better History Server scalability for many / large applications

2017-07-12 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18085?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16083612#comment-16083612 ] Reynold Xin commented on SPARK-18085: - That sounds good to me. I don't actually think

[jira] [Updated] (SPARK-18085) SPIP: Better History Server scalability for many / large applications

2017-07-12 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18085?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-18085: Summary: SPIP: Better History Server scalability for many / large applications (was: Better Histor

[jira] [Commented] (SPARK-18085) SPIP: Better History Server scalability for many / large applications

2017-07-12 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18085?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16084277#comment-16084277 ] Reynold Xin commented on SPARK-18085: - You should email dev@ to notify the list about

[jira] [Updated] (SPARK-20236) Overwrite a partitioned data source table should only overwrite related partitions

2017-07-14 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20236?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-20236: Labels: releasenotes (was: ) > Overwrite a partitioned data source table should only overwrite rel

[jira] [Commented] (SPARK-9686) Spark Thrift server doesn't return correct JDBC metadata

2017-07-16 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9686?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16089328#comment-16089328 ] Reynold Xin commented on SPARK-9686: The best way to advance the issue is probably for

[jira] [Commented] (SPARK-18085) SPIP: Better History Server scalability for many / large applications

2017-07-16 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18085?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16089332#comment-16089332 ] Reynold Xin commented on SPARK-18085: - [~vanzin] That's actually not true anymore. It

[jira] [Commented] (SPARK-19842) Informational Referential Integrity Constraints Support in Spark

2017-07-19 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19842?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16093887#comment-16093887 ] Reynold Xin commented on SPARK-19842: - Are you guys doing any work here? > Informati

[jira] [Commented] (SPARK-21485) API Documentation for Spark SQL functions

2017-07-20 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21485?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16095600#comment-16095600 ] Reynold Xin commented on SPARK-21485: - Pretty cool. Would be great to just generate t

[jira] [Resolved] (SPARK-12957) Derive and propagate data constrains in logical plan

2017-07-26 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12957?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-12957. - Resolution: Fixed Fix Version/s: 2.0.0 > Derive and propagate data constrains in logical p

[jira] [Resolved] (SPARK-21485) API Documentation for Spark SQL functions

2017-07-26 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21485?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-21485. - Resolution: Fixed Assignee: Hyukjin Kwon Fix Version/s: 2.3.0 > API Documentation

[jira] [Commented] (SPARK-21551) pyspark's collect fails when getaddrinfo is too slow

2017-07-27 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21551?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16103536#comment-16103536 ] Reynold Xin commented on SPARK-21551: - Do you want to submit a pull request? > pysp

[jira] [Commented] (SPARK-21551) pyspark's collect fails when getaddrinfo is too slow

2017-07-27 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21551?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16103543#comment-16103543 ] Reynold Xin commented on SPARK-21551: - Sure. > pyspark's collect fails when getaddri

[jira] [Created] (SPARK-21619) Fail the execution of canonicalized plans explicitly

2017-08-02 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-21619: --- Summary: Fail the execution of canonicalized plans explicitly Key: SPARK-21619 URL: https://issues.apache.org/jira/browse/SPARK-21619 Project: Spark Issue Type

[jira] [Commented] (SPARK-21619) Fail the execution of canonicalized plans explicitly

2017-08-03 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21619?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16113498#comment-16113498 ] Reynold Xin commented on SPARK-21619: - Canonicalized plan is used for semantic compar

[jira] [Commented] (SPARK-21619) Fail the execution of canonicalized plans explicitly

2017-08-03 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21619?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16113516#comment-16113516 ] Reynold Xin commented on SPARK-21619: - Sorry I don't understand your question or poin

[jira] [Commented] (SPARK-21619) Fail the execution of canonicalized plans explicitly

2017-08-03 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21619?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16113536#comment-16113536 ] Reynold Xin commented on SPARK-21619: - Mark that's a great point but you are going in

[jira] [Commented] (SPARK-21619) Fail the execution of canonicalized plans explicitly

2017-08-03 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21619?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16113538#comment-16113538 ] Reynold Xin commented on SPARK-21619: - Also self-joins are very difficult to handle.

[jira] [Commented] (SPARK-21619) Fail the execution of canonicalized plans explicitly

2017-08-03 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21619?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16113579#comment-16113579 ] Reynold Xin commented on SPARK-21619: - Ok so we are good with this one. Sorry I don'

[jira] [Commented] (SPARK-21619) Fail the execution of canonicalized plans explicitly

2017-08-03 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21619?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16113593#comment-16113593 ] Reynold Xin commented on SPARK-21619: - Just generate different physical plan? > F

[jira] [Commented] (SPARK-21619) Fail the execution of canonicalized plans explicitly

2017-08-03 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21619?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16113598#comment-16113598 ] Reynold Xin commented on SPARK-21619: - Just look at structured streaming. That eould

[jira] [Created] (SPARK-21634) Change OneRowRelation from a case object to case class

2017-08-03 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-21634: --- Summary: Change OneRowRelation from a case object to case class Key: SPARK-21634 URL: https://issues.apache.org/jira/browse/SPARK-21634 Project: Spark Issue Ty

[jira] [Resolved] (SPARK-16220) Revert ShowFunctions/ListFunctions in 2.0 to Reflect 1.6 Functionality

2016-06-27 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16220?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-16220. - Resolution: Fixed Assignee: Herman van Hovell Fix Version/s: 2.0.1 > Revert ShowF

[jira] [Resolved] (SPARK-16111) Hide SparkOrcNewRecordReader in API docs

2016-06-27 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16111?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-16111. - Resolution: Fixed Assignee: Dongjoon Hyun Fix Version/s: 2.0.0 > Hide SparkOrcNew

[jira] [Created] (SPARK-16248) Whitelist the list of Hive fallback functions

2016-06-27 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-16248: --- Summary: Whitelist the list of Hive fallback functions Key: SPARK-16248 URL: https://issues.apache.org/jira/browse/SPARK-16248 Project: Spark Issue Type: Impro

[jira] [Updated] (SPARK-16248) Whitelist the list of Hive fallback functions

2016-06-27 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16248?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-16248: Description: This patch removes the blind fallback into Hive for functions. Instead, it creates a

[jira] [Resolved] (SPARK-16202) Misleading Description of CreatableRelationProvider's createRelation

2016-06-27 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16202?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-16202. - Resolution: Fixed Assignee: Xiao Li Fix Version/s: 2.1.0 > Misleading Description

[jira] [Commented] (SPARK-16264) Allow the user to use operators on the received DataFrame

2016-06-28 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16264?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15353660#comment-15353660 ] Reynold Xin commented on SPARK-16264: - I actually think the sink interface just shoul

[jira] [Resolved] (SPARK-16259) Cleanup options for DataFrame reader API in Python

2016-06-28 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16259?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-16259. - Resolution: Fixed Fix Version/s: 2.1.0 > Cleanup options for DataFrame reader API in Pytho

[jira] [Closed] (SPARK-16081) Disallow using "l" as variable name

2016-06-28 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16081?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin closed SPARK-16081. --- Resolution: Won't Fix See discussion on pr https://github.com/apache/spark/pull/13915 > Disallow usi

[jira] [Resolved] (SPARK-16236) Add Path Option back to Load API in DataFrameReader

2016-06-28 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16236?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-16236. - Resolution: Fixed Assignee: Xiao Li Fix Version/s: 2.0.0 > Add Path Option back t

[jira] [Resolved] (SPARK-16248) Whitelist the list of Hive fallback functions

2016-06-28 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16248?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-16248. - Resolution: Fixed Fix Version/s: 2.0.0 > Whitelist the list of Hive fallback functions > -

[jira] [Resolved] (SPARK-16271) Implement Hive's UDFXPathUtil

2016-06-28 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16271?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-16271. - Resolution: Fixed Assignee: Peter Lee Fix Version/s: 2.1.0 > Implement Hive's UDF

[jira] [Created] (SPARK-16275) Implement all the Hive fallback functions

2016-06-28 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-16275: --- Summary: Implement all the Hive fallback functions Key: SPARK-16275 URL: https://issues.apache.org/jira/browse/SPARK-16275 Project: Spark Issue Type: New Featu

[jira] [Created] (SPARK-16277) Implement java_method SQL function

2016-06-28 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-16277: --- Summary: Implement java_method SQL function Key: SPARK-16277 URL: https://issues.apache.org/jira/browse/SPARK-16277 Project: Spark Issue Type: Sub-task

[jira] [Created] (SPARK-16276) Implement elt SQL function

2016-06-28 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-16276: --- Summary: Implement elt SQL function Key: SPARK-16276 URL: https://issues.apache.org/jira/browse/SPARK-16276 Project: Spark Issue Type: Sub-task Rep

[jira] [Created] (SPARK-16279) Implement map_values SQL function

2016-06-28 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-16279: --- Summary: Implement map_values SQL function Key: SPARK-16279 URL: https://issues.apache.org/jira/browse/SPARK-16279 Project: Spark Issue Type: Sub-task

[jira] [Created] (SPARK-16278) Implement map_keys SQL function

2016-06-28 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-16278: --- Summary: Implement map_keys SQL function Key: SPARK-16278 URL: https://issues.apache.org/jira/browse/SPARK-16278 Project: Spark Issue Type: Sub-task

[jira] [Created] (SPARK-16280) Implement histogram_numeric SQL function

2016-06-28 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-16280: --- Summary: Implement histogram_numeric SQL function Key: SPARK-16280 URL: https://issues.apache.org/jira/browse/SPARK-16280 Project: Spark Issue Type: Sub-task

[jira] [Created] (SPARK-16281) Implement parse_url SQL function

2016-06-28 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-16281: --- Summary: Implement parse_url SQL function Key: SPARK-16281 URL: https://issues.apache.org/jira/browse/SPARK-16281 Project: Spark Issue Type: Sub-task

[jira] [Created] (SPARK-16283) Implement percentile_approx SQL function

2016-06-28 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-16283: --- Summary: Implement percentile_approx SQL function Key: SPARK-16283 URL: https://issues.apache.org/jira/browse/SPARK-16283 Project: Spark Issue Type: Sub-task

[jira] [Created] (SPARK-16282) Implement percentile SQL function

2016-06-28 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-16282: --- Summary: Implement percentile SQL function Key: SPARK-16282 URL: https://issues.apache.org/jira/browse/SPARK-16282 Project: Spark Issue Type: Sub-task

[jira] [Created] (SPARK-16287) Implement str_to_map SQL function

2016-06-28 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-16287: --- Summary: Implement str_to_map SQL function Key: SPARK-16287 URL: https://issues.apache.org/jira/browse/SPARK-16287 Project: Spark Issue Type: Sub-task

[jira] [Created] (SPARK-16284) Implement reflect SQL function

2016-06-28 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-16284: --- Summary: Implement reflect SQL function Key: SPARK-16284 URL: https://issues.apache.org/jira/browse/SPARK-16284 Project: Spark Issue Type: Sub-task

[jira] [Created] (SPARK-16288) Implement inline table generating function

2016-06-28 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-16288: --- Summary: Implement inline table generating function Key: SPARK-16288 URL: https://issues.apache.org/jira/browse/SPARK-16288 Project: Spark Issue Type: Sub-task

[jira] [Created] (SPARK-16289) Implement posexplode table generating function

2016-06-28 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-16289: --- Summary: Implement posexplode table generating function Key: SPARK-16289 URL: https://issues.apache.org/jira/browse/SPARK-16289 Project: Spark Issue Type: Sub-

[jira] [Created] (SPARK-16285) Implement sentences SQL function

2016-06-28 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-16285: --- Summary: Implement sentences SQL function Key: SPARK-16285 URL: https://issues.apache.org/jira/browse/SPARK-16285 Project: Spark Issue Type: Sub-task

[jira] [Created] (SPARK-16286) Implement stack SQL function

2016-06-28 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-16286: --- Summary: Implement stack SQL function Key: SPARK-16286 URL: https://issues.apache.org/jira/browse/SPARK-16286 Project: Spark Issue Type: Sub-task R

[jira] [Updated] (SPARK-16286) Implement stack table generating function

2016-06-28 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16286?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-16286: Summary: Implement stack table generating function (was: Implement stack SQL function) > Implemen

[jira] [Commented] (SPARK-16275) Implement all the Hive fallback functions

2016-06-28 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16275?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15354614#comment-15354614 ] Reynold Xin commented on SPARK-16275: - cc [~dongjoon] maybe you can help with some of

[jira] [Resolved] (SPARK-14480) Remove meaningless StringIteratorReader for CSV data source for better performance

2016-06-29 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14480?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-14480. - Resolution: Fixed Assignee: Hyukjin Kwon Fix Version/s: 2.1.0 > Remove meaningles

[jira] [Updated] (SPARK-16044) input_file_name() returns empty strings in data sources based on NewHadoopRDD.

2016-06-29 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16044?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-16044: Fix Version/s: 1.6.3 > input_file_name() returns empty strings in data sources based on NewHadoopRD

[jira] [Created] (SPARK-16304) LinkageError should not crash Spark executor

2016-06-29 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-16304: --- Summary: LinkageError should not crash Spark executor Key: SPARK-16304 URL: https://issues.apache.org/jira/browse/SPARK-16304 Project: Spark Issue Type: Bug

[jira] [Created] (SPARK-16305) LinkageError should not crash Spark executor

2016-06-29 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-16305: --- Summary: LinkageError should not crash Spark executor Key: SPARK-16305 URL: https://issues.apache.org/jira/browse/SPARK-16305 Project: Spark Issue Type: Bug

[jira] [Resolved] (SPARK-16006) Attemping to write empty DataFrame with no fields throw non-intuitive exception

2016-06-29 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16006?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-16006. - Resolution: Fixed Assignee: Dongjoon Hyun Fix Version/s: 2.0.0 > Attemping to wri

[jira] [Resolved] (SPARK-16228) "Percentile" needs explicit cast to double

2016-06-29 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16228?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-16228. - Resolution: Fixed Assignee: Dongjoon Hyun Fix Version/s: 2.0.0 > "Percentile" nee

[jira] [Resolved] (SPARK-16267) Replace deprecated `CREATE TEMPORARY TABLE` from testsuites

2016-06-29 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16267?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-16267. - Resolution: Fixed Assignee: Dongjoon Hyun Fix Version/s: 2.0.0 > Replace deprecat

[jira] [Updated] (SPARK-16311) "refresh" should work on temporary tables or views or Dataset/DataFrame

2016-06-29 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16311?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-16311: Summary: "refresh" should work on temporary tables or views or Dataset/DataFrame (was: "refresh" s

[jira] [Updated] (SPARK-16311) Improve metadata refresh

2016-06-29 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16311?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-16311: Summary: Improve metadata refresh (was: "refresh" should work on temporary tables or views or Data

[jira] [Updated] (SPARK-16311) "refresh" should work on temporary tables or views or Dataset/DataFrame

2016-06-29 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16311?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-16311: Description: When the underlying file changes, it can be very confusing to users The refresh com

[jira] [Updated] (SPARK-16311) Improve metadata refresh

2016-06-29 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16311?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-16311: Description: When the underlying file changes, it can be very confusing to users when they see a F

  1   2   3   4   5   6   7   8   9   10   >