[jira] [Commented] (SPARK-23485) Kubernetes should support node blacklist

2018-02-23 Thread Imran Rashid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23485?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16375005#comment-16375005 ] Imran Rashid commented on SPARK-23485: -- {quote} I think this is because the general expectation is

[jira] [Commented] (SPARK-21740) DataFrame.write does not work with Phoenix JDBC Driver

2018-02-23 Thread Dheeren Beborrtha (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21740?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16375134#comment-16375134 ] Dheeren Beborrtha commented on SPARK-21740: --- What workaround did you use? DId you modify

[jira] [Resolved] (SPARK-23459) Improve the error message when unknown column is specified in partition columns

2018-02-23 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23459?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li resolved SPARK-23459. - Resolution: Fixed Assignee: Kazuaki Ishizaki Fix Version/s: 2.4.0 > Improve the error

[jira] [Updated] (SPARK-23408) Flaky test: StreamingOuterJoinSuite.left outer early state exclusion on right

2018-02-23 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23408?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-23408: -- Fix Version/s: 2.4.0 > Flaky test: StreamingOuterJoinSuite.left outer early state exclusion on

[jira] [Commented] (SPARK-23502) Support async init of spark context during spark-shell startup

2018-02-23 Thread Sital Kedia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23502?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16375051#comment-16375051 ] Sital Kedia commented on SPARK-23502: - I realized that we are printing the web url link and the

[jira] [Commented] (SPARK-23502) Support async init of spark context during spark-shell startup

2018-02-23 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23502?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16375119#comment-16375119 ] Sean Owen commented on SPARK-23502: --- It does introduce some new cases to deal with, like, what happens

[jira] [Created] (SPARK-23504) Flaky test: RateSourceV2Suite.basic microbatch execution

2018-02-23 Thread Marcelo Vanzin (JIRA)
Marcelo Vanzin created SPARK-23504: -- Summary: Flaky test: RateSourceV2Suite.basic microbatch execution Key: SPARK-23504 URL: https://issues.apache.org/jira/browse/SPARK-23504 Project: Spark

[jira] [Commented] (SPARK-23475) The "stages" page doesn't show any completed stages

2018-02-23 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23475?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16375254#comment-16375254 ] Marcelo Vanzin commented on SPARK-23475: https://github.com/apache/spark/pull/20662 was merged to

[jira] [Created] (SPARK-23505) Flaky test: ParquetQuerySuite

2018-02-23 Thread Marcelo Vanzin (JIRA)
Marcelo Vanzin created SPARK-23505: -- Summary: Flaky test: ParquetQuerySuite Key: SPARK-23505 URL: https://issues.apache.org/jira/browse/SPARK-23505 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-20592) Alter table concatenate is not working as expected.

2018-02-23 Thread Arun Manivannan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20592?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16375305#comment-16375305 ] Arun Manivannan commented on SPARK-20592: - x > Alter table concatenate is not working as

[jira] [Issue Comment Deleted] (SPARK-20592) Alter table concatenate is not working as expected.

2018-02-23 Thread Arun Manivannan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20592?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Arun Manivannan updated SPARK-20592: Comment: was deleted (was: x) > Alter table concatenate is not working as expected. >

[jira] [Created] (SPARK-23506) Add refreshByPath in HiveMetastoreCatalog and invalidByPath in FileStatusCache

2018-02-23 Thread guichaoxian (JIRA)
guichaoxian created SPARK-23506: --- Summary: Add refreshByPath in HiveMetastoreCatalog and invalidByPath in FileStatusCache Key: SPARK-23506 URL: https://issues.apache.org/jira/browse/SPARK-23506

[jira] [Commented] (SPARK-23448) Dataframe returns wrong result when column don't respect datatype

2018-02-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23448?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16375386#comment-16375386 ] Apache Spark commented on SPARK-23448: -- User 'viirya' has created a pull request for this issue:

[jira] [Commented] (SPARK-23485) Kubernetes should support node blacklist

2018-02-23 Thread Yinan Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23485?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16374708#comment-16374708 ] Yinan Li commented on SPARK-23485: -- In the Yarn case, yes, it's possible that a node is missing a jar

[jira] [Created] (SPARK-23498) Accuracy problem in comparison with string and integer

2018-02-23 Thread Kevin Zhang (JIRA)
Kevin Zhang created SPARK-23498: --- Summary: Accuracy problem in comparison with string and integer Key: SPARK-23498 URL: https://issues.apache.org/jira/browse/SPARK-23498 Project: Spark Issue

[jira] [Created] (SPARK-23499) Mesos Cluster Dispatcher should support priority queues to submit drivers

2018-02-23 Thread Pascal GILLET (JIRA)
Pascal GILLET created SPARK-23499: - Summary: Mesos Cluster Dispatcher should support priority queues to submit drivers Key: SPARK-23499 URL: https://issues.apache.org/jira/browse/SPARK-23499 Project:

[jira] [Updated] (SPARK-23499) Mesos Cluster Dispatcher should support priority queues to submit drivers

2018-02-23 Thread Pascal GILLET (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23499?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Pascal GILLET updated SPARK-23499: -- Description: As for Yarn, Mesos users should be able to specify priority queues to define a

[jira] [Created] (SPARK-23500) Filters on named_structs could be pushed into scans

2018-02-23 Thread Henry Robinson (JIRA)
Henry Robinson created SPARK-23500: -- Summary: Filters on named_structs could be pushed into scans Key: SPARK-23500 URL: https://issues.apache.org/jira/browse/SPARK-23500 Project: Spark

[jira] [Commented] (SPARK-23495) Creating a json file using a dataframe Generates an issue

2018-02-23 Thread AIT OUFKIR (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23495?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16374490#comment-16374490 ] AIT OUFKIR commented on SPARK-23495: After several checks, I noticed that the issue comes from the

[jira] [Updated] (SPARK-23494) Expose InferSchema's functionalities to the outside

2018-02-23 Thread David Courtinot (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23494?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] David Courtinot updated SPARK-23494: Description: I'm proposing that InferSchema's internals (infer the schema of each record,

[jira] [Updated] (SPARK-23498) Accuracy problem in comparison with string and integer

2018-02-23 Thread Kevin Zhang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23498?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kevin Zhang updated SPARK-23498: Description: While comparing a string column with integer value, spark sql will automatically

[jira] [Commented] (SPARK-23485) Kubernetes should support node blacklist

2018-02-23 Thread Anirudh Ramanathan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23485?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16374745#comment-16374745 ] Anirudh Ramanathan commented on SPARK-23485: While mostly I think that K8s would be better

[jira] [Comment Edited] (SPARK-23485) Kubernetes should support node blacklist

2018-02-23 Thread Anirudh Ramanathan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23485?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16374745#comment-16374745 ] Anirudh Ramanathan edited comment on SPARK-23485 at 2/23/18 6:00 PM: -

[jira] [Commented] (SPARK-23485) Kubernetes should support node blacklist

2018-02-23 Thread Imran Rashid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23485?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16374748#comment-16374748 ] Imran Rashid commented on SPARK-23485: -- ok the missing jar was a bad example on kubernetes ... I

[jira] [Commented] (SPARK-23485) Kubernetes should support node blacklist

2018-02-23 Thread Yinan Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23485?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16374757#comment-16374757 ] Yinan Li commented on SPARK-23485: -- It's not that I'm too confident on the capability of Kubernetes to

[jira] [Created] (SPARK-23493) insert-into depends on columns order, otherwise incorrect data inserted

2018-02-23 Thread Xiaoju Wu (JIRA)
Xiaoju Wu created SPARK-23493: - Summary: insert-into depends on columns order, otherwise incorrect data inserted Key: SPARK-23493 URL: https://issues.apache.org/jira/browse/SPARK-23493 Project: Spark

[jira] [Commented] (SPARK-9278) DataFrameWriter.insertInto inserts incorrect data

2018-02-23 Thread Xiaoju Wu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9278?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16374073#comment-16374073 ] Xiaoju Wu commented on SPARK-9278: -- Created a new ticket to trace this issue SPARK-23493 >

[jira] [Commented] (SPARK-23493) insert-into depends on columns order, otherwise incorrect data inserted

2018-02-23 Thread Xiaoju Wu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23493?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16374076#comment-16374076 ] Xiaoju Wu commented on SPARK-23493: --- This issue is similar with the issue described in ticket 

[jira] [Updated] (SPARK-23493) insert-into depends on columns order, otherwise incorrect data inserted

2018-02-23 Thread Xiaoju Wu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23493?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiaoju Wu updated SPARK-23493: -- Description: insert-into only works when the partitionby key columns are set at last: val data = Seq(

[jira] [Updated] (SPARK-23494) Expose InferSchema's functionalities to the outside

2018-02-23 Thread David Courtinot (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23494?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] David Courtinot updated SPARK-23494: Description: I'm proposing that InferSchema's internals (infer the schema of each record,

[jira] [Updated] (SPARK-23494) Expose InferSchema's functionalities to the outside

2018-02-23 Thread David Courtinot (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23494?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] David Courtinot updated SPARK-23494: Description: I'm proposing that InferSchema's internals (infer the schema of each record,

[jira] [Commented] (SPARK-23496) Locality of coalesced partitions can be severely skewed by the order of input partitions

2018-02-23 Thread Ala Luszczak (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23496?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16374506#comment-16374506 ] Ala Luszczak commented on SPARK-23496: -- I agree that this solution is merely making the problem

[jira] [Commented] (SPARK-23496) Locality of coalesced partitions can be severely skewed by the order of input partitions

2018-02-23 Thread Marco Gaido (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23496?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16374530#comment-16374530 ] Marco Gaido commented on SPARK-23496: - [~ala.luszczak] thanks for your answer. Honestly I don't see

[jira] [Created] (SPARK-23497) Sparklyr Applications doesn't disconnect spark driver in client mode

2018-02-23 Thread bharath kumar (JIRA)
bharath kumar created SPARK-23497: - Summary: Sparklyr Applications doesn't disconnect spark driver in client mode Key: SPARK-23497 URL: https://issues.apache.org/jira/browse/SPARK-23497 Project:

[jira] [Commented] (SPARK-23485) Kubernetes should support node blacklist

2018-02-23 Thread Imran Rashid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23485?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16374599#comment-16374599 ] Imran Rashid commented on SPARK-23485: -- Yeah I don't think its safe to assume that its kubernetes

[jira] [Created] (SPARK-23501) Refactor AllStagesPage in order to avoid redundant code

2018-02-23 Thread Marco Gaido (JIRA)
Marco Gaido created SPARK-23501: --- Summary: Refactor AllStagesPage in order to avoid redundant code Key: SPARK-23501 URL: https://issues.apache.org/jira/browse/SPARK-23501 Project: Spark Issue

[jira] [Assigned] (SPARK-23499) Mesos Cluster Dispatcher should support priority queues to submit drivers

2018-02-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23499?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23499: Assignee: Apache Spark > Mesos Cluster Dispatcher should support priority queues to

[jira] [Commented] (SPARK-23499) Mesos Cluster Dispatcher should support priority queues to submit drivers

2018-02-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23499?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16374768#comment-16374768 ] Apache Spark commented on SPARK-23499: -- User 'pgillet' has created a pull request for this issue:

[jira] [Assigned] (SPARK-23499) Mesos Cluster Dispatcher should support priority queues to submit drivers

2018-02-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23499?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23499: Assignee: (was: Apache Spark) > Mesos Cluster Dispatcher should support priority

[jira] [Commented] (SPARK-23501) Refactor AllStagesPage in order to avoid redundant code

2018-02-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23501?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16374775#comment-16374775 ] Apache Spark commented on SPARK-23501: -- User 'mgaido91' has created a pull request for this issue:

[jira] [Assigned] (SPARK-23501) Refactor AllStagesPage in order to avoid redundant code

2018-02-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23501?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23501: Assignee: (was: Apache Spark) > Refactor AllStagesPage in order to avoid redundant

[jira] [Assigned] (SPARK-23501) Refactor AllStagesPage in order to avoid redundant code

2018-02-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23501?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23501: Assignee: Apache Spark > Refactor AllStagesPage in order to avoid redundant code >

[jira] [Assigned] (SPARK-20680) Spark-sql do not support for void column datatype of view

2018-02-23 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20680?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin reassigned SPARK-20680: -- Assignee: Marcelo Vanzin > Spark-sql do not support for void column datatype of view

[jira] [Commented] (SPARK-23471) RandomForestClassificationModel save() - incorrect metadata

2018-02-23 Thread Keepun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23471?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16374830#comment-16374830 ] Keepun commented on SPARK-23471: Saved to AWS s3:// > RandomForestClassificationModel save() - incorrect

[jira] [Commented] (SPARK-5377) Dynamically add jar into Spark Driver's classpath.

2018-02-23 Thread Xuefu Zhang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5377?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16374841#comment-16374841 ] Xuefu Zhang commented on SPARK-5377: [~shay_elbaz] I think the issue was closed purely because no one

[jira] [Created] (SPARK-23503) continuous execution should sequence committed epochs

2018-02-23 Thread Jose Torres (JIRA)
Jose Torres created SPARK-23503: --- Summary: continuous execution should sequence committed epochs Key: SPARK-23503 URL: https://issues.apache.org/jira/browse/SPARK-23503 Project: Spark Issue

[jira] [Created] (SPARK-23502) Support async init of spark context during spark-shell startup

2018-02-23 Thread Sital Kedia (JIRA)
Sital Kedia created SPARK-23502: --- Summary: Support async init of spark context during spark-shell startup Key: SPARK-23502 URL: https://issues.apache.org/jira/browse/SPARK-23502 Project: Spark

[jira] [Commented] (SPARK-18057) Update structured streaming kafka from 10.0.1 to 10.2.0

2018-02-23 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-18057?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16374872#comment-16374872 ] Sönke Liebau commented on SPARK-18057: -- Alright. I've got a 12h flight to SF ahead of me on Sunday,

[jira] [Commented] (SPARK-23485) Kubernetes should support node blacklist

2018-02-23 Thread Stavros Kontopoulos (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23485?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16374778#comment-16374778 ] Stavros Kontopoulos commented on SPARK-23485: - How about locality preferences + a hardware

[jira] [Updated] (SPARK-23500) Filters on named_structs could be pushed into scans

2018-02-23 Thread Henry Robinson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23500?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Henry Robinson updated SPARK-23500: --- Description: Simple filters on dataframes joined with {{joinWith()}} are missing an

[jira] [Assigned] (SPARK-20680) Spark-sql do not support for void column datatype of view

2018-02-23 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20680?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin reassigned SPARK-20680: -- Assignee: (was: Marcelo Vanzin) > Spark-sql do not support for void column

[jira] [Updated] (SPARK-23471) RandomForestClassificationModel save() - incorrect metadata

2018-02-23 Thread Keepun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23471?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Keepun updated SPARK-23471: --- Description: RandomForestClassificationMode.load() does not work after save() {code:java}

[jira] [Comment Edited] (SPARK-23485) Kubernetes should support node blacklist

2018-02-23 Thread Stavros Kontopoulos (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23485?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16374778#comment-16374778 ] Stavros Kontopoulos edited comment on SPARK-23485 at 2/23/18 6:59 PM:

[jira] [Comment Edited] (SPARK-23485) Kubernetes should support node blacklist

2018-02-23 Thread Yinan Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23485?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16374757#comment-16374757 ] Yinan Li edited comment on SPARK-23485 at 2/23/18 6:22 PM: --- It's not that I'm 

[jira] [Comment Edited] (SPARK-23485) Kubernetes should support node blacklist

2018-02-23 Thread Stavros Kontopoulos (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23485?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16374778#comment-16374778 ] Stavros Kontopoulos edited comment on SPARK-23485 at 2/23/18 6:36 PM:

[jira] [Commented] (SPARK-23485) Kubernetes should support node blacklist

2018-02-23 Thread Anirudh Ramanathan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23485?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16374884#comment-16374884 ] Anirudh Ramanathan commented on SPARK-23485: Stavros - we [do currently

[jira] [Comment Edited] (SPARK-23485) Kubernetes should support node blacklist

2018-02-23 Thread Anirudh Ramanathan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23485?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16374884#comment-16374884 ] Anirudh Ramanathan edited comment on SPARK-23485 at 2/23/18 7:36 PM: -

[jira] [Resolved] (SPARK-23408) Flaky test: StreamingOuterJoinSuite.left outer early state exclusion on right

2018-02-23 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23408?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das resolved SPARK-23408. --- Resolution: Fixed Fix Version/s: 3.0.0 Issue resolved by pull request 20650

[jira] [Assigned] (SPARK-23408) Flaky test: StreamingOuterJoinSuite.left outer early state exclusion on right

2018-02-23 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23408?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das reassigned SPARK-23408: - Assignee: Tathagata Das > Flaky test: StreamingOuterJoinSuite.left outer early state

[jira] [Commented] (SPARK-21550) approxQuantiles throws "next on empty iterator" on empty data

2018-02-23 Thread Javier (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21550?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16374218#comment-16374218 ] Javier commented on SPARK-21550: I still observe this behavior in 2.2.0 when approxQuantile is applied to

[jira] [Comment Edited] (SPARK-21550) approxQuantiles throws "next on empty iterator" on empty data

2018-02-23 Thread Javier (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21550?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16374218#comment-16374218 ] Javier edited comment on SPARK-21550 at 2/23/18 10:58 AM: -- I still observe this

[jira] [Comment Edited] (SPARK-21550) approxQuantiles throws "next on empty iterator" on empty data

2018-02-23 Thread Javier (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21550?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16374218#comment-16374218 ] Javier edited comment on SPARK-21550 at 2/23/18 10:59 AM: -- I still observe this

[jira] [Updated] (SPARK-23350) [SS]Exception when stopping continuous processing application

2018-02-23 Thread Wang Yanlin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23350?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wang Yanlin updated SPARK-23350: Attachment: TaskScheduler_stop.png > [SS]Exception when stopping continuous processing application

[jira] [Updated] (SPARK-23350) [SS]Exception when stopping continuous processing application

2018-02-23 Thread Wang Yanlin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23350?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wang Yanlin updated SPARK-23350: Attachment: (was: TaskScheduler_stop.png) > [SS]Exception when stopping continuous processing

[jira] [Commented] (SPARK-23493) insert-into depends on columns order, otherwise incorrect data inserted

2018-02-23 Thread Xiaoju Wu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23493?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16374266#comment-16374266 ] Xiaoju Wu commented on SPARK-23493: --- If that's the case, it should throw an exception to tell the users

[jira] [Updated] (SPARK-23350) [SS]Exception when stopping continuous processing application

2018-02-23 Thread Wang Yanlin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23350?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wang Yanlin updated SPARK-23350: Attachment: TaskScheduler_stop.png > [SS]Exception when stopping continuous processing application

[jira] [Commented] (SPARK-23350) [SS]Exception when stopping continuous processing application

2018-02-23 Thread Wang Yanlin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23350?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16374236#comment-16374236 ] Wang Yanlin commented on SPARK-23350: - add the sequence flow for explainning this error

[jira] [Commented] (SPARK-23493) insert-into depends on columns order, otherwise incorrect data inserted

2018-02-23 Thread Marco Gaido (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23493?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16374258#comment-16374258 ] Marco Gaido commented on SPARK-23493: - I don't think so. Partition columns are always at the end. If

[jira] [Commented] (SPARK-23475) The "stages" page doesn't show any completed stages

2018-02-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23475?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16374139#comment-16374139 ] Apache Spark commented on SPARK-23475: -- User 'mgaido91' has created a pull request for this issue:

[jira] [Commented] (SPARK-23493) insert-into depends on columns order, otherwise incorrect data inserted

2018-02-23 Thread Marco Gaido (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23493?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16374146#comment-16374146 ] Marco Gaido commented on SPARK-23493: - I don't think this is an issue. I think this is the expected

[jira] [Commented] (SPARK-23493) insert-into depends on columns order, otherwise incorrect data inserted

2018-02-23 Thread Xiaoju Wu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23493?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16374172#comment-16374172 ] Xiaoju Wu commented on SPARK-23493: --- [~mgaido] "Columns are matched in order while inserting" This is

[jira] [Commented] (SPARK-23405) The task will hang up when a small table left semi join a big table

2018-02-23 Thread KaiXinXIaoLei (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23405?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16374182#comment-16374182 ] KaiXinXIaoLei commented on SPARK-23405: --- [~q79969786] And if i run 'select ls.cs_order_number from

[jira] [Commented] (SPARK-23493) insert-into depends on columns order, otherwise incorrect data inserted

2018-02-23 Thread Marco Gaido (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23493?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16374358#comment-16374358 ] Marco Gaido commented on SPARK-23493: - How can it know that you are not setting the partition column

[jira] [Commented] (SPARK-23475) The "stages" page doesn't show any completed stages

2018-02-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23475?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16374360#comment-16374360 ] Apache Spark commented on SPARK-23475: -- User 'mgaido91' has created a pull request for this issue:

[jira] [Created] (SPARK-23494) Expose InferSchema's functionalities to the outside

2018-02-23 Thread David Courtinot (JIRA)
David Courtinot created SPARK-23494: --- Summary: Expose InferSchema's functionalities to the outside Key: SPARK-23494 URL: https://issues.apache.org/jira/browse/SPARK-23494 Project: Spark

[jira] [Updated] (SPARK-23495) Creating a json file using a dataframe Generates an issue

2018-02-23 Thread AIT OUFKIR (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23495?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] AIT OUFKIR updated SPARK-23495: --- Summary: Creating a json file using a dataframe Generates an issue (was: Creating a json file using

[jira] [Created] (SPARK-23495) Creating a json file using a dataframe creates an issue

2018-02-23 Thread AIT OUFKIR (JIRA)
AIT OUFKIR created SPARK-23495: -- Summary: Creating a json file using a dataframe creates an issue Key: SPARK-23495 URL: https://issues.apache.org/jira/browse/SPARK-23495 Project: Spark Issue

[jira] [Updated] (SPARK-23495) Creating a json file using a dataframe Generates an issue

2018-02-23 Thread AIT OUFKIR (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23495?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] AIT OUFKIR updated SPARK-23495: --- Description: Issue happen when trying to create json file using a dataframe (see code below) from

[jira] [Updated] (SPARK-23495) Creating a json file using a dataframe Generates an issue

2018-02-23 Thread AIT OUFKIR (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23495?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] AIT OUFKIR updated SPARK-23495: --- Flags: Important Remaining Estimate: 4h Original Estimate: 4h This issue can

[jira] [Created] (SPARK-23496) Locality of coalesced partitions can be severely skewed by the order of input partitions

2018-02-23 Thread Ala Luszczak (JIRA)
Ala Luszczak created SPARK-23496: Summary: Locality of coalesced partitions can be severely skewed by the order of input partitions Key: SPARK-23496 URL: https://issues.apache.org/jira/browse/SPARK-23496

[jira] [Updated] (SPARK-23495) Creating a json file using a dataframe Generates an issue

2018-02-23 Thread AIT OUFKIR (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23495?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] AIT OUFKIR updated SPARK-23495: --- Description: Issue happen when trying to create json file using a dataframe (see code below) from

[jira] [Assigned] (SPARK-23496) Locality of coalesced partitions can be severely skewed by the order of input partitions

2018-02-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23496?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23496: Assignee: Apache Spark > Locality of coalesced partitions can be severely skewed by the

[jira] [Commented] (SPARK-23496) Locality of coalesced partitions can be severely skewed by the order of input partitions

2018-02-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23496?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16374429#comment-16374429 ] Apache Spark commented on SPARK-23496: -- User 'ala' has created a pull request for this issue:

[jira] [Assigned] (SPARK-23496) Locality of coalesced partitions can be severely skewed by the order of input partitions

2018-02-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23496?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23496: Assignee: (was: Apache Spark) > Locality of coalesced partitions can be severely

[jira] [Commented] (SPARK-23496) Locality of coalesced partitions can be severely skewed by the order of input partitions

2018-02-23 Thread Marco Gaido (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23496?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16374439#comment-16374439 ] Marco Gaido commented on SPARK-23496: - I read that the proposed solution is to use random numbers