[jira] [Assigned] (SPARK-22110) Enhance function description trim string function

2017-09-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22110?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22110: Assignee: Apache Spark > Enhance function description trim string function >

[jira] [Assigned] (SPARK-22110) Enhance function description trim string function

2017-09-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22110?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22110: Assignee: (was: Apache Spark) > Enhance function description trim string function >

[jira] [Commented] (SPARK-22110) Enhance function description trim string function

2017-09-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22110?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16177497#comment-16177497 ] Apache Spark commented on SPARK-22110: -- User 'kevinyu98' has created a pull request for this issue:

[jira] [Closed] (SPARK-22085) When the applicationhas no core left,do not request more executors from the cluster manager in ExecutorAllocationManager

2017-09-22 Thread liuxian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22085?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] liuxian closed SPARK-22085. --- Resolution: Invalid > When the applicationhas no core left,do not request more executors from the >

[jira] [Resolved] (SPARK-22060) CrossValidator/TrainValidationSplit parallelism param persist/load bug

2017-09-22 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22060?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-22060. --- Resolution: Fixed Fix Version/s: 2.3.0 Issue resolved by pull request 19278

[jira] [Commented] (SPARK-22088) Incorrect scalastyle comment causes wrong styles in stringExpressions

2017-09-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22088?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16177365#comment-16177365 ] Apache Spark commented on SPARK-22088: -- User 'kevinyu98' has created a pull request for this issue:

[jira] [Updated] (SPARK-22110) Enhance function description trim string function

2017-09-22 Thread kevin yu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22110?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] kevin yu updated SPARK-22110: - Description: This JIRA will enhance the function description for string function `trim`, specific for

[jira] [Updated] (SPARK-22110) Enhance function description trim string function

2017-09-22 Thread kevin yu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22110?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] kevin yu updated SPARK-22110: - Description: This JIRA will enhance the function description for string function {code:java} TRIM

[jira] [Updated] (SPARK-22110) Enhance function description trim string function

2017-09-22 Thread kevin yu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22110?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] kevin yu updated SPARK-22110: - Description: This JIRA will enhance the function description for string function `trim` , specific for

[jira] [Created] (SPARK-22110) Enhance function description trim string function

2017-09-22 Thread kevin yu (JIRA)
kevin yu created SPARK-22110: Summary: Enhance function description trim string function Key: SPARK-22110 URL: https://issues.apache.org/jira/browse/SPARK-22110 Project: Spark Issue Type:

[jira] [Commented] (SPARK-19417) spark.files.overwrite is ignored

2017-09-22 Thread Chris Kanich (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19417?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16177276#comment-16177276 ] Chris Kanich commented on SPARK-19417: -- This is my hacked up runtime library loader: {{ #

[jira] [Commented] (SPARK-19417) spark.files.overwrite is ignored

2017-09-22 Thread Devaraj K (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19417?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16177253#comment-16177253 ] Devaraj K commented on SPARK-19417: --- Thanks [~ckanich] for the test case.

[jira] [Created] (SPARK-22109) Reading tables partitioned by columns that look like timestamps has inconsistent schema inference

2017-09-22 Thread Imran Rashid (JIRA)
Imran Rashid created SPARK-22109: Summary: Reading tables partitioned by columns that look like timestamps has inconsistent schema inference Key: SPARK-22109 URL: https://issues.apache.org/jira/browse/SPARK-22109

[jira] [Created] (SPARK-22108) Logical Inconsistency in Timestamp Cast

2017-09-22 Thread Imran Rashid (JIRA)
Imran Rashid created SPARK-22108: Summary: Logical Inconsistency in Timestamp Cast Key: SPARK-22108 URL: https://issues.apache.org/jira/browse/SPARK-22108 Project: Spark Issue Type: Bug

[jira] [Assigned] (SPARK-22107) "as" should be "alias" in python quick start documentation

2017-09-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22107?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22107: Assignee: (was: Apache Spark) > "as" should be "alias" in python quick start

[jira] [Assigned] (SPARK-22107) "as" should be "alias" in python quick start documentation

2017-09-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22107?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22107: Assignee: Apache Spark > "as" should be "alias" in python quick start documentation >

[jira] [Commented] (SPARK-22107) "as" should be "alias" in python quick start documentation

2017-09-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22107?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16177092#comment-16177092 ] Apache Spark commented on SPARK-22107: -- User 'jgoleary' has created a pull request for this issue:

[jira] [Assigned] (SPARK-22106) Remove support for 0-parameter pandas_udfs

2017-09-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22106?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22106: Assignee: Apache Spark > Remove support for 0-parameter pandas_udfs >

[jira] [Assigned] (SPARK-22106) Remove support for 0-parameter pandas_udfs

2017-09-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22106?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22106: Assignee: (was: Apache Spark) > Remove support for 0-parameter pandas_udfs >

[jira] [Commented] (SPARK-22106) Remove support for 0-parameter pandas_udfs

2017-09-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22106?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16177066#comment-16177066 ] Apache Spark commented on SPARK-22106: -- User 'BryanCutler' has created a pull request for this

[jira] [Created] (SPARK-22107) "as" should be "alias" in python quick start documentation

2017-09-22 Thread John O'Leary (JIRA)
John O'Leary created SPARK-22107: Summary: "as" should be "alias" in python quick start documentation Key: SPARK-22107 URL: https://issues.apache.org/jira/browse/SPARK-22107 Project: Spark

[jira] [Commented] (SPARK-22046) Streaming State cannot be scalable

2017-09-22 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22046?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16177009#comment-16177009 ] Shixiong Zhu commented on SPARK-22046: -- I don't know what's the exact issue.

[jira] [Comment Edited] (SPARK-22083) When dropping multiple blocks to disk, Spark should release all locks on a failure

2017-09-22 Thread Imran Rashid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22083?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16176902#comment-16176902 ] Imran Rashid edited comment on SPARK-22083 at 9/22/17 6:35 PM: --- After

[jira] [Commented] (SPARK-22083) When dropping multiple blocks to disk, Spark should release all locks on a failure

2017-09-22 Thread Imran Rashid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22083?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16176902#comment-16176902 ] Imran Rashid commented on SPARK-22083: -- After another look at this, I'm actually not sure why we

[jira] [Commented] (SPARK-19563) advoid unnecessary sort in FileFormatWriter

2017-09-22 Thread Charles Pritchard (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19563?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16176870#comment-16176870 ] Charles Pritchard commented on SPARK-19563: --- Did this make it into Spark 2.1.1? > advoid

[jira] [Commented] (SPARK-19357) Parallel Model Evaluation for ML Tuning: Scala

2017-09-22 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19357?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16176773#comment-16176773 ] Bryan Cutler commented on SPARK-19357: -- [~josephkb] I think trying to push down the parallelism to

[jira] [Commented] (SPARK-22106) Remove support for 0-parameter pandas_udfs

2017-09-22 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22106?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16176714#comment-16176714 ] Bryan Cutler commented on SPARK-22106: -- I'll submit a PR soon for this > Remove support for

[jira] [Resolved] (SPARK-21404) Simple Vectorized Python UDFs using Arrow

2017-09-22 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21404?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bryan Cutler resolved SPARK-21404. -- Resolution: Fixed This has been merged as SPARK-21190 > Simple Vectorized Python UDFs using

[jira] [Closed] (SPARK-21404) Simple Vectorized Python UDFs using Arrow

2017-09-22 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21404?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bryan Cutler closed SPARK-21404. > Simple Vectorized Python UDFs using Arrow > - > >

[jira] [Created] (SPARK-22106) Remove support for 0-parameter pandas_udfs

2017-09-22 Thread Bryan Cutler (JIRA)
Bryan Cutler created SPARK-22106: Summary: Remove support for 0-parameter pandas_udfs Key: SPARK-22106 URL: https://issues.apache.org/jira/browse/SPARK-22106 Project: Spark Issue Type:

[jira] [Comment Edited] (SPARK-22105) Dataframe has poor performance when computing on many columns with codegen

2017-09-22 Thread Kazuaki Ishizaki (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22105?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16176655#comment-16176655 ] Kazuaki Ishizaki edited comment on SPARK-22105 at 9/22/17 4:22 PM: --- Can

[jira] [Updated] (SPARK-22105) Dataframe has poor performance when computing on many columns with codegen

2017-09-22 Thread Weichen Xu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22105?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weichen Xu updated SPARK-22105: --- Description: Suppose we have a dataframe with many columns (e.g 100 columns), each column is

[jira] [Commented] (SPARK-22105) Dataframe has poor performance when computing on many columns with codegen

2017-09-22 Thread Kazuaki Ishizaki (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22105?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16176655#comment-16176655 ] Kazuaki Ishizaki commented on SPARK-22105: -- Can this PR at

[jira] [Updated] (SPARK-22105) Dataframe has poor performance when computing on many columns with codegen

2017-09-22 Thread Weichen Xu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22105?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weichen Xu updated SPARK-22105: --- Priority: Minor (was: Major) > Dataframe has poor performance when computing on many columns with

[jira] [Commented] (SPARK-22105) Dataframe has poor performance when computing on many columns with codegen

2017-09-22 Thread Weichen Xu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22105?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16176650#comment-16176650 ] Weichen Xu commented on SPARK-22105: cc [~mlnick] [~cloud_fan] > Dataframe has poor performance when

[jira] [Created] (SPARK-22105) Dataframe has poor performance when computing on many columns with codegen

2017-09-22 Thread Weichen Xu (JIRA)
Weichen Xu created SPARK-22105: -- Summary: Dataframe has poor performance when computing on many columns with codegen Key: SPARK-22105 URL: https://issues.apache.org/jira/browse/SPARK-22105 Project:

[jira] [Assigned] (SPARK-22103) Move HashAggregateExec parent consume to a separate function in codegen

2017-09-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22103?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22103: Assignee: Apache Spark > Move HashAggregateExec parent consume to a separate function in

[jira] [Assigned] (SPARK-22103) Move HashAggregateExec parent consume to a separate function in codegen

2017-09-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22103?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22103: Assignee: (was: Apache Spark) > Move HashAggregateExec parent consume to a separate

[jira] [Commented] (SPARK-22103) Move HashAggregateExec parent consume to a separate function in codegen

2017-09-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16176597#comment-16176597 ] Apache Spark commented on SPARK-22103: -- User 'juliuszsompolski' has created a pull request for this

[jira] [Updated] (SPARK-22104) Add new option to dataframe -> parquet ==> custom extension to file name

2017-09-22 Thread Anbu Cheeralan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22104?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Anbu Cheeralan updated SPARK-22104: --- Component/s: SQL > Add new option to dataframe -> parquet ==> custom extension to file name

[jira] [Created] (SPARK-22104) Add new option to dataframe -> parquet ==> custom extension to file name

2017-09-22 Thread Anbu Cheeralan (JIRA)
Anbu Cheeralan created SPARK-22104: -- Summary: Add new option to dataframe -> parquet ==> custom extension to file name Key: SPARK-22104 URL: https://issues.apache.org/jira/browse/SPARK-22104

[jira] [Created] (SPARK-22103) Move HashAggregateExec parent consume to a separate function in codegen

2017-09-22 Thread Juliusz Sompolski (JIRA)
Juliusz Sompolski created SPARK-22103: - Summary: Move HashAggregateExec parent consume to a separate function in codegen Key: SPARK-22103 URL: https://issues.apache.org/jira/browse/SPARK-22103

[jira] [Commented] (SPARK-13030) Change OneHotEncoder to Estimator

2017-09-22 Thread Weichen Xu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13030?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16176534#comment-16176534 ] Weichen Xu commented on SPARK-13030: [~josephkb] What about create a new name estimator for this ? It

[jira] [Commented] (SPARK-10884) Support prediction on single instance for regression and classification related models

2017-09-22 Thread Weichen Xu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10884?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16176522#comment-16176522 ] Weichen Xu commented on SPARK-10884: I took this over. Will create new PR soon. Thanks! > Support

[jira] [Updated] (SPARK-16625) Oracle JDBC table creation fails with ORA-00902: invalid datatype

2017-09-22 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16625?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] holdenk updated SPARK-16625: Fix Version/s: 2.1.2 > Oracle JDBC table creation fails with ORA-00902: invalid datatype >

[jira] [Comment Edited] (SPARK-10413) ML models should support prediction on single instances

2017-09-22 Thread Weichen Xu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10413?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16176510#comment-16176510 ] Weichen Xu edited comment on SPARK-10413 at 9/22/17 2:46 PM: -

[jira] [Commented] (SPARK-22092) Reallocation in OffHeapColumnVector.reserveInternal corrupts array data

2017-09-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22092?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16176517#comment-16176517 ] Apache Spark commented on SPARK-22092: -- User 'ala' has created a pull request for this issue:

[jira] [Commented] (SPARK-10413) ML models should support prediction on single instances

2017-09-22 Thread Weichen Xu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10413?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16176510#comment-16176510 ] Weichen Xu commented on SPARK-10413: [~derek.kak...@gmail.com] I have taken this over and will create

[jira] [Closed] (SPARK-22089) There is no need for fileStatusCache to invalidateAll when InMemoryFileIndex refresh

2017-09-22 Thread guichaoxian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22089?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] guichaoxian closed SPARK-22089. --- > There is no need for fileStatusCache to invalidateAll when InMemoryFileIndex > refresh >

[jira] [Updated] (SPARK-22102) Reusing CliSessionState didn't set correct METASTOREWAREHOUSE

2017-09-22 Thread Yuming Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22102?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang updated SPARK-22102: Description: It shows the warehouse dir is

[jira] [Commented] (SPARK-22102) Reusing CliSessionState didn't set correct METASTOREWAREHOUSE

2017-09-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22102?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16176447#comment-16176447 ] Apache Spark commented on SPARK-22102: -- User 'wangyum' has created a pull request for this issue:

[jira] [Assigned] (SPARK-22102) Reusing CliSessionState didn't set correct METASTOREWAREHOUSE

2017-09-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22102?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22102: Assignee: (was: Apache Spark) > Reusing CliSessionState didn't set correct

[jira] [Assigned] (SPARK-22102) Reusing CliSessionState didn't set correct METASTOREWAREHOUSE

2017-09-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22102?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22102: Assignee: Apache Spark > Reusing CliSessionState didn't set correct METASTOREWAREHOUSE >

[jira] [Updated] (SPARK-22097) Request an accurate memory after we unrolled the block

2017-09-22 Thread Xianyang Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22097?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xianyang Liu updated SPARK-22097: - Description: We only need request bbos.size - unrollMemoryUsedByThisBlock after unrolled the

[jira] [Updated] (SPARK-22097) Request an accurate memory after we unrolled the block

2017-09-22 Thread Xianyang Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22097?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xianyang Liu updated SPARK-22097: - Summary: Request an accurate memory after we unrolled the block (was: Call

[jira] [Assigned] (SPARK-21766) DataFrame toPandas() raises ValueError with nullable int columns

2017-09-22 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21766?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon reassigned SPARK-21766: Assignee: Liang-Chi Hsieh > DataFrame toPandas() raises ValueError with nullable int

[jira] [Resolved] (SPARK-21766) DataFrame toPandas() raises ValueError with nullable int columns

2017-09-22 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21766?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-21766. -- Resolution: Fixed Fix Version/s: 2.3.0 Issue resolved by pull request 19319

[jira] [Updated] (SPARK-22102) Reusing CliSessionState didn't set correct METASTOREWAREHOUSE

2017-09-22 Thread Yuming Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22102?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang updated SPARK-22102: Description: {noformat} [root@wangyuming01 spark-2.3.0-SNAPSHOT-bin-2.6.5]# bin/spark-sql

[jira] [Resolved] (SPARK-22092) Reallocation in OffHeapColumnVector.reserveInternal corrupts array data

2017-09-22 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22092?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Herman van Hovell resolved SPARK-22092. --- Resolution: Fixed Assignee: Ala Luszczak Fix Version/s: 2.3.0 >

[jira] [Created] (SPARK-22102) Reusing CliSessionState didn't set correct METASTOREWAREHOUSE

2017-09-22 Thread Yuming Wang (JIRA)
Yuming Wang created SPARK-22102: --- Summary: Reusing CliSessionState didn't set correct METASTOREWAREHOUSE Key: SPARK-22102 URL: https://issues.apache.org/jira/browse/SPARK-22102 Project: Spark

[jira] [Commented] (SPARK-21347) Performance issues with KryoSerializer

2017-09-22 Thread Josh Devins (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21347?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16176327#comment-16176327 ] Josh Devins commented on SPARK-21347: - We were experiencing the same issue but on Spark 2.2.0. On

[jira] [Commented] (SPARK-792) PairRDDFunctions should expect Product2 instead of Tuple2

2017-09-22 Thread Sergei Lebedev (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-792?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16176268#comment-16176268 ] Sergei Lebedev commented on SPARK-792: -- It looks like the change in the title was never implemented.

[jira] [Commented] (SPARK-22101) spark over hbase snapshot

2017-09-22 Thread ulysses you (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22101?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16176185#comment-16176185 ] ulysses you commented on SPARK-22101: - Thank you , i will go to HBase > spark over hbase snapshot >

[jira] [Resolved] (SPARK-22101) spark over hbase snapshot

2017-09-22 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22101?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-22101. --- Resolution: Invalid This is an HBase question, not a Spark JIRA > spark over hbase snapshot >

[jira] [Created] (SPARK-22101) spark over hbase snapshot

2017-09-22 Thread ulysses you (JIRA)
ulysses you created SPARK-22101: --- Summary: spark over hbase snapshot Key: SPARK-22101 URL: https://issues.apache.org/jira/browse/SPARK-22101 Project: Spark Issue Type: Question

[jira] [Commented] (SPARK-22081) Generalized Reduced Error Logistic Regression

2017-09-22 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22081?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16176090#comment-16176090 ] Liang-Chi Hsieh commented on SPARK-22081: - Btw, looks like RELR is patented:

[jira] [Commented] (SPARK-9103) Tracking spark's memory usage

2017-09-22 Thread Saisai Shao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16176073#comment-16176073 ] Saisai Shao commented on SPARK-9103: [~irashid] I would like to hear your suggestion on displaying

[jira] [Assigned] (SPARK-22100) make percentile_approx support numeric/date/timestamp types

2017-09-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22100?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22100: Assignee: (was: Apache Spark) > make percentile_approx support numeric/date/timestamp

[jira] [Commented] (SPARK-22100) make percentile_approx support numeric/date/timestamp types

2017-09-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22100?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16176064#comment-16176064 ] Apache Spark commented on SPARK-22100: -- User 'wzhfy' has created a pull request for this issue:

[jira] [Assigned] (SPARK-22100) make percentile_approx support numeric/date/timestamp types

2017-09-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22100?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22100: Assignee: Apache Spark > make percentile_approx support numeric/date/timestamp types >

[jira] [Created] (SPARK-22100) make percentile_approx support numeric/date/timestamp types

2017-09-22 Thread Zhenhua Wang (JIRA)
Zhenhua Wang created SPARK-22100: Summary: make percentile_approx support numeric/date/timestamp types Key: SPARK-22100 URL: https://issues.apache.org/jira/browse/SPARK-22100 Project: Spark

[jira] [Resolved] (SPARK-22071) Improve release build scripts to check correct JAVA version is being used for build

2017-09-22 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22071?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] holdenk resolved SPARK-22071. - Resolution: Fixed Fix Version/s: 2.1.2 2.3.0 2.2.1 Issue

[jira] [Resolved] (SPARK-22072) Allow the same shell params to be used for all of the different steps in release-build

2017-09-22 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22072?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] holdenk resolved SPARK-22072. - Resolution: Fixed Fix Version/s: 2.1.2 2.3.0 2.2.1 Issue

[jira] [Resolved] (SPARK-21998) SortMergeJoinExec did not calculate its outputOrdering correctly during physical planning

2017-09-22 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21998?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li resolved SPARK-21998. - Resolution: Fixed Assignee: Maryann Xue Fix Version/s: 2.3.0 > SortMergeJoinExec did not

[jira] [Resolved] (SPARK-22089) There is no need for fileStatusCache to invalidateAll when InMemoryFileIndex refresh

2017-09-22 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22089?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-22089. --- Resolution: Not A Problem > There is no need for fileStatusCache to invalidateAll when

[jira] [Reopened] (SPARK-22089) There is no need for fileStatusCache to invalidateAll when InMemoryFileIndex refresh

2017-09-22 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22089?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen reopened SPARK-22089: --- > There is no need for fileStatusCache to invalidateAll when InMemoryFileIndex > refresh >

[jira] [Assigned] (SPARK-22099) The 'job ids' list style needs to be changed in the SQL page.

2017-09-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22099?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22099: Assignee: (was: Apache Spark) > The 'job ids' list style needs to be changed in the

[jira] [Assigned] (SPARK-22099) The 'job ids' list style needs to be changed in the SQL page.

2017-09-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22099?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22099: Assignee: Apache Spark > The 'job ids' list style needs to be changed in the SQL page. >

[jira] [Commented] (SPARK-22099) The 'job ids' list style needs to be changed in the SQL page.

2017-09-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22099?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16176011#comment-16176011 ] Apache Spark commented on SPARK-22099: -- User 'guoxiaolongzte' has created a pull request for this

[jira] [Updated] (SPARK-22099) The 'job ids' list style needs to be changed in the SQL page.

2017-09-22 Thread guoxiaolongzte (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22099?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] guoxiaolongzte updated SPARK-22099: --- Description: The 'job ids' list style needs to be changed in the SQL page The 'job ids'

[jira] [Updated] (SPARK-22099) The 'job ids' list style needs to be changed in the SQL page.

2017-09-22 Thread guoxiaolongzte (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22099?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] guoxiaolongzte updated SPARK-22099: --- Component/s: SQL > The 'job ids' list style needs to be changed in the SQL page. >

[jira] [Commented] (SPARK-21766) DataFrame toPandas() raises ValueError with nullable int columns

2017-09-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21766?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16176000#comment-16176000 ] Apache Spark commented on SPARK-21766: -- User 'viirya' has created a pull request for this issue:

[jira] [Updated] (SPARK-22099) The 'job ids' list style needs to be changed in the SQL page.

2017-09-22 Thread guoxiaolongzte (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22099?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] guoxiaolongzte updated SPARK-22099: --- Description: The 'job ids' list style needs to be changed in the SQL page The 'job ids'

[jira] [Created] (SPARK-22099) The 'job ids' list style needs to be changed in the SQL page.

2017-09-22 Thread guoxiaolongzte (JIRA)
guoxiaolongzte created SPARK-22099: -- Summary: The 'job ids' list style needs to be changed in the SQL page. Key: SPARK-22099 URL: https://issues.apache.org/jira/browse/SPARK-22099 Project: Spark

[jira] [Commented] (SPARK-21999) ConcurrentModificationException - Spark Streaming

2017-09-22 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21999?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16175986#comment-16175986 ] Shixiong Zhu commented on SPARK-21999: -- >From the stack trace, it seems the problem is in the