[jira] [Commented] (SPARK-26113) TypeError: object of type 'NoneType' has no len() in authenticate_and_accum_updates of pyspark/accumulators.py

2018-11-18 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26113?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16691340#comment-16691340 ] Hyukjin Kwon commented on SPARK-26113: -- Please avoid to set Critical+ priority which is usually

[jira] [Updated] (SPARK-26113) TypeError: object of type 'NoneType' has no len() in authenticate_and_accum_updates of pyspark/accumulators.py

2018-11-18 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26113?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon updated SPARK-26113: - Priority: Major (was: Blocker) > TypeError: object of type 'NoneType' has no len() in >

[jira] [Commented] (SPARK-26113) TypeError: object of type 'NoneType' has no len() in authenticate_and_accum_updates of pyspark/accumulators.py

2018-11-18 Thread Sai Varun Reddy Daram (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26113?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16691330#comment-16691330 ] Sai Varun Reddy Daram commented on SPARK-26113: --- Something to help here:

[jira] [Updated] (SPARK-26113) TypeError: object of type 'NoneType' has no len() in authenticate_and_accum_updates of pyspark/accumulators.py

2018-11-18 Thread Sai Varun Reddy Daram (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26113?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sai Varun Reddy Daram updated SPARK-26113: -- Description: Machine OS: Ubuntu 16.04. Kubernetes: Minikube  Kubernetes

[jira] [Created] (SPARK-26113) TypeError: object of type 'NoneType' has no len() in authenticate_and_accum_updates of pyspark/accumulators.py

2018-11-18 Thread Sai Varun Reddy Daram (JIRA)
Sai Varun Reddy Daram created SPARK-26113: - Summary: TypeError: object of type 'NoneType' has no len() in authenticate_and_accum_updates of pyspark/accumulators.py Key: SPARK-26113 URL:

[jira] [Commented] (SPARK-26112) Update since versions of new built-in functions.

2018-11-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26112?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16691297#comment-16691297 ] Apache Spark commented on SPARK-26112: -- User 'ueshin' has created a pull request for this issue:

[jira] [Created] (SPARK-26112) Update since versions of new built-in functions.

2018-11-18 Thread Takuya Ueshin (JIRA)
Takuya Ueshin created SPARK-26112: - Summary: Update since versions of new built-in functions. Key: SPARK-26112 URL: https://issues.apache.org/jira/browse/SPARK-26112 Project: Spark Issue

[jira] [Assigned] (SPARK-26112) Update since versions of new built-in functions.

2018-11-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26112?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-26112: Assignee: (was: Apache Spark) > Update since versions of new built-in functions. >

[jira] [Assigned] (SPARK-26112) Update since versions of new built-in functions.

2018-11-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26112?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-26112: Assignee: Apache Spark > Update since versions of new built-in functions. >

[jira] [Commented] (SPARK-12717) pyspark broadcast fails when using multiple threads

2018-11-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12717?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16691282#comment-16691282 ] Apache Spark commented on SPARK-12717: -- User 'BryanCutler' has created a pull request for this

[jira] [Commented] (SPARK-12717) pyspark broadcast fails when using multiple threads

2018-11-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12717?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16691281#comment-16691281 ] Apache Spark commented on SPARK-12717: -- User 'BryanCutler' has created a pull request for this

[jira] [Created] (SPARK-26111) Support ANOVA F-value between label/feature for the continuous distribution feature selection

2018-11-18 Thread Bihui Jin (JIRA)
Bihui Jin created SPARK-26111: - Summary: Support ANOVA F-value between label/feature for the continuous distribution feature selection Key: SPARK-26111 URL: https://issues.apache.org/jira/browse/SPARK-26111

[jira] [Commented] (SPARK-26031) dataframe can't load correct after saving to local disk in cluster mode

2018-11-18 Thread Bihui Jin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26031?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16691202#comment-16691202 ] Bihui Jin commented on SPARK-26031: --- [~hyukjin.kwon] Got it, thank you. > dataframe can't load

[jira] [Closed] (SPARK-26031) dataframe can't load correct after saving to local disk in cluster mode

2018-11-18 Thread Bihui Jin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26031?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bihui Jin closed SPARK-26031. - Not a bug > dataframe can't load correct after saving to local disk in cluster mode >

[jira] [Commented] (SPARK-26039) Reading an empty folder as ORC causes an Analysis Exception

2018-11-18 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26039?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16691173#comment-16691173 ] Hyukjin Kwon commented on SPARK-26039: -- Can you clarify what you meant by ^? I couldn't follow. >

[jira] [Updated] (SPARK-26110) If you restart the spark history server, the "Last Update" of incomplete app(had been kill) will be updated to current time

2018-11-18 Thread zhouyongjin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26110?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhouyongjin updated SPARK-26110: Description: !2018-11-19_093402.png!!2018-11-19_092114.png! The Spark application that is

[jira] [Updated] (SPARK-26110) If you restart the spark history server, the "Last Update" of incomplete app(had been kill) will be updated to current time

2018-11-18 Thread zhouyongjin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26110?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhouyongjin updated SPARK-26110: Attachment: 2018-11-19_093402.png > If you restart the spark history server, the "Last Update" of

[jira] [Updated] (SPARK-26110) If you restart the spark history server, the "Last Update" of incomplete app(had been kill) will be updated to current time

2018-11-18 Thread zhouyongjin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26110?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhouyongjin updated SPARK-26110: Attachment: 2018-11-19_092301.png 2018-11-19_092114.png > If you restart the

[jira] [Created] (SPARK-26110) If you restart the spark history server, the "Last Update" of incomplete app(had been kill) will be updated to current time

2018-11-18 Thread zhouyongjin (JIRA)
zhouyongjin created SPARK-26110: --- Summary: If you restart the spark history server, the "Last Update" of incomplete app(had been kill) will be updated to current time Key: SPARK-26110 URL:

[jira] [Resolved] (SPARK-26105) Clean unittest2 imports up added Python 2.6 before

2018-11-18 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26105?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-26105. -- Resolution: Fixed Fix Version/s: 3.0.0 Issue resolved by pull request 23077

[jira] [Assigned] (SPARK-26105) Clean unittest2 imports up added Python 2.6 before

2018-11-18 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26105?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon reassigned SPARK-26105: Assignee: Hyukjin Kwon > Clean unittest2 imports up added Python 2.6 before >

[jira] [Assigned] (SPARK-26105) Clean unittest2 imports up added Python 2.6 before

2018-11-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26105?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-26105: Assignee: Apache Spark > Clean unittest2 imports up added Python 2.6 before >

[jira] [Commented] (SPARK-26105) Clean unittest2 imports up added Python 2.6 before

2018-11-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26105?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16691143#comment-16691143 ] Apache Spark commented on SPARK-26105: -- User 'HyukjinKwon' has created a pull request for this

[jira] [Commented] (SPARK-26105) Clean unittest2 imports up added Python 2.6 before

2018-11-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26105?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16691141#comment-16691141 ] Apache Spark commented on SPARK-26105: -- User 'HyukjinKwon' has created a pull request for this

[jira] [Assigned] (SPARK-26105) Clean unittest2 imports up added Python 2.6 before

2018-11-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26105?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-26105: Assignee: (was: Apache Spark) > Clean unittest2 imports up added Python 2.6 before >

[jira] [Commented] (SPARK-26109) Duration in the task summary metrics table and the task table are different

2018-11-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26109?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16691124#comment-16691124 ] Apache Spark commented on SPARK-26109: -- User 'shahidki31' has created a pull request for this

[jira] [Commented] (SPARK-26109) Duration in the task summary metrics table and the task table are different

2018-11-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26109?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16691123#comment-16691123 ] Apache Spark commented on SPARK-26109: -- User 'shahidki31' has created a pull request for this

[jira] [Assigned] (SPARK-26109) Duration in the task summary metrics table and the task table are different

2018-11-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26109?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-26109: Assignee: (was: Apache Spark) > Duration in the task summary metrics table and the

[jira] [Assigned] (SPARK-26109) Duration in the task summary metrics table and the task table are different

2018-11-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26109?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-26109: Assignee: Apache Spark > Duration in the task summary metrics table and the task table

[jira] [Created] (SPARK-26109) Duration in the task summary metrics table and the task table are different

2018-11-18 Thread shahid (JIRA)
shahid created SPARK-26109: -- Summary: Duration in the task summary metrics table and the task table are different Key: SPARK-26109 URL: https://issues.apache.org/jira/browse/SPARK-26109 Project: Spark

[jira] [Commented] (SPARK-26045) Error in the spark 2.4 release package with the spark-avro_2.11 depdency

2018-11-18 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-26045?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16691101#comment-16691101 ] Oscar garcía commented on SPARK-26045: --- Following the 2.4 spark documentation,

[jira] [Commented] (SPARK-26039) Reading an empty folder as ORC causes an Analysis Exception

2018-11-18 Thread Abhishek Verma (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26039?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16690989#comment-16690989 ] Abhishek Verma commented on SPARK-26039: But schema os dynamic so cant hanle that with instances

[jira] [Commented] (SPARK-26039) Reading an empty folder as ORC causes an Analysis Exception

2018-11-18 Thread Maxim Gekk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26039?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16690985#comment-16690985 ] Maxim Gekk commented on SPARK-26039: This behaviour is not specific to ORC datasource. You can see

[jira] [Assigned] (SPARK-26026) Published Scaladoc jars missing from Maven Central

2018-11-18 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26026?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen reassigned SPARK-26026: - Docs Text: InterfaceStability.* annotations have become top-level classes;

[jira] [Updated] (SPARK-26026) Published Scaladoc jars missing from Maven Central

2018-11-18 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26026?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-26026: -- Priority: Major (was: Minor) > Published Scaladoc jars missing from Maven Central >

[jira] [Updated] (SPARK-26090) Resolve most miscellaneous deprecation and build warnings for Spark 3

2018-11-18 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26090?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-26090: -- Description: The build has a lot of deprecation warnings. Some are new in Scala 2.12 and Java 11.

[jira] [Assigned] (SPARK-26108) Support custom lineSep in CSV datasource

2018-11-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26108?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-26108: Assignee: Apache Spark > Support custom lineSep in CSV datasource >

[jira] [Assigned] (SPARK-26108) Support custom lineSep in CSV datasource

2018-11-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26108?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-26108: Assignee: (was: Apache Spark) > Support custom lineSep in CSV datasource >

[jira] [Commented] (SPARK-26108) Support custom lineSep in CSV datasource

2018-11-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26108?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16690878#comment-16690878 ] Apache Spark commented on SPARK-26108: -- User 'MaxGekk' has created a pull request for this issue:

[jira] [Created] (SPARK-26108) Support custom lineSep in CSV datasource

2018-11-18 Thread Maxim Gekk (JIRA)
Maxim Gekk created SPARK-26108: -- Summary: Support custom lineSep in CSV datasource Key: SPARK-26108 URL: https://issues.apache.org/jira/browse/SPARK-26108 Project: Spark Issue Type: New Feature

[jira] [Commented] (SPARK-26107) Extend ReplaceNullWithFalseInPredicate to support higher-order functions: ArrayExists, ArrayFilter, MapFilter

2018-11-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26107?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16690843#comment-16690843 ] Apache Spark commented on SPARK-26107: -- User 'rednaxelafx' has created a pull request for this

[jira] [Assigned] (SPARK-26107) Extend ReplaceNullWithFalseInPredicate to support higher-order functions: ArrayExists, ArrayFilter, MapFilter

2018-11-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26107?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-26107: Assignee: (was: Apache Spark) > Extend ReplaceNullWithFalseInPredicate to support

[jira] [Assigned] (SPARK-26107) Extend ReplaceNullWithFalseInPredicate to support higher-order functions: ArrayExists, ArrayFilter, MapFilter

2018-11-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26107?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-26107: Assignee: Apache Spark > Extend ReplaceNullWithFalseInPredicate to support higher-order

[jira] [Created] (SPARK-26107) Extend ReplaceNullWithFalseInPredicate to support higher-order functions: ArrayExists, ArrayFilter, MapFilter

2018-11-18 Thread Kris Mok (JIRA)
Kris Mok created SPARK-26107: Summary: Extend ReplaceNullWithFalseInPredicate to support higher-order functions: ArrayExists, ArrayFilter, MapFilter Key: SPARK-26107 URL:

[jira] [Commented] (SPARK-25549) High level API to collect RDD statistics

2018-11-18 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25549?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16690841#comment-16690841 ] Liang-Chi Hsieh commented on SPARK-25549: - I have code patch based on the design doc in local.

[jira] [Assigned] (SPARK-26106) Prioritizes ML unittests over the doctests in PySpark

2018-11-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26106?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-26106: Assignee: Apache Spark > Prioritizes ML unittests over the doctests in PySpark >

[jira] [Commented] (SPARK-26106) Prioritizes ML unittests over the doctests in PySpark

2018-11-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26106?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16690831#comment-16690831 ] Apache Spark commented on SPARK-26106: -- User 'HyukjinKwon' has created a pull request for this

[jira] [Assigned] (SPARK-26106) Prioritizes ML unittests over the doctests in PySpark

2018-11-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26106?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-26106: Assignee: (was: Apache Spark) > Prioritizes ML unittests over the doctests in

[jira] [Commented] (SPARK-26106) Prioritizes ML unittests over the doctests in PySpark

2018-11-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26106?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16690832#comment-16690832 ] Apache Spark commented on SPARK-26106: -- User 'HyukjinKwon' has created a pull request for this

[jira] [Created] (SPARK-26106) Prioritizes ML unittests over the doctests in PySpark

2018-11-18 Thread Hyukjin Kwon (JIRA)
Hyukjin Kwon created SPARK-26106: Summary: Prioritizes ML unittests over the doctests in PySpark Key: SPARK-26106 URL: https://issues.apache.org/jira/browse/SPARK-26106 Project: Spark Issue

[jira] [Commented] (SPARK-25344) Break large PySpark unittests into smaller files

2018-11-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25344?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16690830#comment-16690830 ] Apache Spark commented on SPARK-25344: -- User 'HyukjinKwon' has created a pull request for this

[jira] [Created] (SPARK-26105) Clean unittest2 imports up added Python 2.6 before

2018-11-18 Thread Hyukjin Kwon (JIRA)
Hyukjin Kwon created SPARK-26105: Summary: Clean unittest2 imports up added Python 2.6 before Key: SPARK-26105 URL: https://issues.apache.org/jira/browse/SPARK-26105 Project: Spark Issue

[jira] [Resolved] (SPARK-25344) Break large PySpark unittests into smaller files

2018-11-18 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25344?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-25344. -- Resolution: Done > Break large PySpark unittests into smaller files >

[jira] [Resolved] (SPARK-26033) Break large ml/tests.py files into smaller files

2018-11-18 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26033?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-26033. -- Resolution: Fixed Fix Version/s: 3.0.0 Issue resolved by pull request 23063