[jira] [Commented] (SPARK-22077) RpcEndpointAddress fails to parse spark URL if it is an ipv6 address.

2017-09-23 Thread Sayat Satybaldiyev (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22077?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16178082#comment-16178082 ] Sayat Satybaldiyev commented on SPARK-22077: I'm working on this issue and hope I'll be able

[jira] [Commented] (SPARK-22108) Logical Inconsistency in Timestamp Cast

2017-09-23 Thread Yuming Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22108?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16178039#comment-16178039 ] Yuming Wang commented on SPARK-22108: -

[jira] [Commented] (SPARK-13030) Change OneHotEncoder to Estimator

2017-09-23 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13030?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16177939#comment-16177939 ] Nick Pentreath commented on SPARK-13030: It's ugly but we can introduce a new class

[jira] [Updated] (SPARK-22109) Reading tables partitioned by columns that look like timestamps has inconsistent schema inference

2017-09-23 Thread Takuya Ueshin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22109?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Takuya Ueshin updated SPARK-22109: -- Fix Version/s: 2.2.1 > Reading tables partitioned by columns that look like timestamps has >

[jira] [Resolved] (SPARK-22110) Enhance function description trim string function

2017-09-23 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22110?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li resolved SPARK-22110. - Resolution: Fixed Fix Version/s: 2.3.0 > Enhance function description trim string function >

[jira] [Commented] (SPARK-21190) SPIP: Vectorized UDFs in Python

2017-09-23 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21190?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16177886#comment-16177886 ] Reynold Xin commented on SPARK-21190: - Maybe create an umbrella ticket so it is easier to link. >

[jira] [Updated] (SPARK-18136) Make PySpark pip install works on windows

2017-09-23 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18136?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] holdenk updated SPARK-18136: Fix Version/s: (was: 2.1.2) > Make PySpark pip install works on windows >

[jira] [Commented] (SPARK-21190) SPIP: Vectorized UDFs in Python

2017-09-23 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21190?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16177885#comment-16177885 ] Wenchen Fan commented on SPARK-21190: - yea, let's do that in a separated ticket. > SPIP: Vectorized

[jira] [Updated] (SPARK-18136) Make PySpark pip install works on windows

2017-09-23 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18136?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] holdenk updated SPARK-18136: Fix Version/s: 2.1.3 > Make PySpark pip install works on windows >

[jira] [Assigned] (SPARK-20448) Document how FileInputDStream works with object storage

2017-09-23 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20448?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen reassigned SPARK-20448: - Assignee: Steve Loughran > Document how FileInputDStream works with object storage >

[jira] [Resolved] (SPARK-20448) Document how FileInputDStream works with object storage

2017-09-23 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20448?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-20448. --- Resolution: Fixed Fix Version/s: 2.3.0 Issue resolved by pull request 17743

[jira] [Commented] (SPARK-20803) KernelDensity.estimate in pyspark.mllib.stat.KernelDensity throws net.razorvine.pickle.PickleException when input data is normally distributed (no error when data is n

2017-09-23 Thread Alessio Placitelli (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20803?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16177855#comment-16177855 ] Alessio Placitelli commented on SPARK-20803: I can still reproduce this issue with Spark

[jira] [Commented] (SPARK-22109) Reading tables partitioned by columns that look like timestamps has inconsistent schema inference

2017-09-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22109?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16177847#comment-16177847 ] Apache Spark commented on SPARK-22109: -- User 'HyukjinKwon' has created a pull request for this

[jira] [Updated] (SPARK-22109) Reading tables partitioned by columns that look like timestamps has inconsistent schema inference

2017-09-23 Thread Takuya Ueshin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22109?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Takuya Ueshin updated SPARK-22109: -- Fix Version/s: 2.3.0 > Reading tables partitioned by columns that look like timestamps has >

[jira] [Commented] (SPARK-21190) SPIP: Vectorized UDFs in Python

2017-09-23 Thread Li Jin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21190?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16177846#comment-16177846 ] Li Jin commented on SPARK-21190: [~cloud_fan], do we want to track other vectorized udf efforts (group,

[jira] [Resolved] (SPARK-22109) Reading tables partitioned by columns that look like timestamps has inconsistent schema inference

2017-09-23 Thread Takuya Ueshin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22109?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Takuya Ueshin resolved SPARK-22109. --- Resolution: Fixed > Reading tables partitioned by columns that look like timestamps has >

[jira] [Commented] (SPARK-22109) Reading tables partitioned by columns that look like timestamps has inconsistent schema inference

2017-09-23 Thread Takuya Ueshin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22109?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16177841#comment-16177841 ] Takuya Ueshin commented on SPARK-22109: --- Issue resolved by pull request 19331

[jira] [Assigned] (SPARK-22109) Reading tables partitioned by columns that look like timestamps has inconsistent schema inference

2017-09-23 Thread Takuya Ueshin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22109?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Takuya Ueshin reassigned SPARK-22109: - Assignee: Hyukjin Kwon > Reading tables partitioned by columns that look like

[jira] [Assigned] (SPARK-21190) SPIP: Vectorized UDFs in Python

2017-09-23 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21190?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan reassigned SPARK-21190: --- Assignee: Bryan Cutler (was: Reynold Xin) > SPIP: Vectorized UDFs in Python >

[jira] [Resolved] (SPARK-21190) SPIP: Vectorized UDFs in Python

2017-09-23 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21190?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-21190. - Resolution: Fixed Fix Version/s: 2.3.0 Issue resolved by pull request 18659

[jira] [Assigned] (SPARK-22110) Enhance function description trim string function

2017-09-23 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22110?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen reassigned SPARK-22110: - Assignee: kevin yu Fix Version/s: (was: 2.3.0) > Enhance function description trim

[jira] [Assigned] (SPARK-22033) BufferHolder, other size checks should account for the specific VM array size limitations

2017-09-23 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22033?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen reassigned SPARK-22033: - Assignee: Sean Owen > BufferHolder, other size checks should account for the specific VM array

[jira] [Resolved] (SPARK-22033) BufferHolder, other size checks should account for the specific VM array size limitations

2017-09-23 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22033?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-22033. --- Resolution: Fixed Fix Version/s: 2.3.0 Issue resolved by pull request 19266

[jira] [Resolved] (SPARK-22099) The 'job ids' list style needs to be changed in the SQL page.

2017-09-23 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22099?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-22099. --- Resolution: Fixed Fix Version/s: 2.3.0 Issue resolved by pull request 19320

[jira] [Assigned] (SPARK-22099) The 'job ids' list style needs to be changed in the SQL page.

2017-09-23 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22099?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen reassigned SPARK-22099: - Assignee: guoxiaolongzte > The 'job ids' list style needs to be changed in the SQL page. >

[jira] [Commented] (SPARK-18136) Make PySpark pip install works on windows

2017-09-23 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18136?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16177818#comment-16177818 ] Hyukjin Kwon commented on SPARK-18136: -- I haven't looked into the way you said but let's make the

[jira] [Commented] (SPARK-21157) Report Total Memory Used by Spark Executors

2017-09-23 Thread Wang Haihua (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21157?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16177814#comment-16177814 ] Wang Haihua commented on SPARK-21157: - Dose the include the RES memory of one executor? It does

[jira] [Updated] (SPARK-22092) Reallocation in OffHeapColumnVector.reserveInternal corrupts array data

2017-09-23 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22092?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Herman van Hovell updated SPARK-22092: -- Fix Version/s: 2.2.1 > Reallocation in OffHeapColumnVector.reserveInternal corrupts

[jira] [Commented] (SPARK-18136) Make PySpark pip install works on windows

2017-09-23 Thread Jakub Nowacki (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18136?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16177795#comment-16177795 ] Jakub Nowacki commented on SPARK-18136: --- I've looked into it again and noticed the Bash script

[jira] [Assigned] (SPARK-18136) Make PySpark pip install works on windows

2017-09-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18136?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18136: Assignee: Apache Spark > Make PySpark pip install works on windows >

[jira] [Assigned] (SPARK-18136) Make PySpark pip install works on windows

2017-09-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18136?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18136: Assignee: (was: Apache Spark) > Make PySpark pip install works on windows >

[jira] [Reopened] (SPARK-18136) Make PySpark pip install works on windows

2017-09-23 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18136?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon reopened SPARK-18136: -- Oh, let me leave it open. I guess this is not fully solved and needs a followup. > Make PySpark

[jira] [Resolved] (SPARK-18136) Make PySpark pip install works on windows

2017-09-23 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18136?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-18136. -- Resolution: Fixed Fix Version/s: 2.1.2 2.3.0 2.2.1

[jira] [Created] (SPARK-22111) OnlineLDAOptimizer should filter out empty documents beforehand

2017-09-23 Thread Weichen Xu (JIRA)
Weichen Xu created SPARK-22111: -- Summary: OnlineLDAOptimizer should filter out empty documents beforehand Key: SPARK-22111 URL: https://issues.apache.org/jira/browse/SPARK-22111 Project: Spark

[jira] [Assigned] (SPARK-22093) UtilsSuite "resolveURIs with multiple paths" test always cancelled

2017-09-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22093?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22093: Assignee: (was: Apache Spark) > UtilsSuite "resolveURIs with multiple paths" test

[jira] [Commented] (SPARK-22093) UtilsSuite "resolveURIs with multiple paths" test always cancelled

2017-09-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22093?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16177722#comment-16177722 ] Apache Spark commented on SPARK-22093: -- User 'HyukjinKwon' has created a pull request for this

[jira] [Assigned] (SPARK-22093) UtilsSuite "resolveURIs with multiple paths" test always cancelled

2017-09-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22093?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22093: Assignee: Apache Spark > UtilsSuite "resolveURIs with multiple paths" test always

[jira] [Assigned] (SPARK-22109) Reading tables partitioned by columns that look like timestamps has inconsistent schema inference

2017-09-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22109?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22109: Assignee: (was: Apache Spark) > Reading tables partitioned by columns that look like

[jira] [Assigned] (SPARK-22109) Reading tables partitioned by columns that look like timestamps has inconsistent schema inference

2017-09-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22109?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22109: Assignee: Apache Spark > Reading tables partitioned by columns that look like timestamps

[jira] [Commented] (SPARK-22109) Reading tables partitioned by columns that look like timestamps has inconsistent schema inference

2017-09-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22109?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16177715#comment-16177715 ] Apache Spark commented on SPARK-22109: -- User 'HyukjinKwon' has created a pull request for this

[jira] [Comment Edited] (SPARK-19357) Parallel Model Evaluation for ML Tuning: Scala

2017-09-23 Thread Weichen Xu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19357?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16177709#comment-16177709 ] Weichen Xu edited comment on SPARK-19357 at 9/23/17 10:18 AM: -- I thought on

[jira] [Commented] (SPARK-19357) Parallel Model Evaluation for ML Tuning: Scala

2017-09-23 Thread Weichen Xu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19357?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16177709#comment-16177709 ] Weichen Xu commented on SPARK-19357: I thought on this again. If we do not considering the thing