[jira] [Assigned] (SPARK-22981) Incorrect results of casting Struct to String

2018-01-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22981?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22981: Assignee: Apache Spark > Incorrect results of casting Struct to String > -

[jira] [Commented] (SPARK-22981) Incorrect results of casting Struct to String

2018-01-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16315071#comment-16315071 ] Apache Spark commented on SPARK-22981: -- User 'maropu' has created a pull request for

[jira] [Assigned] (SPARK-22981) Incorrect results of casting Struct to String

2018-01-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22981?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22981: Assignee: (was: Apache Spark) > Incorrect results of casting Struct to String > --

[jira] [Created] (SPARK-22981) Incorrect results of casting Struct to String

2018-01-06 Thread Takeshi Yamamuro (JIRA)
Takeshi Yamamuro created SPARK-22981: Summary: Incorrect results of casting Struct to String Key: SPARK-22981 URL: https://issues.apache.org/jira/browse/SPARK-22981 Project: Spark Issue T

[jira] [Resolved] (SPARK-22973) Incorrect results of casting Map to String

2018-01-06 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22973?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-22973. - Resolution: Fixed Fix Version/s: 2.3.0 Issue resolved by pull request 20166 [https://githu

[jira] [Assigned] (SPARK-22973) Incorrect results of casting Map to String

2018-01-06 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22973?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan reassigned SPARK-22973: --- Assignee: Takeshi Yamamuro > Incorrect results of casting Map to String > --

[jira] [Updated] (SPARK-22980) Wrong answer when using pandas_udf

2018-01-06 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22980?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-22980: Priority: Blocker (was: Major) > Wrong answer when using pandas_udf > -- >

[jira] [Commented] (SPARK-22980) Wrong answer when using pandas_udf

2018-01-06 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22980?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16315042#comment-16315042 ] Xiao Li commented on SPARK-22980: - cc [~icexelloss] [~bryanc] [~ueshin] > Wrong answer w

[jira] [Created] (SPARK-22980) Wrong answer when using pandas_udf

2018-01-06 Thread Xiao Li (JIRA)
Xiao Li created SPARK-22980: --- Summary: Wrong answer when using pandas_udf Key: SPARK-22980 URL: https://issues.apache.org/jira/browse/SPARK-22980 Project: Spark Issue Type: Sub-task Compo

[jira] [Updated] (SPARK-20498) RandomForestRegressionModel should expose getMaxDepth in PySpark

2018-01-06 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20498?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-20498: -- Target Version/s: 2.4.0 (was: 2.3.0) > RandomForestRegressionModel should expose getMa

[jira] [Commented] (SPARK-18569) Support R formula arithmetic

2018-01-06 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18569?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16314994#comment-16314994 ] Joseph K. Bradley commented on SPARK-18569: --- [~felixcheung] There are a few JIR

[jira] [Commented] (SPARK-20498) RandomForestRegressionModel should expose getMaxDepth in PySpark

2018-01-06 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20498?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16314993#comment-16314993 ] Joseph K. Bradley commented on SPARK-20498: --- We'll need to retarget this for 2.

[jira] [Updated] (SPARK-20498) RandomForestRegressionModel should expose getMaxDepth in PySpark

2018-01-06 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20498?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-20498: -- Shepherd: Joseph K. Bradley > RandomForestRegressionModel should expose getMaxDepth in

[jira] [Commented] (SPARK-20602) Adding LBFGS optimizer and Squared_hinge loss for LinearSVC

2018-01-06 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20602?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16314979#comment-16314979 ] Joseph K. Bradley commented on SPARK-20602: --- I'm afraid we'll need to re-target

[jira] [Updated] (SPARK-20602) Adding LBFGS optimizer and Squared_hinge loss for LinearSVC

2018-01-06 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20602?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-20602: -- Target Version/s: 2.4.0 (was: 2.3.0) > Adding LBFGS optimizer and Squared_hinge loss f

[jira] [Commented] (SPARK-22796) Add multiple column support to PySpark QuantileDiscretizer

2018-01-06 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22796?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16314978#comment-16314978 ] Joseph K. Bradley commented on SPARK-22796: --- We'll need to re-target this for 2

[jira] [Commented] (SPARK-18348) Improve tree ensemble model summary

2018-01-06 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18348?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16314976#comment-16314976 ] Joseph K. Bradley commented on SPARK-18348: --- I'll remove the target, but please

[jira] [Updated] (SPARK-18348) Improve tree ensemble model summary

2018-01-06 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18348?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-18348: -- Target Version/s: (was: 2.3.0) > Improve tree ensemble model summary > --

[jira] [Updated] (SPARK-15572) ML persistence in R format: compatibility with other languages

2018-01-06 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15572?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-15572: -- Target Version/s: (was: 2.3.0) > ML persistence in R format: compatibility with other

[jira] [Comment Edited] (SPARK-15572) ML persistence in R format: compatibility with other languages

2018-01-06 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15572?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15987592#comment-15987592 ] Joseph K. Bradley edited comment on SPARK-15572 at 1/6/18 10:40 PM: ---

[jira] [Comment Edited] (SPARK-15572) ML persistence in R format: compatibility with other languages

2018-01-06 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15572?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15987592#comment-15987592 ] Joseph K. Bradley edited comment on SPARK-15572 at 1/6/18 10:39 PM: ---

[jira] [Comment Edited] (SPARK-15784) Add Power Iteration Clustering to spark.ml

2018-01-06 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15784?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15987588#comment-15987588 ] Joseph K. Bradley edited comment on SPARK-15784 at 1/6/18 10:39 PM: ---

[jira] [Updated] (SPARK-15784) Add Power Iteration Clustering to spark.ml

2018-01-06 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15784?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-15784: -- Target Version/s: 2.4.0 (was: 2.3.0) > Add Power Iteration Clustering to spark.ml > --

[jira] [Updated] (SPARK-18618) SparkR GLM model predict should support type as a argument

2018-01-06 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18618?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-18618: -- Target Version/s: (was: 2.3.0) > SparkR GLM model predict should support type as a ar

[jira] [Commented] (SPARK-18618) SparkR GLM model predict should support type as a argument

2018-01-06 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18618?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16314966#comment-16314966 ] Joseph K. Bradley commented on SPARK-18618: --- I'll remove the target, but please

[jira] [Assigned] (SPARK-22951) count() after dropDuplicates() on emptyDataFrame returns incorrect value

2018-01-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22951?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22951: Assignee: (was: Apache Spark) > count() after dropDuplicates() on emptyDataFrame retur

[jira] [Assigned] (SPARK-22951) count() after dropDuplicates() on emptyDataFrame returns incorrect value

2018-01-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22951?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22951: Assignee: Apache Spark > count() after dropDuplicates() on emptyDataFrame returns incorrec

[jira] [Commented] (SPARK-22951) count() after dropDuplicates() on emptyDataFrame returns incorrect value

2018-01-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22951?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16314508#comment-16314508 ] Apache Spark commented on SPARK-22951: -- User 'liufengdb' has created a pull request

[jira] [Updated] (SPARK-21786) The 'spark.sql.parquet.compression.codec' configuration doesn't take effect on tables with partition field(s)

2018-01-06 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21786?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-21786: Description: Since Hive 1.1, Hive allows users to set parquet compression codec via table-level properties

[jira] [Assigned] (SPARK-21786) The 'spark.sql.parquet.compression.codec' configuration doesn't take effect on tables with partition field(s)

2018-01-06 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21786?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li reassigned SPARK-21786: --- Assignee: Jinhua Fu > The 'spark.sql.parquet.compression.codec' configuration doesn't take effect >

[jira] [Resolved] (SPARK-21786) The 'spark.sql.parquet.compression.codec' configuration doesn't take effect on tables with partition field(s)

2018-01-06 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21786?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li resolved SPARK-21786. - Resolution: Fixed Fix Version/s: 2.3.0 > The 'spark.sql.parquet.compression.codec' configuration d

[jira] [Resolved] (SPARK-22793) Memory leak in Spark Thrift Server

2018-01-06 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22793?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li resolved SPARK-22793. - Resolution: Fixed Assignee: zuotingbing Fix Version/s: 2.3.0 > Memory leak in Spark Thrif

[jira] [Commented] (SPARK-22901) Add non-deterministic to Python UDF

2018-01-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22901?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16314476#comment-16314476 ] Apache Spark commented on SPARK-22901: -- User 'HyukjinKwon' has created a pull reques

[jira] [Commented] (SPARK-22979) Avoid per-record type dispatch in Python data conversion (EvaluatePython.fromJava)

2018-01-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16314459#comment-16314459 ] Apache Spark commented on SPARK-22979: -- User 'HyukjinKwon' has created a pull reques

[jira] [Assigned] (SPARK-22979) Avoid per-record type dispatch in Python data conversion (EvaluatePython.fromJava)

2018-01-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22979?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22979: Assignee: Apache Spark > Avoid per-record type dispatch in Python data conversion > (Eval

[jira] [Assigned] (SPARK-22979) Avoid per-record type dispatch in Python data conversion (EvaluatePython.fromJava)

2018-01-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22979?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22979: Assignee: (was: Apache Spark) > Avoid per-record type dispatch in Python data conversi

[jira] [Updated] (SPARK-22978) Register Vectorized UDFs for SQL Statement

2018-01-06 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22978?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-22978: Description: Capable of registering vectorized UDFs and then using them in SQL statements. For example, {noform

[jira] [Assigned] (SPARK-22978) Register Vectorized UDFs for SQL Statement

2018-01-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22978?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22978: Assignee: Xiao Li (was: Apache Spark) > Register Vectorized UDFs for SQL Statement >

[jira] [Assigned] (SPARK-22978) Register Vectorized UDFs for SQL Statement

2018-01-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22978?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22978: Assignee: Apache Spark (was: Xiao Li) > Register Vectorized UDFs for SQL Statement >

[jira] [Commented] (SPARK-22978) Register Vectorized UDFs for SQL Statement

2018-01-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22978?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16314456#comment-16314456 ] Apache Spark commented on SPARK-22978: -- User 'gatorsmile' has created a pull request

[jira] [Created] (SPARK-22979) Avoid per-record type dispatch in Python data conversion (EvaluatePython.fromJava)

2018-01-06 Thread Hyukjin Kwon (JIRA)
Hyukjin Kwon created SPARK-22979: Summary: Avoid per-record type dispatch in Python data conversion (EvaluatePython.fromJava) Key: SPARK-22979 URL: https://issues.apache.org/jira/browse/SPARK-22979 Pr

[jira] [Updated] (SPARK-22978) Register Vectorized UDFs for SQL Statement

2018-01-06 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22978?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-22978: Description: Capable of registering vectorized UDFs and then using them in SQL statements (was: R) > Register

[jira] [Updated] (SPARK-22978) Register Vectorized UDFs for SQL Statement

2018-01-06 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22978?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-22978: Summary: Register Vectorized UDFs for SQL Statement (was: Register Vectori) > Register Vectorized UDFs for

[jira] [Created] (SPARK-22978) Register Vectori

2018-01-06 Thread Xiao Li (JIRA)
Xiao Li created SPARK-22978: --- Summary: Register Vectori Key: SPARK-22978 URL: https://issues.apache.org/jira/browse/SPARK-22978 Project: Spark Issue Type: Sub-task Components: PySpark

[jira] [Issue Comment Deleted] (SPARK-7551) Don't split by dot if within backticks for DataFrame attribute resolution

2018-01-06 Thread Haris (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7551?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Haris updated SPARK-7551: - Comment: was deleted (was: I encountered a similar problem in Spark 2.2 while using PySpark. I tried to split a c

[jira] [Commented] (SPARK-7551) Don't split by dot if within backticks for DataFrame attribute resolution

2018-01-06 Thread Haris (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7551?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16314443#comment-16314443 ] Haris commented on SPARK-7551: -- I encountered a similar problem in Spark 2.2 while using pysp

[jira] [Resolved] (SPARK-22930) Improve the description of Vectorized UDFs for non-deterministic cases

2018-01-06 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22930?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li resolved SPARK-22930. - Resolution: Fixed Assignee: Li Jin Fix Version/s: 2.3.0 > Improve the description of Vect