[jira] [Assigned] (SPARK-23447) Cleanup codegen template for Literal

2018-02-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23447?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23447: Assignee: (was: Apache Spark) > Cleanup codegen template for Literal >

[jira] [Commented] (SPARK-23447) Cleanup codegen template for Literal

2018-02-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23447?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16366688#comment-16366688 ] Apache Spark commented on SPARK-23447: -- User 'rednaxelafx' has created a pull request for this

[jira] [Commented] (SPARK-23437) [ML] Distributed Gaussian Process Regression for MLlib

2018-02-16 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23437?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16366744#comment-16366744 ] Nick Pentreath commented on SPARK-23437: It sounds interesting - however the standard practice is

[jira] [Commented] (SPARK-23217) Add cosine distance measure to ClusteringEvaluator

2018-02-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23217?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16366762#comment-16366762 ] Apache Spark commented on SPARK-23217: -- User 'mgaido91' has created a pull request for this issue:

[jira] [Resolved] (SPARK-23447) Cleanup codegen template for Literal

2018-02-16 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23447?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-23447. - Resolution: Fixed Fix Version/s: 2.4.0 Issue resolved by pull request 20626

[jira] [Commented] (SPARK-23437) [ML] Distributed Gaussian Process Regression for MLlib

2018-02-16 Thread Seth Hendrickson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23437?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16368109#comment-16368109 ] Seth Hendrickson commented on SPARK-23437: -- TBH, this seems like a pretty reasonable request.

[jira] [Comment Edited] (SPARK-23399) Register a task completion listener first for OrcColumnarBatchReader

2018-02-16 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23399?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16368121#comment-16368121 ] Dongjoon Hyun edited comment on SPARK-23399 at 2/17/18 7:29 AM:

[jira] [Commented] (SPARK-3159) Check for reducible DecisionTree

2018-02-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3159?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16368106#comment-16368106 ] Apache Spark commented on SPARK-3159: - User 'asolimando' has created a pull request for this issue:

[jira] [Commented] (SPARK-3159) Check for reducible DecisionTree

2018-02-16 Thread Alessandro Solimando (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3159?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16368108#comment-16368108 ] Alessandro Solimando commented on SPARK-3159: - As I was not aware of this Jira case I have

[jira] [Created] (SPARK-23455) Default Params in ML should be saved separately

2018-02-16 Thread Liang-Chi Hsieh (JIRA)
Liang-Chi Hsieh created SPARK-23455: --- Summary: Default Params in ML should be saved separately Key: SPARK-23455 URL: https://issues.apache.org/jira/browse/SPARK-23455 Project: Spark Issue

[jira] [Created] (SPARK-23454) Add Trigger information to the Structured Streaming programming guide

2018-02-16 Thread Tathagata Das (JIRA)
Tathagata Das created SPARK-23454: - Summary: Add Trigger information to the Structured Streaming programming guide Key: SPARK-23454 URL: https://issues.apache.org/jira/browse/SPARK-23454 Project:

[jira] [Updated] (SPARK-23454) Add Trigger information to the Structured Streaming programming guide

2018-02-16 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23454?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das updated SPARK-23454: -- Priority: Minor (was: Major) > Add Trigger information to the Structured Streaming

[jira] [Assigned] (SPARK-23454) Add Trigger information to the Structured Streaming programming guide

2018-02-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23454?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23454: Assignee: Tathagata Das (was: Apache Spark) > Add Trigger information to the Structured

[jira] [Commented] (SPARK-23454) Add Trigger information to the Structured Streaming programming guide

2018-02-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23454?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16368077#comment-16368077 ] Apache Spark commented on SPARK-23454: -- User 'tdas' has created a pull request for this issue:

[jira] [Assigned] (SPARK-23454) Add Trigger information to the Structured Streaming programming guide

2018-02-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23454?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23454: Assignee: Apache Spark (was: Tathagata Das) > Add Trigger information to the Structured

[jira] [Assigned] (SPARK-23447) Cleanup codegen template for Literal

2018-02-16 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23447?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan reassigned SPARK-23447: --- Assignee: Kris Mok > Cleanup codegen template for Literal >

[jira] [Commented] (SPARK-23455) Default Params in ML should be saved separately

2018-02-16 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23455?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16368116#comment-16368116 ] Liang-Chi Hsieh commented on SPARK-23455: - Currently, {{DefaultParamsWriter}} saves the following

[jira] [Commented] (SPARK-23442) Reading from partitioned and bucketed table uses only bucketSpec.numBuckets partitions in all cases

2018-02-16 Thread Pranav Rao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23442?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16368117#comment-16368117 ] Pranav Rao commented on SPARK-23442: Repartitioning is unlikely to be helpful to a user because: *

[jira] [Assigned] (SPARK-23435) R tests should support latest testthat

2018-02-16 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23435?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung reassigned SPARK-23435: Assignee: Felix Cheung > R tests should support latest testthat >

[jira] [Commented] (SPARK-23399) Register a task completion listener first for OrcColumnarBatchReader

2018-02-16 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23399?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16368121#comment-16368121 ] Dongjoon Hyun commented on SPARK-23399: --- [~mgaido]. I understand what is your intention, but please

[jira] [Updated] (SPARK-23265) Update multi-column error handling logic in QuantileDiscretizer

2018-02-16 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23265?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nick Pentreath updated SPARK-23265: --- Description: SPARK-22397 added support for multiple columns to {{QuantileDiscretizer}}. If

[jira] [Commented] (SPARK-12140) Support Streaming UI in HistoryServer

2018-02-16 Thread German Schiavon Matteo (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12140?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16366774#comment-16366774 ] German Schiavon Matteo commented on SPARK-12140: Ok [~jerryshao], Im testing your code

[jira] [Commented] (SPARK-23437) [ML] Distributed Gaussian Process Regression for MLlib

2018-02-16 Thread Valeriy Avanesov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23437?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16366857#comment-16366857 ] Valeriy Avanesov commented on SPARK-23437: -- [~mlnick], is that really supposed to happen to a

[jira] [Commented] (SPARK-23265) Update multi-column error handling logic in QuantileDiscretizer

2018-02-16 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23265?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16366809#comment-16366809 ] Nick Pentreath commented on SPARK-23265: Thanks for the ping - yes it adds more detailed checking

[jira] [Commented] (SPARK-23439) Ambiguous reference when selecting column inside StructType with same name that outer colum

2018-02-16 Thread Marco Gaido (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23439?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16366945#comment-16366945 ] Marco Gaido commented on SPARK-23439: - [~cloud_fan] I think this comes from

[jira] [Commented] (SPARK-23399) Register a task completion listener first for OrcColumnarBatchReader

2018-02-16 Thread Marco Gaido (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23399?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16366788#comment-16366788 ] Marco Gaido commented on SPARK-23399: - I think we should reopen this, it is still happening:

[jira] [Commented] (SPARK-23442) Reading from partitioned and bucketed table uses only bucketSpec.numBuckets partitions in all cases

2018-02-16 Thread Marco Gaido (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23442?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16366898#comment-16366898 ] Marco Gaido commented on SPARK-23442: - I am not sure it is what you are looking for, but you can

[jira] [Updated] (SPARK-23448) Data encoding problem when not finding the right type

2018-02-16 Thread Ahmed ZAROUI (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23448?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ahmed ZAROUI updated SPARK-23448: - Environment: Local (was: Tested locally in linux machine) > Data encoding problem when not

[jira] [Updated] (SPARK-23448) Data encoding problem when not finding the right type

2018-02-16 Thread Ahmed ZAROUI (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23448?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ahmed ZAROUI updated SPARK-23448: - Description: I have the following json file that contains some noisy data(String instead of

[jira] [Created] (SPARK-23451) Deprecate KMeans computeCost

2018-02-16 Thread Marco Gaido (JIRA)
Marco Gaido created SPARK-23451: --- Summary: Deprecate KMeans computeCost Key: SPARK-23451 URL: https://issues.apache.org/jira/browse/SPARK-23451 Project: Spark Issue Type: Task

[jira] [Commented] (SPARK-23420) Datasource loading not handling paths with regex chars.

2018-02-16 Thread Mitchell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23420?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16367529#comment-16367529 ] Mitchell commented on SPARK-23420: -- Yes, I agree there appears to be no way currently for a user to

[jira] [Commented] (SPARK-23288) Incorrect number of written records in structured streaming

2018-02-16 Thread Gabor Somogyi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23288?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16367569#comment-16367569 ] Gabor Somogyi commented on SPARK-23288: --- Seems like no statsTrackers created in FileStreamSink. >

[jira] [Commented] (SPARK-23451) Deprecate KMeans computeCost

2018-02-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23451?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16367578#comment-16367578 ] Apache Spark commented on SPARK-23451: -- User 'mgaido91' has created a pull request for this issue:

[jira] [Assigned] (SPARK-23451) Deprecate KMeans computeCost

2018-02-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23451?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23451: Assignee: (was: Apache Spark) > Deprecate KMeans computeCost >

[jira] [Assigned] (SPARK-23451) Deprecate KMeans computeCost

2018-02-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23451?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23451: Assignee: Apache Spark > Deprecate KMeans computeCost > > >

[jira] [Commented] (SPARK-23288) Incorrect number of written records in structured streaming

2018-02-16 Thread Gabor Somogyi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23288?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16367567#comment-16367567 ] Gabor Somogyi commented on SPARK-23288: --- I'm working on this issue. > Incorrect number of written

[jira] [Updated] (SPARK-23449) Extra java options lose order in Docker context

2018-02-16 Thread Andrew Korzhuev (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23449?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Korzhuev updated SPARK-23449: Description: `spark.driver.extraJavaOptions` and `spark.executor.extraJavaOptions` when

[jira] [Created] (SPARK-23450) jars option in spark submit is documented in misleading way

2018-02-16 Thread Gregory Reshetniak (JIRA)
Gregory Reshetniak created SPARK-23450: -- Summary: jars option in spark submit is documented in misleading way Key: SPARK-23450 URL: https://issues.apache.org/jira/browse/SPARK-23450 Project:

[jira] [Created] (SPARK-23449) Extra java options lose order in Docker context

2018-02-16 Thread Andrew Korzhuev (JIRA)
Andrew Korzhuev created SPARK-23449: --- Summary: Extra java options lose order in Docker context Key: SPARK-23449 URL: https://issues.apache.org/jira/browse/SPARK-23449 Project: Spark Issue

[jira] [Updated] (SPARK-23448) Data encoding problem when not finding the right type

2018-02-16 Thread Ahmed ZAROUI (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23448?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ahmed ZAROUI updated SPARK-23448: - Description: I have the following json file that contains some noisy data(String instead of

[jira] [Created] (SPARK-23448) Data encoding problem when not finding the right type

2018-02-16 Thread Ahmed ZAROUI (JIRA)
Ahmed ZAROUI created SPARK-23448: Summary: Data encoding problem when not finding the right type Key: SPARK-23448 URL: https://issues.apache.org/jira/browse/SPARK-23448 Project: Spark Issue

[jira] [Commented] (SPARK-23439) Ambiguous reference when selecting column inside StructType with same name that outer colum

2018-02-16 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23439?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16367414#comment-16367414 ] Wenchen Fan commented on SPARK-23439: - This is a valid behavior, as `a.b` is an invalid column name

[jira] [Updated] (SPARK-23448) Dataframe returns wrong result when column don't respect datatype

2018-02-16 Thread Ahmed ZAROUI (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23448?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ahmed ZAROUI updated SPARK-23448: - Summary: Dataframe returns wrong result when column don't respect datatype (was: Data encoding

[jira] [Commented] (SPARK-23449) Extra java options lose order in Docker context

2018-02-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23449?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16367370#comment-16367370 ] Apache Spark commented on SPARK-23449: -- User 'andrusha' has created a pull request for this issue:

[jira] [Assigned] (SPARK-23449) Extra java options lose order in Docker context

2018-02-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23449?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23449: Assignee: Apache Spark > Extra java options lose order in Docker context >

[jira] [Assigned] (SPARK-23449) Extra java options lose order in Docker context

2018-02-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23449?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23449: Assignee: (was: Apache Spark) > Extra java options lose order in Docker context >

[jira] [Comment Edited] (SPARK-23427) spark.sql.autoBroadcastJoinThreshold causing OOM exception in the driver

2018-02-16 Thread Pratik Dhumal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23427?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16367742#comment-16367742 ] Pratik Dhumal edited comment on SPARK-23427 at 2/16/18 7:18 PM:

[jira] [Commented] (SPARK-23427) spark.sql.autoBroadcastJoinThreshold causing OOM in the driver

2018-02-16 Thread Pratik Dhumal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23427?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16367742#comment-16367742 ] Pratik Dhumal commented on SPARK-23427: --- {code:java} // code placeholder @Test def testLoop() = {

[jira] [Commented] (SPARK-23452) Improve test coverage for ORC file format

2018-02-16 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23452?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16367792#comment-16367792 ] Dongjoon Hyun commented on SPARK-23452: --- I created this and will proceed this for 2.3.1,

[jira] [Created] (SPARK-23452) Improve test coverage for ORC file format

2018-02-16 Thread Dongjoon Hyun (JIRA)
Dongjoon Hyun created SPARK-23452: - Summary: Improve test coverage for ORC file format Key: SPARK-23452 URL: https://issues.apache.org/jira/browse/SPARK-23452 Project: Spark Issue Type: Test

[jira] [Updated] (SPARK-23452) Extend test coverage to all ORC readers

2018-02-16 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23452?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-23452: -- Component/s: Tests > Extend test coverage to all ORC readers >

[jira] [Updated] (SPARK-23452) Extend test coverage to all ORC readers

2018-02-16 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23452?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-23452: -- Issue Type: Improvement (was: Test) > Extend test coverage to all ORC readers >

[jira] [Updated] (SPARK-23427) spark.sql.autoBroadcastJoinThreshold causing OOM exception in the driver

2018-02-16 Thread Dhiraj (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23427?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dhiraj updated SPARK-23427: --- Summary: spark.sql.autoBroadcastJoinThreshold causing OOM exception in the driver (was:

[jira] [Updated] (SPARK-23452) Improve test coverage for ORC readers

2018-02-16 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23452?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-23452: -- Summary: Improve test coverage for ORC readers (was: Improve test coverage for ORC file

[jira] [Updated] (SPARK-23452) Extend test coverage to all ORC readers

2018-02-16 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23452?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-23452: -- Summary: Extend test coverage to all ORC readers (was: Improve test coverage for ORC readers)

[jira] [Updated] (SPARK-23452) Improve test coverage for ORC readers

2018-02-16 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23452?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-23452: -- Description: We have five ORC readers. We had better have a test coverage for all ORC

[jira] [Commented] (SPARK-23433) java.lang.IllegalStateException: more than one active taskSet for stage

2018-02-16 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23433?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16367728#comment-16367728 ] Shixiong Zhu commented on SPARK-23433: -- [~irashid] I'm busy with other stuff and not working on

[jira] [Commented] (SPARK-23433) java.lang.IllegalStateException: more than one active taskSet for stage

2018-02-16 Thread Imran Rashid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23433?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16367664#comment-16367664 ] Imran Rashid commented on SPARK-23433: -- yes I think you are right [~zsxwing]. Since a zombie

[jira] [Resolved] (SPARK-23446) Explicitly check supported types in toPandas

2018-02-16 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23446?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li resolved SPARK-23446. - Resolution: Fixed Assignee: Hyukjin Kwon Fix Version/s: 2.3.0 > Explicitly check

[jira] [Commented] (SPARK-23433) java.lang.IllegalStateException: more than one active taskSet for stage

2018-02-16 Thread Imran Rashid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23433?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16367666#comment-16367666 ] Imran Rashid commented on SPARK-23433: -- actually, I realized its more general than just marking it

[jira] [Updated] (SPARK-23234) ML python test failure due to default outputCol

2018-02-16 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23234?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-23234: Priority: Major (was: Blocker) > ML python test failure due to default outputCol >

[jira] [Updated] (SPARK-23381) Murmur3 hash generates a different value from other implementations

2018-02-16 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23381?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-23381: -- Issue Type: Bug (was: Improvement) > Murmur3 hash generates a different value from

[jira] [Commented] (SPARK-23381) Murmur3 hash generates a different value from other implementations

2018-02-16 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23381?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16367845#comment-16367845 ] Joseph K. Bradley commented on SPARK-23381: --- Copying my comment from the PR: {quote} For ML, I

[jira] [Commented] (SPARK-23381) Murmur3 hash generates a different value from other implementations

2018-02-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23381?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16367895#comment-16367895 ] Apache Spark commented on SPARK-23381: -- User 'gatorsmile' has created a pull request for this issue:

[jira] [Updated] (SPARK-23381) Murmur3 hash generates a different value from other implementations

2018-02-16 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23381?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-23381: -- Priority: Major (was: Minor) > Murmur3 hash generates a different value from other

[jira] [Commented] (SPARK-23452) Extend test coverage to all ORC readers

2018-02-16 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23452?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16367905#comment-16367905 ] Xiao Li commented on SPARK-23452: - Thanks! I will assign it to you. > Extend test coverage to all ORC

[jira] [Assigned] (SPARK-23452) Extend test coverage to all ORC readers

2018-02-16 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23452?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li reassigned SPARK-23452: --- Assignee: Dongjoon Hyun > Extend test coverage to all ORC readers >

[jira] [Resolved] (SPARK-23409) RandomForest/DecisionTree (syntactic) pruning of redundant subtrees

2018-02-16 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23409?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-23409. --- Resolution: Duplicate Linking old JIRA for this issue > RandomForest/DecisionTree

[jira] [Commented] (SPARK-23337) withWatermark raises an exception on struct objects

2018-02-16 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23337?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16367933#comment-16367933 ] Michael Armbrust commented on SPARK-23337: -- This is essentially the same issue as SPARK-18084.

[jira] [Resolved] (SPARK-23362) Migrate Kafka microbatch source to v2

2018-02-16 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23362?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das resolved SPARK-23362. --- Resolution: Fixed Fix Version/s: 3.0.0 Issue resolved by pull request 20554

[jira] [Commented] (SPARK-13127) Upgrade Parquet to 1.9 (Fixes parquet sorting)

2018-02-16 Thread Li Jin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13127?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16367925#comment-16367925 ] Li Jin commented on SPARK-13127: Hi all, The status of the Jira is "Progress". I am wondering if this is

[jira] [Commented] (SPARK-23417) pyspark tests give wrong sbt instructions

2018-02-16 Thread Bruce Robbins (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23417?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16368031#comment-16368031 ] Bruce Robbins commented on SPARK-23417: --- This does the trick: {noformat} build/sbt -Pkafka-0-8

[jira] [Updated] (SPARK-23453) ToolBox compiled Spark UDAF causes java.lang.InternalError: Malformed class name

2018-02-16 Thread Eric Lo (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23453?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eric Lo updated SPARK-23453: Description: Here is a weird problem I just ran into... My scenario is that I need to compile UDAF

[jira] [Created] (SPARK-23453) ToolBox compiled Spark UDAF causes java.lang.InternalError: Malformed class name

2018-02-16 Thread Eric Lo (JIRA)
Eric Lo created SPARK-23453: --- Summary: ToolBox compiled Spark UDAF causes java.lang.InternalError: Malformed class name Key: SPARK-23453 URL: https://issues.apache.org/jira/browse/SPARK-23453 Project: