[jira] [Commented] (SPARK-22980) Using pandas_udf when inputs are not Pandas's Series or DataFrame

2018-01-17 Thread Li Jin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22980?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16328936#comment-16328936 ] Li Jin commented on SPARK-22980: I agree with [~cloud_fan]. I think it's enough to document each args to

[jira] [Commented] (SPARK-23076) When we call cache() on RDD which depends on ShuffleRowRDD, we will get an error result

2018-01-17 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23076?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16328801#comment-16328801 ] Sean Owen commented on SPARK-23076: --- You're relying on behavior that this class doesn't provide. It

[jira] [Assigned] (SPARK-21783) Turn on ORC filter push-down by default

2018-01-17 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21783?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan reassigned SPARK-21783: --- Assignee: Dongjoon Hyun > Turn on ORC filter push-down by default >

[jira] [Resolved] (SPARK-21783) Turn on ORC filter push-down by default

2018-01-17 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21783?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-21783. - Resolution: Fixed Fix Version/s: 2.3.0 Issue resolved by pull request 20265

[jira] [Commented] (SPARK-21697) NPE & ExceptionInInitializerError trying to load UTF from HDFS

2018-01-17 Thread Steve Loughran (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21697?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16328928#comment-16328928 ] Steve Loughran commented on SPARK-21697: No, it's spark's ability to have hdfs:// URLs on the

[jira] [Commented] (SPARK-23131) Stackoverflow using ML and Kryo serializer

2018-01-17 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23131?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16328868#comment-16328868 ] Sean Owen commented on SPARK-23131: --- This requires updating Twitter Chill too, really, to 0.9.2. Have

[jira] [Commented] (SPARK-22884) ML test for StructuredStreaming: spark.ml.clustering

2018-01-17 Thread Sandor Murakozi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22884?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16328898#comment-16328898 ] Sandor Murakozi commented on SPARK-22884: - Is there anybody working on this? If not I'm happy to

[jira] [Commented] (SPARK-23125) Offset commit failed when spark-streaming batch time is more than kafkaParams session timeout.

2018-01-17 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23125?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16328787#comment-16328787 ] Sean Owen commented on SPARK-23125: --- The error message you cite, which is from the version in use,

[jira] [Resolved] (SPARK-23076) When we call cache() on RDD which depends on ShuffleRowRDD, we will get an error result

2018-01-17 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23076?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-23076. --- Resolution: Not A Problem > When we call cache() on RDD which depends on ShuffleRowRDD, we will get

[jira] [Commented] (SPARK-22886) ML test for StructuredStreaming: spark.ml.recommendation

2018-01-17 Thread Gabor Somogyi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22886?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16328874#comment-16328874 ] Gabor Somogyi commented on SPARK-22886: --- I would like to work on this. Please notify me if somebody

[jira] [Created] (SPARK-23134) WebUI is showing the cache table details even after cache idle timeout

2018-01-17 Thread Shahid K I (JIRA)
Shahid K I created SPARK-23134: -- Summary: WebUI is showing the cache table details even after cache idle timeout Key: SPARK-23134 URL: https://issues.apache.org/jira/browse/SPARK-23134 Project: Spark

[jira] [Updated] (SPARK-23134) WebUI is showing the cache table details even after cache idle timeout

2018-01-17 Thread Shahid K I (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23134?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shahid K I updated SPARK-23134: --- Description: After cachedExecutorIdleTimeout, WebUI shows the cached partition details in the

[jira] [Updated] (SPARK-23135) Accumulators don't show up properly in the Stages page anymore

2018-01-17 Thread Burak Yavuz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23135?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Burak Yavuz updated SPARK-23135: Attachment: webUIAccumulatorRegression.png > Accumulators don't show up properly in the Stages

[jira] [Updated] (SPARK-23135) Accumulators don't show up properly in the Stages page anymore

2018-01-17 Thread Burak Yavuz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23135?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Burak Yavuz updated SPARK-23135: Description: Didn't do a lot of digging but may be caused by:

[jira] [Commented] (SPARK-23011) Support alternative function form with group aggregate pandas UDF

2018-01-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23011?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16329177#comment-16329177 ] Apache Spark commented on SPARK-23011: -- User 'icexelloss' has created a pull request for this issue:

[jira] [Assigned] (SPARK-23020) Re-enable Flaky Test: org.apache.spark.launcher.SparkLauncherSuite.testInProcessLauncher

2018-01-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23020?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23020: Assignee: Apache Spark (was: Marcelo Vanzin) > Re-enable Flaky Test: >

[jira] [Created] (SPARK-23133) Spark options are not passed to the Executor in Docker context

2018-01-17 Thread Andrew Korzhuev (JIRA)
Andrew Korzhuev created SPARK-23133: --- Summary: Spark options are not passed to the Executor in Docker context Key: SPARK-23133 URL: https://issues.apache.org/jira/browse/SPARK-23133 Project: Spark

[jira] [Created] (SPARK-23135) Accumulators don't show up properly in the Stages page anymore

2018-01-17 Thread Burak Yavuz (JIRA)
Burak Yavuz created SPARK-23135: --- Summary: Accumulators don't show up properly in the Stages page anymore Key: SPARK-23135 URL: https://issues.apache.org/jira/browse/SPARK-23135 Project: Spark

[jira] [Assigned] (SPARK-23133) Spark options are not passed to the Executor in Docker context

2018-01-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23133?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23133: Assignee: (was: Apache Spark) > Spark options are not passed to the Executor in

[jira] [Commented] (SPARK-23135) Accumulators don't show up properly in the Stages page anymore

2018-01-17 Thread Burak Yavuz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23135?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16329293#comment-16329293 ] Burak Yavuz commented on SPARK-23135: - cc [~vanzin] > Accumulators don't show up properly in the

[jira] [Updated] (SPARK-23135) Accumulators don't show up properly in the Stages page anymore

2018-01-17 Thread Burak Yavuz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23135?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Burak Yavuz updated SPARK-23135: Environment:       was: Didn't do a lot of digging but may be caused by:

[jira] [Commented] (SPARK-23020) Re-enable Flaky Test: org.apache.spark.launcher.SparkLauncherSuite.testInProcessLauncher

2018-01-17 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23020?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16329065#comment-16329065 ] Marcelo Vanzin commented on SPARK-23020: Bummer. I'll try to take another look later today. >

[jira] [Updated] (SPARK-23131) Stackoverflow using ML and Kryo serializer

2018-01-17 Thread Peigen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23131?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peigen updated SPARK-23131: --- Description: When trying to use GeneralizedLinearRegression model and set SparkConf to use

[jira] [Commented] (SPARK-23131) Stackoverflow using ML and Kryo serializer

2018-01-17 Thread Peigen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23131?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16329100#comment-16329100 ] Peigen commented on SPARK-23131: I realize this happens when I try to serialize the model using

[jira] [Updated] (SPARK-23134) WebUI is showing the cache table details even after cache idle timeout

2018-01-17 Thread Shahid K I (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23134?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shahid K I updated SPARK-23134: --- Environment:  Run Cache command with below configuration to cache the RDD blocks  

[jira] [Updated] (SPARK-23134) WebUI is showing the cache table details even after cache idle timeout

2018-01-17 Thread Shahid K I (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23134?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shahid K I updated SPARK-23134: --- Environment:  Run Cache command with below configuration to cache the RDD blocks  

[jira] [Assigned] (SPARK-23133) Spark options are not passed to the Executor in Docker context

2018-01-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23133?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23133: Assignee: Apache Spark > Spark options are not passed to the Executor in Docker context >

[jira] [Commented] (SPARK-23133) Spark options are not passed to the Executor in Docker context

2018-01-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23133?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16329189#comment-16329189 ] Apache Spark commented on SPARK-23133: -- User 'andrusha' has created a pull request for this issue:

[jira] [Commented] (SPARK-22976) Worker cleanup can remove running driver directories

2018-01-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22976?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16329400#comment-16329400 ] Apache Spark commented on SPARK-22976: -- User 'RussellSpitzer' has created a pull request for this

[jira] [Commented] (SPARK-23135) Accumulators don't show up properly in the Stages page anymore

2018-01-17 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23135?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16329398#comment-16329398 ] Marcelo Vanzin commented on SPARK-23135: Nevermind, I was able to get the wrong table after some

[jira] [Commented] (SPARK-23135) Accumulators don't show up properly in the Stages page anymore

2018-01-17 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23135?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16329353#comment-16329353 ] Marcelo Vanzin commented on SPARK-23135: I'll try to take a look at the code, but do you have

[jira] [Commented] (SPARK-23020) Re-enable Flaky Test: org.apache.spark.launcher.SparkLauncherSuite.testInProcessLauncher

2018-01-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23020?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16329306#comment-16329306 ] Apache Spark commented on SPARK-23020: -- User 'vanzin' has created a pull request for this issue:

[jira] [Reopened] (SPARK-23130) Spark Thrift does not clean-up temporary files (/tmp/*_resources and /tmp/hive/*.pipeout)

2018-01-17 Thread Sean Roberts (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23130?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Roberts reopened SPARK-23130: -- > Spark Thrift does not clean-up temporary files (/tmp/*_resources and > /tmp/hive/*.pipeout) >

[jira] [Assigned] (SPARK-23110) ML 2.3 QA: API: Java compatibility, docs

2018-01-17 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23110?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley reassigned SPARK-23110: - Assignee: Weichen Xu > ML 2.3 QA: API: Java compatibility, docs >

[jira] [Commented] (SPARK-23110) ML 2.3 QA: API: Java compatibility, docs

2018-01-17 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23110?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16329489#comment-16329489 ] Joseph K. Bradley commented on SPARK-23110: --- [~WeichenXu123] said he'd take this task again

[jira] [Commented] (SPARK-23110) ML 2.3 QA: API: Java compatibility, docs

2018-01-17 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23110?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16329487#comment-16329487 ] Joseph K. Bradley commented on SPARK-23110: --- Please check out the old JIRA for this from 2.2

[jira] [Commented] (SPARK-23135) Accumulators don't show up properly in the Stages page anymore

2018-01-17 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23135?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16329362#comment-16329362 ] Marcelo Vanzin commented on SPARK-23135: (By fine I mean the table renders correctly; the

[jira] [Commented] (SPARK-23103) LevelDB store not iterating correctly when indexed value has negative value

2018-01-17 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16329380#comment-16329380 ] Marcelo Vanzin commented on SPARK-23103: Given the unit test failure in the PR it might not be as

[jira] [Assigned] (SPARK-23135) Accumulators don't show up properly in the Stages page anymore

2018-01-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23135?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23135: Assignee: (was: Apache Spark) > Accumulators don't show up properly in the Stages

[jira] [Commented] (SPARK-23135) Accumulators don't show up properly in the Stages page anymore

2018-01-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23135?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16329419#comment-16329419 ] Apache Spark commented on SPARK-23135: -- User 'vanzin' has created a pull request for this issue:

[jira] [Assigned] (SPARK-23135) Accumulators don't show up properly in the Stages page anymore

2018-01-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23135?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23135: Assignee: Apache Spark > Accumulators don't show up properly in the Stages page anymore >

[jira] [Commented] (SPARK-23130) Spark Thrift does not clean-up temporary files (/tmp/*_resources and /tmp/hive/*.pipeout)

2018-01-17 Thread Sean Roberts (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23130?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16329464#comment-16329464 ] Sean Roberts commented on SPARK-23130: -- Marco - Which JIRA resolves the pipeout issue or

[jira] [Assigned] (SPARK-23114) Spark R 2.3 QA umbrella

2018-01-17 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23114?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley reassigned SPARK-23114: - Assignee: Felix Cheung > Spark R 2.3 QA umbrella > --- > >

[jira] [Commented] (SPARK-23114) Spark R 2.3 QA umbrella

2018-01-17 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23114?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16329478#comment-16329478 ] Joseph K. Bradley commented on SPARK-23114: --- Thank you! > Spark R 2.3 QA umbrella >

[jira] [Commented] (SPARK-23103) LevelDB store not iterating correctly when indexed value has negative value

2018-01-17 Thread Imran Rashid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16329367#comment-16329367 ] Imran Rashid commented on SPARK-23103: -- [~vanzin] is this really minor, not a blocker for 2.3? Or

[jira] [Comment Edited] (SPARK-23103) LevelDB store not iterating correctly when indexed value has negative value

2018-01-17 Thread Imran Rashid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16329367#comment-16329367 ] Imran Rashid edited comment on SPARK-23103 at 1/17/18 8:07 PM: --- -[~vanzin]

[jira] [Commented] (SPARK-23109) ML 2.3 QA: API: Python API coverage

2018-01-17 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23109?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16329481#comment-16329481 ] Bryan Cutler commented on SPARK-23109: -- [~josephkb] I can take this, thanks! > ML 2.3 QA: API:

[jira] [Created] (SPARK-23128) Introduce QueryStage to improve adaptive execution in Spark SQL

2018-01-17 Thread Carson Wang (JIRA)
Carson Wang created SPARK-23128: --- Summary: Introduce QueryStage to improve adaptive execution in Spark SQL Key: SPARK-23128 URL: https://issues.apache.org/jira/browse/SPARK-23128 Project: Spark

[jira] [Assigned] (SPARK-23129) Lazy init DiskMapIterator#deserializeStream to reduce memory usage when ExternalAppendOnlyMap spill too much times

2018-01-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23129?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23129: Assignee: Apache Spark > Lazy init DiskMapIterator#deserializeStream to reduce memory

[jira] [Commented] (SPARK-23129) Lazy init DiskMapIterator#deserializeStream to reduce memory usage when ExternalAppendOnlyMap spill too much times

2018-01-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23129?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16328543#comment-16328543 ] Apache Spark commented on SPARK-23129: -- User 'caneGuy' has created a pull request for this issue:

[jira] [Assigned] (SPARK-23129) Lazy init DiskMapIterator#deserializeStream to reduce memory usage when ExternalAppendOnlyMap spill too much times

2018-01-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23129?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23129: Assignee: (was: Apache Spark) > Lazy init DiskMapIterator#deserializeStream to reduce

[jira] [Updated] (SPARK-23130) Spark Thrift does not clean-up temporary files (/tmp/*_resources and /tmp/hive/*.pipeout)

2018-01-17 Thread Sean Roberts (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23130?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Roberts updated SPARK-23130: - Environment: * Spark versions: 1.6.3, 2.1.0, 2.2.0 * Hadoop distributions: HDP 2.5 - 2.6.3.0 *

[jira] [Updated] (SPARK-23130) Spark Thrift does not clean-up temporary files (/tmp/*_resources and /tmp/hive/*.pipeout)

2018-01-17 Thread Sean Roberts (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23130?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Roberts updated SPARK-23130: - Environment: * Hadoop distributions: HDP 2.5 - 2.6.3.0 * OS: Seen on SLES12, RHEL 7.3 & RHEL

[jira] [Updated] (SPARK-23125) Offset commit failed when spark-streaming batch time is more than kafkaParams session timeout.

2018-01-17 Thread zhaoshijie (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23125?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhaoshijie updated SPARK-23125: --- Description: I find DirectKafkaInputDStream(kafka010) Offset commit failed when batch time is more

[jira] [Created] (SPARK-23130) Spark Thrift does not clean-up temporary files (/tmp/*_resources and /tmp/hive/*.pipeout)

2018-01-17 Thread Sean Roberts (JIRA)
Sean Roberts created SPARK-23130: Summary: Spark Thrift does not clean-up temporary files (/tmp/*_resources and /tmp/hive/*.pipeout) Key: SPARK-23130 URL: https://issues.apache.org/jira/browse/SPARK-23130

[jira] [Updated] (SPARK-23130) Spark Thrift does not clean-up temporary files (/tmp/*_resources and /tmp/hive/*.pipeout)

2018-01-17 Thread Sean Roberts (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23130?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Roberts updated SPARK-23130: - Labels: thrift (was: ) > Spark Thrift does not clean-up temporary files (/tmp/*_resources and

[jira] [Assigned] (SPARK-23127) Update FeatureHasher user guide for catCols parameter

2018-01-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23127?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23127: Assignee: Apache Spark > Update FeatureHasher user guide for catCols parameter >

[jira] [Updated] (SPARK-23127) Update FeatureHasher user guide for catCols parameter

2018-01-17 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23127?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nick Pentreath updated SPARK-23127: --- Description: SPARK-22801 added the {{categoricalCols}} parameter and updated the Scala and

[jira] [Created] (SPARK-23129) Lazy init DiskMapIterator#deserializeStream to reduce memory usage when ExternalAppendOnlyMap spill too much times

2018-01-17 Thread zhoukang (JIRA)
zhoukang created SPARK-23129: Summary: Lazy init DiskMapIterator#deserializeStream to reduce memory usage when ExternalAppendOnlyMap spill too much times Key: SPARK-23129 URL:

[jira] [Resolved] (SPARK-23062) EXCEPT documentation should make it clear that it's EXCEPT DISTINCT

2018-01-17 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23062?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li resolved SPARK-23062. - Resolution: Fixed Assignee: Henry Robinson Fix Version/s: 2.3.0 > EXCEPT documentation

[jira] [Created] (SPARK-23125) Offset commit failed when spark-streaming batch time is more than kafkaParams session timeout.

2018-01-17 Thread zhaoshijie (JIRA)
zhaoshijie created SPARK-23125: -- Summary: Offset commit failed when spark-streaming batch time is more than kafkaParams session timeout. Key: SPARK-23125 URL: https://issues.apache.org/jira/browse/SPARK-23125

[jira] [Updated] (SPARK-23125) Offset commit failed when spark-streaming batch time is more than kafkaParams session timeout.

2018-01-17 Thread zhaoshijie (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23125?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhaoshijie updated SPARK-23125: --- Description: I find DirectKafkaInputDStream(kafka010) Offset commit failed when batch time is more

[jira] [Created] (SPARK-23126) I used the Project operator and modified the source. After compiling successfully, and testing the jars, I got the exception. Maybe the phenomenon is related with implic

2018-01-17 Thread xuetao (JIRA)
xuetao created SPARK-23126: -- Summary: I used the Project operator and modified the source. After compiling successfully, and testing the jars, I got the exception. Maybe the phenomenon is related with implicits Key: SPARK-23126

[jira] [Comment Edited] (SPARK-23115) SparkR 2.3 QA: New R APIs and API docs

2018-01-17 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23115?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16328422#comment-16328422 ] Felix Cheung edited comment on SPARK-23115 at 1/17/18 8:01 AM: --- did this,

[jira] [Commented] (SPARK-23118) SparkR 2.3 QA: Programming guide, migration guide, vignettes updates

2018-01-17 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23118?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16328430#comment-16328430 ] Felix Cheung commented on SPARK-23118: -- did this and opened SPARK-21616 > SparkR 2.3 QA:

[jira] [Updated] (SPARK-23020) Re-enable Flaky Test: org.apache.spark.launcher.SparkLauncherSuite.testInProcessLauncher

2018-01-17 Thread Sameer Agarwal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23020?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sameer Agarwal updated SPARK-23020: --- Summary: Re-enable Flaky Test:

[jira] [Updated] (SPARK-23125) Offset commit failed when spark-streaming batch time is more than kafkaParams session timeout.

2018-01-17 Thread zhaoshijie (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23125?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhaoshijie updated SPARK-23125: --- Description: I find DirectKafkaInputDStream(kafka010) Offset commit failed when batch time is more

[jira] [Assigned] (SPARK-23020) Flaky Test: org.apache.spark.launcher.SparkLauncherSuite.testInProcessLauncher

2018-01-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23020?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23020: Assignee: Marcelo Vanzin (was: Apache Spark) > Flaky Test:

[jira] [Commented] (SPARK-23020) Flaky Test: org.apache.spark.launcher.SparkLauncherSuite.testInProcessLauncher

2018-01-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23020?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16328455#comment-16328455 ] Apache Spark commented on SPARK-23020: -- User 'sameeragarwal' has created a pull request for this

[jira] [Assigned] (SPARK-23020) Flaky Test: org.apache.spark.launcher.SparkLauncherSuite.testInProcessLauncher

2018-01-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23020?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23020: Assignee: Apache Spark (was: Marcelo Vanzin) > Flaky Test:

[jira] [Updated] (SPARK-23125) Offset commit failed when spark-streaming batch time is more than kafkaParams session timeout.

2018-01-17 Thread zhaoshijie (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23125?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhaoshijie updated SPARK-23125: --- Description: I find DirectKafkaInputDStream(kafka010) Offset commit failed when batch time is more

[jira] [Commented] (SPARK-23115) SparkR 2.3 QA: New R APIs and API docs

2018-01-17 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23115?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16328422#comment-16328422 ] Felix Cheung commented on SPARK-23115: -- did this, and opened this

[jira] [Commented] (SPARK-23114) Spark R 2.3 QA umbrella

2018-01-17 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23114?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16328419#comment-16328419 ] Felix Cheung commented on SPARK-23114: -- sure, [~josephkb] > Spark R 2.3 QA umbrella >

[jira] [Updated] (SPARK-23125) Offset commit failed when spark-streaming batch time is more than kafkaParams session timeout.

2018-01-17 Thread zhaoshijie (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23125?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhaoshijie updated SPARK-23125: --- Description: I find DirectKafkaInputDStream(kafka010) Offset commit failed when batch time is more

[jira] [Created] (SPARK-23127) Update FeatureHasher user guide for catCols parameter

2018-01-17 Thread Nick Pentreath (JIRA)
Nick Pentreath created SPARK-23127: -- Summary: Update FeatureHasher user guide for catCols parameter Key: SPARK-23127 URL: https://issues.apache.org/jira/browse/SPARK-23127 Project: Spark

[jira] [Resolved] (SPARK-23033) disable task-level retry for continuous execution

2018-01-17 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23033?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das resolved SPARK-23033. --- Resolution: Fixed Fix Version/s: 2.3.0 3.0.0 Issue resolved by

[jira] [Resolved] (SPARK-23136) Mark packages as experimental

2018-01-17 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23136?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin resolved SPARK-23136. Resolution: Duplicate I actually filed SPARK-23104 yesterday for this. > Mark packages as

[jira] [Commented] (SPARK-23104) Document that kubernetes is still "experimental"

2018-01-17 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23104?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16329618#comment-16329618 ] Marcelo Vanzin commented on SPARK-23104: That should be enough. Since there is no API the user

[jira] [Commented] (SPARK-23104) Document that kubernetes is still "experimental"

2018-01-17 Thread Anirudh Ramanathan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23104?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16329608#comment-16329608 ] Anirudh Ramanathan commented on SPARK-23104: Thanks for opening this - is this simply a

[jira] [Assigned] (SPARK-23047) Change MapVector to NullableMapVector in ArrowColumnVector

2018-01-17 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23047?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon reassigned SPARK-23047: Assignee: Li Jin > Change MapVector to NullableMapVector in ArrowColumnVector >

[jira] [Resolved] (SPARK-23047) Change MapVector to NullableMapVector in ArrowColumnVector

2018-01-17 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23047?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-23047. -- Resolution: Fixed Fix Version/s: 2.3.0 Issue resolved by pull request 20239

[jira] [Updated] (SPARK-23135) Accumulators don't show up properly in the Stages page anymore

2018-01-17 Thread Sameer Agarwal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23135?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sameer Agarwal updated SPARK-23135: --- Target Version/s: 2.3.0 > Accumulators don't show up properly in the Stages page anymore >

[jira] [Updated] (SPARK-22993) checkpointInterval param doc should be clearer

2018-01-17 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22993?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-22993: -- Fix Version/s: 2.3.0 > checkpointInterval param doc should be clearer >

[jira] [Assigned] (SPARK-23131) Stackoverflow using ML and Kryo serializer

2018-01-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23131?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23131: Assignee: (was: Apache Spark) > Stackoverflow using ML and Kryo serializer >

[jira] [Assigned] (SPARK-23131) Stackoverflow using ML and Kryo serializer

2018-01-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23131?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23131: Assignee: Apache Spark > Stackoverflow using ML and Kryo serializer >

[jira] [Commented] (SPARK-23131) Stackoverflow using ML and Kryo serializer

2018-01-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23131?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16329627#comment-16329627 ] Apache Spark commented on SPARK-23131: -- User 'srowen' has created a pull request for this issue:

[jira] [Commented] (SPARK-23016) Spark UI access and documentation

2018-01-17 Thread Anirudh Ramanathan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23016?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16329675#comment-16329675 ] Anirudh Ramanathan commented on SPARK-23016: Works as expected with port forwarding in my

[jira] [Resolved] (SPARK-23093) don't modify run id

2018-01-17 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23093?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu resolved SPARK-23093. -- Resolution: Fixed Fix Version/s: 2.3.0 Issue resolved by pull request 20282

[jira] [Assigned] (SPARK-23093) don't modify run id

2018-01-17 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23093?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu reassigned SPARK-23093: Assignee: Jose Torres > don't modify run id > --- > >

[jira] [Commented] (SPARK-23104) Document that kubernetes is still "experimental"

2018-01-17 Thread Anirudh Ramanathan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23104?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16329621#comment-16329621 ] Anirudh Ramanathan commented on SPARK-23104: [~sameerag], this is a docs-only change, will

[jira] [Resolved] (SPARK-23132) Run ml.image doctests in tests

2018-01-17 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23132?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-23132. -- Resolution: Fixed Fix Version/s: 2.3.0 Issue resolved by pull request 20294

[jira] [Commented] (SPARK-23104) Document that kubernetes is still "experimental"

2018-01-17 Thread Sameer Agarwal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23104?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16329644#comment-16329644 ] Sameer Agarwal commented on SPARK-23104: Sounds great, thanks! > Document that kubernetes is

[jira] [Created] (SPARK-23137) spark.kubernetes.executor.podNamePrefix is ignored

2018-01-17 Thread Anirudh Ramanathan (JIRA)
Anirudh Ramanathan created SPARK-23137: -- Summary: spark.kubernetes.executor.podNamePrefix is ignored Key: SPARK-23137 URL: https://issues.apache.org/jira/browse/SPARK-23137 Project: Spark

[jira] [Commented] (SPARK-23137) spark.kubernetes.executor.podNamePrefix is ignored

2018-01-17 Thread Yinan Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23137?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16329713#comment-16329713 ] Yinan Li commented on SPARK-23137: -- It's actually marked as an \{{internal}} config property. So the fix

[jira] [Commented] (SPARK-23137) spark.kubernetes.executor.podNamePrefix is ignored

2018-01-17 Thread Erik Erlandson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23137?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16329721#comment-16329721 ] Erik Erlandson commented on SPARK-23137: +1, a more general "app prefix" seems more useful >

[jira] [Commented] (SPARK-23137) spark.kubernetes.executor.podNamePrefix is ignored

2018-01-17 Thread Anirudh Ramanathan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23137?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16329731#comment-16329731 ] Anirudh Ramanathan commented on SPARK-23137: Looks like consensus is - we can get rid of the

[jira] [Created] (SPARK-23136) Mark packages as experimental

2018-01-17 Thread Anirudh Ramanathan (JIRA)
Anirudh Ramanathan created SPARK-23136: -- Summary: Mark packages as experimental Key: SPARK-23136 URL: https://issues.apache.org/jira/browse/SPARK-23136 Project: Spark Issue Type:

[jira] [Commented] (SPARK-23131) Stackoverflow using ML and Kryo serializer

2018-01-17 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23131?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16329595#comment-16329595 ] Sean Owen commented on SPARK-23131: --- OK, good to know. A kryo upgrade is overdue, and may still be

[jira] [Assigned] (SPARK-23132) Run ml.image doctests in tests

2018-01-17 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23132?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon reassigned SPARK-23132: Assignee: Hyukjin Kwon > Run ml.image doctests in tests > --

[jira] [Assigned] (SPARK-23104) Document that kubernetes is still "experimental"

2018-01-17 Thread Sameer Agarwal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23104?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sameer Agarwal reassigned SPARK-23104: -- Assignee: Anirudh Ramanathan > Document that kubernetes is still "experimental" >

  1   2   >