[jira] [Created] (SPARK-22801) Allow FeatureHasher to specify numeric columns to treat as categorical

2017-12-15 Thread Nick Pentreath (JIRA)
Nick Pentreath created SPARK-22801: -- Summary: Allow FeatureHasher to specify numeric columns to treat as categorical Key: SPARK-22801 URL: https://issues.apache.org/jira/browse/SPARK-22801 Project:

[jira] [Commented] (SPARK-16087) Spark Hangs When Using Union With Persisted Hadoop RDD

2017-12-15 Thread Norbert Schultz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16087?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16292355#comment-16292355 ] Norbert Schultz commented on SPARK-16087: - We recently had a similar issue, which at the end was

[jira] [Commented] (SPARK-22800) Add a SSB query suite

2017-12-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22800?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16292343#comment-16292343 ] Apache Spark commented on SPARK-22800: -- User 'maropu' has created a pull request for this issue:

[jira] [Assigned] (SPARK-22800) Add a SSB query suite

2017-12-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22800?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22800: Assignee: Apache Spark > Add a SSB query suite > - > >

[jira] [Assigned] (SPARK-22800) Add a SSB query suite

2017-12-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22800?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22800: Assignee: (was: Apache Spark) > Add a SSB query suite > - > >

[jira] [Created] (SPARK-22800) Add a SSB query suite

2017-12-15 Thread Takeshi Yamamuro (JIRA)
Takeshi Yamamuro created SPARK-22800: Summary: Add a SSB query suite Key: SPARK-22800 URL: https://issues.apache.org/jira/browse/SPARK-22800 Project: Spark Issue Type: Test

[jira] [Comment Edited] (SPARK-22793) Memory leak in Spark Thrift Server

2017-12-15 Thread zuotingbing (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22793?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16292314#comment-16292314 ] zuotingbing edited comment on SPARK-22793 at 12/15/17 10:48 AM: yes the

[jira] [Commented] (SPARK-22465) Cogroup of two disproportionate RDDs could lead into 2G limit BUG

2017-12-15 Thread Sujith Jay Nair (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22465?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16292327#comment-16292327 ] Sujith Jay Nair commented on SPARK-22465: - Hi [~tgraves], is there a plan to resolve this

[jira] [Comment Edited] (SPARK-8418) Add single- and multi-value support to ML Transformers

2017-12-15 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8418?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16292320#comment-16292320 ] Nick Pentreath edited comment on SPARK-8418 at 12/15/17 10:40 AM: --

[jira] [Updated] (SPARK-22799) Bucketizer should throw exception if single- and multi-column params are both set

2017-12-15 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22799?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nick Pentreath updated SPARK-22799: --- Issue Type: Improvement (was: New Feature) > Bucketizer should throw exception if single-

[jira] [Commented] (SPARK-8418) Add single- and multi-value support to ML Transformers

2017-12-15 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8418?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16292320#comment-16292320 ] Nick Pentreath commented on SPARK-8418: --- Created SPARK-22796, SPARK-22797 and SPARK-22798 to track

[jira] [Created] (SPARK-22799) Bucketizer should throw exception if single- and multi-column params are both set

2017-12-15 Thread Nick Pentreath (JIRA)
Nick Pentreath created SPARK-22799: -- Summary: Bucketizer should throw exception if single- and multi-column params are both set Key: SPARK-22799 URL: https://issues.apache.org/jira/browse/SPARK-22799

[jira] [Created] (SPARK-22798) Add multiple column support to PySpark StringIndexer

2017-12-15 Thread Nick Pentreath (JIRA)
Nick Pentreath created SPARK-22798: -- Summary: Add multiple column support to PySpark StringIndexer Key: SPARK-22798 URL: https://issues.apache.org/jira/browse/SPARK-22798 Project: Spark

[jira] [Created] (SPARK-22797) Add multiple column support to PySpark Bucketizer

2017-12-15 Thread Nick Pentreath (JIRA)
Nick Pentreath created SPARK-22797: -- Summary: Add multiple column support to PySpark Bucketizer Key: SPARK-22797 URL: https://issues.apache.org/jira/browse/SPARK-22797 Project: Spark Issue

[jira] [Commented] (SPARK-22793) Memory leak in Spark Thrift Server

2017-12-15 Thread zuotingbing (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22793?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16292314#comment-16292314 ] zuotingbing commented on SPARK-22793: - yes the master branch also has this problem,but the different

[jira] [Created] (SPARK-22796) Add multiple column support to PySpark QuantileDiscretizer

2017-12-15 Thread Nick Pentreath (JIRA)
Nick Pentreath created SPARK-22796: -- Summary: Add multiple column support to PySpark QuantileDiscretizer Key: SPARK-22796 URL: https://issues.apache.org/jira/browse/SPARK-22796 Project: Spark

[jira] [Assigned] (SPARK-22600) Fix 64kb limit for deeply nested expressions under wholestage codegen

2017-12-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22600?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22600: Assignee: Apache Spark (was: Liang-Chi Hsieh) > Fix 64kb limit for deeply nested

[jira] [Assigned] (SPARK-22600) Fix 64kb limit for deeply nested expressions under wholestage codegen

2017-12-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22600?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22600: Assignee: Liang-Chi Hsieh (was: Apache Spark) > Fix 64kb limit for deeply nested

[jira] [Assigned] (SPARK-22793) Memory leak in Spark Thrift Server

2017-12-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22793?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22793: Assignee: (was: Apache Spark) > Memory leak in Spark Thrift Server >

[jira] [Assigned] (SPARK-22793) Memory leak in Spark Thrift Server

2017-12-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22793?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22793: Assignee: Apache Spark > Memory leak in Spark Thrift Server >

[jira] [Commented] (SPARK-22793) Memory leak in Spark Thrift Server

2017-12-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22793?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16292307#comment-16292307 ] Apache Spark commented on SPARK-22793: -- User 'zuotingbing' has created a pull request for this

[jira] [Commented] (SPARK-22792) PySpark UDF registering issue

2017-12-15 Thread Annamalai Venugopal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22792?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16292266#comment-16292266 ] Annamalai Venugopal commented on SPARK-22792: - which is in the cloudpickle.py > PySpark UDF

[jira] [Commented] (SPARK-22792) PySpark UDF registering issue

2017-12-15 Thread Annamalai Venugopal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22792?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16292265#comment-16292265 ] Annamalai Venugopal commented on SPARK-22792: - its throwing the following exeption:

[jira] [Commented] (SPARK-22795) Raise error when line search in FirstOrderMinimizer failed

2017-12-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22795?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16292258#comment-16292258 ] Apache Spark commented on SPARK-22795: -- User 'mrkm4ntr' has created a pull request for this issue:

[jira] [Assigned] (SPARK-22795) Raise error when line search in FirstOrderMinimizer failed

2017-12-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22795?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22795: Assignee: Apache Spark > Raise error when line search in FirstOrderMinimizer failed >

[jira] [Assigned] (SPARK-22795) Raise error when line search in FirstOrderMinimizer failed

2017-12-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22795?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22795: Assignee: (was: Apache Spark) > Raise error when line search in FirstOrderMinimizer

[jira] [Commented] (SPARK-22793) Memory leak in Spark Thrift Server

2017-12-15 Thread zuotingbing (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22793?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16292248#comment-16292248 ] zuotingbing commented on SPARK-22793: - ok, i will try to check it in master branch. Thanks. > Memory

[jira] [Created] (SPARK-22795) Raise error when line search in FirstOrderMinimizer failed

2017-12-15 Thread Shintaro Murakami (JIRA)
Shintaro Murakami created SPARK-22795: - Summary: Raise error when line search in FirstOrderMinimizer failed Key: SPARK-22795 URL: https://issues.apache.org/jira/browse/SPARK-22795 Project: Spark

[jira] [Comment Edited] (SPARK-22371) dag-scheduler-event-loop thread stopped with error Attempted to access garbage collected accumulator 5605982

2017-12-15 Thread Mayank Agarwal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22371?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16292134#comment-16292134 ] Mayank Agarwal edited comment on SPARK-22371 at 12/15/17 9:01 AM: -- Hi,

[jira] [Comment Edited] (SPARK-22371) dag-scheduler-event-loop thread stopped with error Attempted to access garbage collected accumulator 5605982

2017-12-15 Thread Mayank Agarwal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22371?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16292134#comment-16292134 ] Mayank Agarwal edited comment on SPARK-22371 at 12/15/17 9:01 AM: -- Hi,

[jira] [Comment Edited] (SPARK-22371) dag-scheduler-event-loop thread stopped with error Attempted to access garbage collected accumulator 5605982

2017-12-15 Thread Mayank Agarwal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22371?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16292134#comment-16292134 ] Mayank Agarwal edited comment on SPARK-22371 at 12/15/17 8:56 AM: -- Hi,

[jira] [Commented] (SPARK-22793) Memory leak in Spark Thrift Server

2017-12-15 Thread Marco Gaido (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22793?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16292213#comment-16292213 ] Marco Gaido commented on SPARK-22793: - Have you tried if the problem still exists in current master

[jira] [Updated] (SPARK-22371) dag-scheduler-event-loop thread stopped with error Attempted to access garbage collected accumulator 5605982

2017-12-15 Thread Mayank Agarwal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22371?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mayank Agarwal updated SPARK-22371: --- Attachment: (was: ShuffleIssue.java) > dag-scheduler-event-loop thread stopped with

[jira] [Updated] (SPARK-22371) dag-scheduler-event-loop thread stopped with error Attempted to access garbage collected accumulator 5605982

2017-12-15 Thread Mayank Agarwal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22371?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mayank Agarwal updated SPARK-22371: --- Attachment: (was: ShuffleIssue.java) > dag-scheduler-event-loop thread stopped with

[jira] [Updated] (SPARK-22371) dag-scheduler-event-loop thread stopped with error Attempted to access garbage collected accumulator 5605982

2017-12-15 Thread Mayank Agarwal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22371?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mayank Agarwal updated SPARK-22371: --- Attachment: sampledata > dag-scheduler-event-loop thread stopped with error Attempted to

[jira] [Updated] (SPARK-22371) dag-scheduler-event-loop thread stopped with error Attempted to access garbage collected accumulator 5605982

2017-12-15 Thread Mayank Agarwal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22371?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mayank Agarwal updated SPARK-22371: --- Attachment: ShuffleIssue.java > dag-scheduler-event-loop thread stopped with error

[jira] [Commented] (SPARK-22792) PySpark UDF registering issue

2017-12-15 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22792?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16292187#comment-16292187 ] Hyukjin Kwon commented on SPARK-22792: -- PySpark pickles and unpickles Python objects and I wanted to

[jira] [Commented] (SPARK-22793) Memory leak in Spark Thrift Server

2017-12-15 Thread zuotingbing (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22793?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16292174#comment-16292174 ] zuotingbing commented on SPARK-22793: - {code:java} lazy val metadataHive: HiveClient =

[jira] [Commented] (SPARK-22792) PySpark UDF registering issue

2017-12-15 Thread Annamalai Venugopal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22792?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16292166#comment-16292166 ] Annamalai Venugopal commented on SPARK-22792: - I am totally new to the spark and python i

[jira] [Updated] (SPARK-22792) PySpark UDF registering issue

2017-12-15 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22792?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon updated SPARK-22792: - Issue Type: Bug (was: Question) > PySpark UDF registering issue > -

<    1   2