[jira] [Commented] (SPARK-22211) LimitPushDown optimization for FullOuterJoin generates wrong results

2017-10-05 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22211?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16194170#comment-16194170 ] Takeshi Yamamuro commented on SPARK-22211: -- Probably, the suggested solution does not work when

[jira] [Commented] (SPARK-8515) Improve ML attribute API

2017-10-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8515?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16194142#comment-16194142 ] Apache Spark commented on SPARK-8515: - User 'viirya' has created a pull request for this issue:

[jira] [Assigned] (SPARK-8515) Improve ML attribute API

2017-10-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8515?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-8515: --- Assignee: (was: Apache Spark) > Improve ML attribute API > > >

[jira] [Assigned] (SPARK-8515) Improve ML attribute API

2017-10-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8515?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-8515: --- Assignee: Apache Spark > Improve ML attribute API > > >

[jira] [Commented] (SPARK-22202) Release tgz content differences for python and R

2017-10-05 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22202?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16194141#comment-16194141 ] Felix Cheung commented on SPARK-22202: -- [~holden.ka...@gmail.com] actually, I think for R we would

[jira] [Updated] (SPARK-22202) Release tgz content differences for python and R

2017-10-05 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22202?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung updated SPARK-22202: - Priority: Minor (was: Major) > Release tgz content differences for python and R >

[jira] [Comment Edited] (SPARK-8515) Improve ML attribute API

2017-10-05 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8515?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16194074#comment-16194074 ] Liang-Chi Hsieh edited comment on SPARK-8515 at 10/6/17 4:52 AM: - I'm

[jira] [Updated] (SPARK-8515) Improve ML attribute API

2017-10-05 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8515?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Liang-Chi Hsieh updated SPARK-8515: --- Attachment: SPARK-8515.pdf > Improve ML attribute API > > >

[jira] [Commented] (SPARK-20055) Documentation for CSV datasets in SQL programming guide

2017-10-05 Thread Jorge Machado (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20055?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16194129#comment-16194129 ] Jorge Machado commented on SPARK-20055: --- [~aash] Should I copy paste that options ? And there is

[jira] [Resolved] (SPARK-22159) spark.sql.execution.arrow.enable and spark.sql.codegen.aggregate.map.twolevel.enable -> enabled

2017-10-05 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22159?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun resolved SPARK-22159. --- Resolution: Fixed Fix Version/s: 2.3.0 > spark.sql.execution.arrow.enable and >

[jira] [Resolved] (SPARK-22153) Rename ShuffleExchange -> ShuffleExchangeExec

2017-10-05 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22153?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun resolved SPARK-22153. --- Resolution: Fixed Fix Version/s: 2.3.0 > Rename ShuffleExchange ->

[jira] [Commented] (SPARK-19141) VectorAssembler metadata causing memory issues

2017-10-05 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19141?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16194075#comment-16194075 ] Liang-Chi Hsieh commented on SPARK-19141: - I'm working on a new ML attribute API (SPARK-8515)

[jira] [Commented] (SPARK-8515) Improve ML attribute API

2017-10-05 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8515?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16194074#comment-16194074 ] Liang-Chi Hsieh commented on SPARK-8515: I'm working on a new ML attribute API which is supposed

[jira] [Created] (SPARK-22211) LimitPushDown optimization for FullOuterJoin generates wrong results

2017-10-05 Thread Benyi Wang (JIRA)
Benyi Wang created SPARK-22211: -- Summary: LimitPushDown optimization for FullOuterJoin generates wrong results Key: SPARK-22211 URL: https://issues.apache.org/jira/browse/SPARK-22211 Project: Spark

[jira] [Updated] (SPARK-22163) Design Issue of Spark Streaming that Causes Random Run-time Exception

2017-10-05 Thread Michael N (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22163?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael N updated SPARK-22163: -- Description: The application objects can contain List and can be modified dynamically as well.

[jira] [Comment Edited] (SPARK-22163) Design Issue of Spark Streaming that Causes Random Run-time Exception

2017-10-05 Thread Michael N (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22163?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16193907#comment-16193907 ] Michael N edited comment on SPARK-22163 at 10/6/17 12:24 AM: - Vadim Semenov

[jira] [Comment Edited] (SPARK-22163) Design Issue of Spark Streaming that Causes Random Run-time Exception

2017-10-05 Thread Michael N (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22163?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16193907#comment-16193907 ] Michael N edited comment on SPARK-22163 at 10/6/17 12:21 AM: - Vadim Semenov

[jira] [Comment Edited] (SPARK-22163) Design Issue of Spark Streaming that Causes Random Run-time Exception

2017-10-05 Thread Michael N (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22163?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16193907#comment-16193907 ] Michael N edited comment on SPARK-22163 at 10/6/17 12:18 AM: - Vadim Semenov

[jira] [Updated] (SPARK-22163) Design Issue of Spark Streaming that Causes Random Run-time Exception

2017-10-05 Thread Michael N (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22163?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael N updated SPARK-22163: -- Description: The application objects can contain List and can be modified dynamically as well.

[jira] [Comment Edited] (SPARK-22163) Design Issue of Spark Streaming that Causes Random Run-time Exception

2017-10-05 Thread Michael N (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22163?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16193907#comment-16193907 ] Michael N edited comment on SPARK-22163 at 10/5/17 11:47 PM: - Vadim Semenov

[jira] [Comment Edited] (SPARK-22163) Design Issue of Spark Streaming that Causes Random Run-time Exception

2017-10-05 Thread Michael N (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22163?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16193907#comment-16193907 ] Michael N edited comment on SPARK-22163 at 10/5/17 11:42 PM: - Vadim Semenov

[jira] [Commented] (SPARK-21999) ConcurrentModificationException - Spark Streaming

2017-10-05 Thread Michael N (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21999?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16193923#comment-16193923 ] Michael N commented on SPARK-21999: --- Vadim, It relates to serialization. I confirmed that it is due to

[jira] [Commented] (SPARK-18131) Support returning Vector/Dense Vector from backend

2017-10-05 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18131?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16193916#comment-16193916 ] Xiao Li commented on SPARK-18131: - cc [~WeichenXu123] > Support returning Vector/Dense Vector from

[jira] [Commented] (SPARK-22163) Design Issue of Spark Streaming that Causes Random Run-time Exception

2017-10-05 Thread Michael N (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22163?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16193907#comment-16193907 ] Michael N commented on SPARK-22163: --- Vadim Semenov and Steve Loughran, per your inquiries in ticket

[jira] [Commented] (SPARK-18131) Support returning Vector/Dense Vector from backend

2017-10-05 Thread Miao Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18131?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16193891#comment-16193891 ] Miao Wang commented on SPARK-18131: --- [~felixcheung] We got stuck at the data types definitions. There

[jira] [Comment Edited] (SPARK-21999) ConcurrentModificationException - Spark Streaming

2017-10-05 Thread Michael N (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21999?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16193840#comment-16193840 ] Michael N edited comment on SPARK-21999 at 10/5/17 10:33 PM: - Steve, I'd keep

[jira] [Commented] (SPARK-22195) Add cosine similarity to org.apache.spark.ml.linalg.Vectors

2017-10-05 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22195?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16193844#comment-16193844 ] yuhao yang commented on SPARK-22195: Thanks for the feedback. I don't see the existing

[jira] [Commented] (SPARK-21999) ConcurrentModificationException - Spark Streaming

2017-10-05 Thread Michael N (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21999?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16193840#comment-16193840 ] Michael N commented on SPARK-21999: --- Steve, I'd keep personal opinions of another person separate from

[jira] [Comment Edited] (SPARK-21742) BisectingKMeans generate different models with/without caching

2017-10-05 Thread Ilya Matiach (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21742?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16193691#comment-16193691 ] Ilya Matiach edited comment on SPARK-21742 at 10/5/17 9:12 PM: ---

[jira] [Commented] (SPARK-21742) BisectingKMeans generate different models with/without caching

2017-10-05 Thread Ilya Matiach (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21742?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16193691#comment-16193691 ] Ilya Matiach commented on SPARK-21742: -- [~podongfeng] The test was just validating that the edge

[jira] [Commented] (SPARK-16473) BisectingKMeans Algorithm failing with java.util.NoSuchElementException: key not found

2017-10-05 Thread Ilya Matiach (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16473?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16193675#comment-16193675 ] Ilya Matiach commented on SPARK-16473: -- [~podongfeng] interesting - it looks like the dataset

[jira] [Created] (SPARK-22210) Online LDA variationalTopicInference should use random seed to have stable behavior

2017-10-05 Thread yuhao yang (JIRA)
yuhao yang created SPARK-22210: -- Summary: Online LDA variationalTopicInference should use random seed to have stable behavior Key: SPARK-22210 URL: https://issues.apache.org/jira/browse/SPARK-22210

[jira] [Updated] (SPARK-22188) Add defense against Cross-Site Scripting, MIME-sniffing and MitM attack

2017-10-05 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22188?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-22188: -- Affects Version/s: 2.0.2 2.1.1 > Add defense against Cross-Site

[jira] [Created] (SPARK-22209) PySpark does not recognize imports from submodules

2017-10-05 Thread Joel Croteau (JIRA)
Joel Croteau created SPARK-22209: Summary: PySpark does not recognize imports from submodules Key: SPARK-22209 URL: https://issues.apache.org/jira/browse/SPARK-22209 Project: Spark Issue

[jira] [Updated] (SPARK-22200) Kinesis Receivers stops if Kinesis stream was re-sharded

2017-10-05 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22200?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu updated SPARK-22200: - Component/s: (was: Spark Core) DStreams > Kinesis Receivers stops if

[jira] [Commented] (SPARK-21926) Some transformers in spark.ml.feature fail when trying to transform streaming dataframes

2017-10-05 Thread Bago Amirbekian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21926?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16193392#comment-16193392 ] Bago Amirbekian commented on SPARK-21926: - [~mslipper] The trickiest thing about 1 (b) is knowing

[jira] [Commented] (SPARK-22077) RpcEndpointAddress fails to parse spark URL if it is an ipv6 address.

2017-10-05 Thread Sayat Satybaldiyev (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22077?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16193391#comment-16193391 ] Sayat Satybaldiyev commented on SPARK-22077: yes, when I enclose IPv6 with square brackets

[jira] [Commented] (SPARK-21871) Check actual bytecode size when compiling generated code

2017-10-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21871?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16193390#comment-16193390 ] Apache Spark commented on SPARK-21871: -- User 'gatorsmile' has created a pull request for this issue:

[jira] [Commented] (SPARK-13030) Change OneHotEncoder to Estimator

2017-10-05 Thread Bago Amirbekian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13030?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16193389#comment-16193389 ] Bago Amirbekian commented on SPARK-13030: - Just so I'm clear, does multi-column in this context

[jira] [Commented] (SPARK-20055) Documentation for CSV datasets in SQL programming guide

2017-10-05 Thread Andrew Ash (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20055?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16193355#comment-16193355 ] Andrew Ash commented on SPARK-20055: What I would find most useful is a list of available options and

[jira] [Updated] (SPARK-15682) Hive partition write looks for root hdfs folder for existence

2017-10-05 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15682?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-15682: -- Summary: Hive partition write looks for root hdfs folder for existence (was: Hive ORC

[jira] [Updated] (SPARK-14286) Empty ORC table join throws exception

2017-10-05 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14286?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-14286: -- Component/s: SQL > Empty ORC table join throws exception >

[jira] [Commented] (SPARK-14286) Empty ORC table join throws exception

2017-10-05 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14286?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16193344#comment-16193344 ] Dongjoon Hyun commented on SPARK-14286: --- Hi, [~rajesh.balamohan]. How can I reproduce this? When I

[jira] [Resolved] (SPARK-17047) Spark 2 cannot create table when CLUSTERED.

2017-10-05 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17047?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun resolved SPARK-17047. --- Resolution: Fixed Fix Version/s: 2.3.0 This is resolved by SPARK-17729. {code}

[jira] [Updated] (SPARK-17047) Spark 2 cannot create table when CLUSTERED.

2017-10-05 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17047?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-17047: -- Component/s: SQL > Spark 2 cannot create table when CLUSTERED. >

[jira] [Updated] (SPARK-17047) Spark 2 cannot create table when CLUSTERED.

2017-10-05 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17047?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-17047: -- Affects Version/s: 2.1.1 2.2.0 > Spark 2 cannot create table when

[jira] [Updated] (SPARK-17047) Spark 2 cannot create table when CLUSTERED.

2017-10-05 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17047?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-17047: -- Summary: Spark 2 cannot create table when CLUSTERED. (was: Spark 2 cannot create ORC table

[jira] [Commented] (SPARK-22202) Release tgz content differences for python and R

2017-10-05 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22202?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16193247#comment-16193247 ] holdenk commented on SPARK-22202: - [~felixcheung] for Python I think it would not be bad to be

[jira] [Comment Edited] (SPARK-21785) Support create table from a file schema

2017-10-05 Thread Jacky Shen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21785?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16193209#comment-16193209 ] Jacky Shen edited comment on SPARK-21785 at 10/5/17 5:28 PM: - Yes, we can

[jira] [Commented] (SPARK-22202) Release tgz content differences for python and R

2017-10-05 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22202?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16193239#comment-16193239 ] Felix Cheung commented on SPARK-22202: -- [~holden.ka...@gmail.com] would you be concerned with the

[jira] [Commented] (SPARK-22202) Release tgz content differences for python and R

2017-10-05 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22202?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16193238#comment-16193238 ] Felix Cheung commented on SPARK-22202: -- Yes, exactly. > Release tgz content differences for python

[jira] [Comment Edited] (SPARK-21785) Support create table from a file schema

2017-10-05 Thread Jacky Shen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21785?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16193209#comment-16193209 ] Jacky Shen edited comment on SPARK-21785 at 10/5/17 5:12 PM: - Yes, we can

[jira] [Commented] (SPARK-21785) Support create table from a file schema

2017-10-05 Thread Jacky Shen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21785?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16193209#comment-16193209 ] Jacky Shen commented on SPARK-21785: Yes, we can create table from existing parquet file and it is a

[jira] [Assigned] (SPARK-21866) SPIP: Image support in Spark

2017-10-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21866?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-21866: Assignee: Apache Spark > SPIP: Image support in Spark > > >

[jira] [Assigned] (SPARK-21866) SPIP: Image support in Spark

2017-10-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21866?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-21866: Assignee: (was: Apache Spark) > SPIP: Image support in Spark >

[jira] [Resolved] (SPARK-22179) percentile_approx should choose the first element if it already reaches the percentage

2017-10-05 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22179?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-22179. --- Resolution: Duplicate OK I think this effectively taken over by the new issue? > percentile_approx

[jira] [Commented] (SPARK-21866) SPIP: Image support in Spark

2017-10-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21866?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16193212#comment-16193212 ] Apache Spark commented on SPARK-21866: -- User 'imatiach-msft' has created a pull request for this

[jira] [Commented] (SPARK-19984) ERROR codegen.CodeGenerator: failed to compile: org.codehaus.commons.compiler.CompileException: File 'generated.java'

2017-10-05 Thread Kazuaki Ishizaki (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19984?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16193201#comment-16193201 ] Kazuaki Ishizaki commented on SPARK-19984: -- [~JohnSteidley] Thank you for your comment. Now, I

[jira] [Commented] (SPARK-19984) ERROR codegen.CodeGenerator: failed to compile: org.codehaus.commons.compiler.CompileException: File 'generated.java'

2017-10-05 Thread John Steidley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19984?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16193120#comment-16193120 ] John Steidley commented on SPARK-19984: --- [~kiszk] I noticed one other difference in our plans:

[jira] [Commented] (SPARK-22202) Release tgz content differences for python and R

2017-10-05 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22202?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16193111#comment-16193111 ] Shivaram Venkataraman commented on SPARK-22202: --- I think the differences happen because we

[jira] [Commented] (SPARK-22208) Improve percentile_approx by not rounding up targetError and starting from index 0

2017-10-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22208?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16193078#comment-16193078 ] Apache Spark commented on SPARK-22208: -- User 'wzhfy' has created a pull request for this issue:

[jira] [Assigned] (SPARK-22208) Improve percentile_approx by not rounding up targetError and starting from index 0

2017-10-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22208?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22208: Assignee: Apache Spark > Improve percentile_approx by not rounding up targetError and

[jira] [Assigned] (SPARK-22208) Improve percentile_approx by not rounding up targetError and starting from index 0

2017-10-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22208?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22208: Assignee: (was: Apache Spark) > Improve percentile_approx by not rounding up

[jira] [Created] (SPARK-22208) Improve percentile_approx by not rounding up targetError and starting from index 0

2017-10-05 Thread Zhenhua Wang (JIRA)
Zhenhua Wang created SPARK-22208: Summary: Improve percentile_approx by not rounding up targetError and starting from index 0 Key: SPARK-22208 URL: https://issues.apache.org/jira/browse/SPARK-22208

[jira] [Assigned] (SPARK-22206) gapply in R can't work on empty grouping columns

2017-10-05 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22206?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon reassigned SPARK-22206: Assignee: Liang-Chi Hsieh > gapply in R can't work on empty grouping columns >

[jira] [Resolved] (SPARK-22206) gapply in R can't work on empty grouping columns

2017-10-05 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22206?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-22206. -- Resolution: Fixed Fix Version/s: 2.1.3 2.3.0 2.2.1

[jira] [Commented] (SPARK-22201) Dataframe describe includes string columns

2017-10-05 Thread cold gin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22201?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16192938#comment-16192938 ] cold gin commented on SPARK-22201: -- Ok, and thank you, I appreciate your time and feedback. Having the

[jira] [Updated] (SPARK-22201) Dataframe describe includes string columns

2017-10-05 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22201?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-22201: -- Flags: (was: Patch) Priority: Minor (was: Major) Issue Type: Improvement (was: Bug)

[jira] [Comment Edited] (SPARK-21999) ConcurrentModificationException - Spark Streaming

2017-10-05 Thread Steve Loughran (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21999?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16192732#comment-16192732 ] Steve Loughran edited comment on SPARK-21999 at 10/5/17 1:39 PM: - Apache

[jira] [Comment Edited] (SPARK-22201) Dataframe describe includes string columns

2017-10-05 Thread cold gin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22201?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16192834#comment-16192834 ] cold gin edited comment on SPARK-22201 at 10/5/17 1:24 PM: --- Yes, it is only the

[jira] [Comment Edited] (SPARK-22201) Dataframe describe includes string columns

2017-10-05 Thread cold gin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22201?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16192834#comment-16192834 ] cold gin edited comment on SPARK-22201 at 10/5/17 1:23 PM: --- Yes, it is only the

[jira] [Comment Edited] (SPARK-22201) Dataframe describe includes string columns

2017-10-05 Thread cold gin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22201?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16192834#comment-16192834 ] cold gin edited comment on SPARK-22201 at 10/5/17 1:22 PM: --- Yes, it is only the

[jira] [Comment Edited] (SPARK-22201) Dataframe describe includes string columns

2017-10-05 Thread cold gin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22201?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16192834#comment-16192834 ] cold gin edited comment on SPARK-22201 at 10/5/17 1:19 PM: --- Yes, it is only the

[jira] [Comment Edited] (SPARK-22201) Dataframe describe includes string columns

2017-10-05 Thread cold gin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22201?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16192834#comment-16192834 ] cold gin edited comment on SPARK-22201 at 10/5/17 12:58 PM: Yes, it is only

[jira] [Comment Edited] (SPARK-22201) Dataframe describe includes string columns

2017-10-05 Thread cold gin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22201?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16192834#comment-16192834 ] cold gin edited comment on SPARK-22201 at 10/5/17 12:58 PM: Yes, it is only

[jira] [Commented] (SPARK-22201) Dataframe describe includes string columns

2017-10-05 Thread cold gin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22201?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16192834#comment-16192834 ] cold gin commented on SPARK-22201: -- Yes, it is only the default behavior that I think should be

[jira] [Commented] (SPARK-22131) Add Mesos Secrets Support to the Mesos Driver

2017-10-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22131?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16192799#comment-16192799 ] Apache Spark commented on SPARK-22131: -- User 'susanxhuynh' has created a pull request for this

[jira] [Commented] (SPARK-21999) ConcurrentModificationException - Spark Streaming

2017-10-05 Thread Steve Loughran (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21999?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16192732#comment-16192732 ] Steve Loughran commented on SPARK-21999: Apache projects are all open source, with an open

[jira] [Updated] (SPARK-22163) Design Issue of Spark Streaming that Causes Random Run-time Exception

2017-10-05 Thread Steve Loughran (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22163?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Steve Loughran updated SPARK-22163: --- Priority: Major (was: Critical) > Design Issue of Spark Streaming that Causes Random

[jira] [Created] (SPARK-22207) High memory usage when converting relational data to Hierarchical data

2017-10-05 Thread kanika dhuria (JIRA)
kanika dhuria created SPARK-22207: - Summary: High memory usage when converting relational data to Hierarchical data Key: SPARK-22207 URL: https://issues.apache.org/jira/browse/SPARK-22207 Project: