[jira] [Commented] (SPARK-12117) Column Aliases are Ignored in callUDF while using struct()

2016-03-09 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12117?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15188593#comment-15188593 ] Liang-Chi Hsieh commented on SPARK-12117: - As I revisit this PR and find that this bug is already

[jira] [Created] (SPARK-13794) Rename DataFrameWriter.stream DataFrameWriter.startStream

2016-03-09 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-13794: --- Summary: Rename DataFrameWriter.stream DataFrameWriter.startStream Key: SPARK-13794 URL: https://issues.apache.org/jira/browse/SPARK-13794 Project: Spark

[jira] [Assigned] (SPARK-13794) Rename DataFrameWriter.stream DataFrameWriter.startStream

2016-03-09 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13794?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13794: Assignee: Reynold Xin (was: Apache Spark) > Rename DataFrameWriter.stream

[jira] [Assigned] (SPARK-13794) Rename DataFrameWriter.stream DataFrameWriter.startStream

2016-03-09 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13794?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13794: Assignee: Apache Spark (was: Reynold Xin) > Rename DataFrameWriter.stream

[jira] [Deleted] (SPARK-10813) API design: high level class structuring regarding windowed and non-windowed streams

2016-03-09 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10813?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin deleted SPARK-10813: > API design: high level class structuring regarding windowed and non-windowed > streams >

[jira] [Deleted] (SPARK-10819) Logical plan: determine logical operators needed

2016-03-09 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10819?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin deleted SPARK-10819: > Logical plan: determine logical operators needed >

[jira] [Resolved] (SPARK-13146) API for managing streaming dataframes

2016-03-09 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13146?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-13146. - Resolution: Fixed Fix Version/s: 2.0.0 > API for managing streaming dataframes >

[jira] [Deleted] (SPARK-10818) Query optimization: investigate whether we need a separate optimizer from Spark SQL's

2016-03-09 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10818?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin deleted SPARK-10818: > Query optimization: investigate whether we need a separate optimizer from > Spark SQL's >

[jira] [Commented] (SPARK-13794) Rename DataFrameWriter.stream DataFrameWriter.startStream

2016-03-09 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13794?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15188598#comment-15188598 ] Apache Spark commented on SPARK-13794: -- User 'rxin' has created a pull request for this issue:

[jira] [Updated] (SPARK-3374) Spark on Yarn remove deprecated configs for 2.0

2016-03-09 Thread Thomas Graves (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3374?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Thomas Graves updated SPARK-3374: - Assignee: Boyang Jerry Peng > Spark on Yarn remove deprecated configs for 2.0 >

[jira] [Commented] (SPARK-13774) IllegalArgumentException: Can not create a Path from an empty string for incorrect file path

2016-03-09 Thread Jacek Laskowski (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13774?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15187162#comment-15187162 ] Jacek Laskowski commented on SPARK-13774: - It's today's build. No other changes and the line

[jira] [Commented] (SPARK-3374) Spark on Yarn remove deprecated configs for 2.0

2016-03-09 Thread Thomas Graves (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3374?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15187141#comment-15187141 ] Thomas Graves commented on SPARK-3374: -- It seems this work is also being done under

[jira] [Commented] (SPARK-13311) prettyString of IN is not good

2016-03-09 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13311?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15188412#comment-15188412 ] Apache Spark commented on SPARK-13311: -- User 'davies' has created a pull request for this issue:

[jira] [Assigned] (SPARK-13787) Feature importances for decision trees in Python

2016-03-09 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13787?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13787: Assignee: Apache Spark > Feature importances for decision trees in Python >

[jira] [Assigned] (SPARK-13787) Feature importances for decision trees in Python

2016-03-09 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13787?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13787: Assignee: (was: Apache Spark) > Feature importances for decision trees in Python >

[jira] [Commented] (SPARK-13787) Feature importances for decision trees in Python

2016-03-09 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13787?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15188414#comment-15188414 ] Apache Spark commented on SPARK-13787: -- User 'sethah' has created a pull request for this issue:

[jira] [Assigned] (SPARK-13430) Expose ml summary function in PySpark for classification and regression models

2016-03-09 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13430?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13430: Assignee: Apache Spark > Expose ml summary function in PySpark for classification and

[jira] [Commented] (SPARK-13430) Expose ml summary function in PySpark for classification and regression models

2016-03-09 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13430?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15188336#comment-15188336 ] Apache Spark commented on SPARK-13430: -- User 'BryanCutler' has created a pull request for this

[jira] [Assigned] (SPARK-13430) Expose ml summary function in PySpark for classification and regression models

2016-03-09 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13430?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13430: Assignee: (was: Apache Spark) > Expose ml summary function in PySpark for

[jira] [Updated] (SPARK-13125) makes the ratio of KafkaRDD partition to kafka topic partition configurable.

2016-03-09 Thread zhengcanbin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13125?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengcanbin updated SPARK-13125: Attachment: 13134.patch > makes the ratio of KafkaRDD partition to kafka topic partition

[jira] [Updated] (SPARK-13134) add 'spark.streaming.kafka.partition.multiplier' into SparkConf

2016-03-09 Thread zhengcanbin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13134?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengcanbin updated SPARK-13134: Attachment: 13134.patch > add 'spark.streaming.kafka.partition.multiplier' into SparkConf >

[jira] [Created] (SPARK-13790) Speed up ColumnVector's getDecimal

2016-03-09 Thread Nong Li (JIRA)
Nong Li created SPARK-13790: --- Summary: Speed up ColumnVector's getDecimal Key: SPARK-13790 URL: https://issues.apache.org/jira/browse/SPARK-13790 Project: Spark Issue Type: Improvement

[jira] [Commented] (SPARK-12555) Datasets: data is corrupted when input data is reordered

2016-03-09 Thread Luciano Resende (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12555?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15188447#comment-15188447 ] Luciano Resende commented on SPARK-12555: - This issue is still reproducible in Spark 1.6.x but

[jira] [Commented] (SPARK-13068) Extend pyspark ml paramtype conversion to support lists

2016-03-09 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13068?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15188319#comment-15188319 ] Joseph K. Bradley commented on SPARK-13068: --- You're right that the current implementation would

[jira] [Assigned] (SPARK-13761) Deprecate validateParams

2016-03-09 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13761?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13761: Assignee: (was: Apache Spark) > Deprecate validateParams > >

[jira] [Assigned] (SPARK-13761) Deprecate validateParams

2016-03-09 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13761?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13761: Assignee: Apache Spark > Deprecate validateParams > > >

[jira] [Commented] (SPARK-13761) Deprecate validateParams

2016-03-09 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13761?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15188329#comment-15188329 ] Apache Spark commented on SPARK-13761: -- User 'hhbyyh' has created a pull request for this issue:

[jira] [Commented] (SPARK-13783) Model export/import for spark.ml: GBTs

2016-03-09 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13783?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15188358#comment-15188358 ] yuhao yang commented on SPARK-13783: I'm interested. > Model export/import for spark.ml: GBTs >

[jira] [Assigned] (SPARK-12555) Datasets: data is corrupted when input data is reordered

2016-03-09 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12555?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12555: Assignee: (was: Apache Spark) > Datasets: data is corrupted when input data is

[jira] [Commented] (SPARK-12555) Datasets: data is corrupted when input data is reordered

2016-03-09 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12555?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15188438#comment-15188438 ] Apache Spark commented on SPARK-12555: -- User 'lresende' has created a pull request for this issue:

[jira] [Assigned] (SPARK-12555) Datasets: data is corrupted when input data is reordered

2016-03-09 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12555?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12555: Assignee: Apache Spark > Datasets: data is corrupted when input data is reordered >

[jira] [Commented] (SPARK-13790) Speed up ColumnVector's getDecimal

2016-03-09 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13790?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15188451#comment-15188451 ] Apache Spark commented on SPARK-13790: -- User 'nongli' has created a pull request for this issue:

[jira] [Assigned] (SPARK-13790) Speed up ColumnVector's getDecimal

2016-03-09 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13790?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13790: Assignee: (was: Apache Spark) > Speed up ColumnVector's getDecimal >

[jira] [Assigned] (SPARK-13790) Speed up ColumnVector's getDecimal

2016-03-09 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13790?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13790: Assignee: Apache Spark > Speed up ColumnVector's getDecimal >

[jira] [Created] (SPARK-13791) Add MetadataLog and HDFSMetadataLog

2016-03-09 Thread Shixiong Zhu (JIRA)
Shixiong Zhu created SPARK-13791: Summary: Add MetadataLog and HDFSMetadataLog Key: SPARK-13791 URL: https://issues.apache.org/jira/browse/SPARK-13791 Project: Spark Issue Type: Improvement

[jira] [Resolved] (SPARK-13778) Master's ApplicationPage displays wrong application executor state when a worker is lost

2016-03-09 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13778?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or resolved SPARK-13778. --- Resolution: Fixed Fix Version/s: 2.0.0 Target Version/s: 2.0.0 > Master's

[jira] [Resolved] (SPARK-13760) Fix BigDecimal constructor for FloatType

2016-03-09 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13760?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai resolved SPARK-13760. -- Resolution: Fixed Fix Version/s: 1.6.1 2.0.0 Issue resolved by pull request

[jira] [Created] (SPARK-13792) Limit logging of bad records

2016-03-09 Thread Hossein Falaki (JIRA)
Hossein Falaki created SPARK-13792: -- Summary: Limit logging of bad records Key: SPARK-13792 URL: https://issues.apache.org/jira/browse/SPARK-13792 Project: Spark Issue Type: Sub-task

[jira] [Resolved] (SPARK-13747) Concurrent execution in SQL doesn't work with Scala ForkJoinPool

2016-03-09 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13747?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu resolved SPARK-13747. -- Resolution: Fixed Fix Version/s: 2.0.0 > Concurrent execution in SQL doesn't work with

[jira] [Issue Comment Deleted] (SPARK-13782) Model export/import for spark.ml: BisectingKMeans

2016-03-09 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13782?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-13782: -- Comment: was deleted (was: Hi, [~josephkb]. May I work on this issue?) > Model export/import

[jira] [Commented] (SPARK-13791) Add MetadataLog and HDFSMetadataLog

2016-03-09 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13791?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15188467#comment-15188467 ] Apache Spark commented on SPARK-13791: -- User 'zsxwing' has created a pull request for this issue:

[jira] [Assigned] (SPARK-13791) Add MetadataLog and HDFSMetadataLog

2016-03-09 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13791?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13791: Assignee: Shixiong Zhu (was: Apache Spark) > Add MetadataLog and HDFSMetadataLog >

[jira] [Assigned] (SPARK-13791) Add MetadataLog and HDFSMetadataLog

2016-03-09 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13791?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13791: Assignee: Apache Spark (was: Shixiong Zhu) > Add MetadataLog and HDFSMetadataLog >

[jira] [Resolved] (SPARK-13775) history server sort by completed time by default

2016-03-09 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13775?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or resolved SPARK-13775. --- Resolution: Fixed Assignee: Zhuo Liu Fix Version/s: 2.0.0 Target

[jira] [Resolved] (SPARK-13492) Configure a custom webui_url for the Spark Mesos Framework

2016-03-09 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13492?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or resolved SPARK-13492. --- Resolution: Fixed Fix Version/s: 2.0.0 > Configure a custom webui_url for the Spark Mesos

[jira] [Updated] (SPARK-13492) Configure a custom webui_url for the Spark Mesos Framework

2016-03-09 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13492?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-13492: -- Assignee: Sergiusz Urbaniak (was: Andrew Or) > Configure a custom webui_url for the Spark Mesos

[jira] [Assigned] (SPARK-13492) Configure a custom webui_url for the Spark Mesos Framework

2016-03-09 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13492?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or reassigned SPARK-13492: - Assignee: Andrew Or > Configure a custom webui_url for the Spark Mesos Framework >

[jira] [Updated] (SPARK-13760) Fix BigDecimal constructor for FloatType

2016-03-09 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13760?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-13760: - Fix Version/s: (was: 1.6.1) 1.6.2 > Fix BigDecimal constructor for FloatType >

[jira] [Updated] (SPARK-13760) Fix BigDecimal constructor for FloatType

2016-03-09 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13760?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-13760: - Assignee: Sameer Agarwal > Fix BigDecimal constructor for FloatType >

[jira] [Commented] (SPARK-13766) Inconsistent file extensions and omitted file extensions written by CSV, TEXT and JSON data sources

2016-03-09 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13766?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15186835#comment-15186835 ] Sean Owen commented on SPARK-13766: --- It seems like you're referring to the "part-*" files. These files

[jira] [Commented] (SPARK-13766) Inconsistent file extensions and omitted file extensions written by CSV, TEXT and JSON data sources

2016-03-09 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13766?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15186855#comment-15186855 ] Apache Spark commented on SPARK-13766: -- User 'HyukjinKwon' has created a pull request for this

[jira] [Comment Edited] (SPARK-13767) py4j.protocol.Py4JNetworkError: An error occurred while trying to connect to the Java server

2016-03-09 Thread Poonam Agrawal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13767?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15186881#comment-15186881 ] Poonam Agrawal edited comment on SPARK-13767 at 3/9/16 10:01 AM: - Master

[jira] [Created] (SPARK-13771) Eliminate child of project if the project with no references to its child

2016-03-09 Thread Liang-Chi Hsieh (JIRA)
Liang-Chi Hsieh created SPARK-13771: --- Summary: Eliminate child of project if the project with no references to its child Key: SPARK-13771 URL: https://issues.apache.org/jira/browse/SPARK-13771

[jira] [Updated] (SPARK-13706) Python Example for Train Validation Split Missing

2016-03-09 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13706?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nick Pentreath updated SPARK-13706: --- Issue Type: Improvement (was: Bug) > Python Example for Train Validation Split Missing >

[jira] [Commented] (SPARK-13771) Eliminate child of project if the project with no references to its child

2016-03-09 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13771?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15186715#comment-15186715 ] Apache Spark commented on SPARK-13771: -- User 'viirya' has created a pull request for this issue:

[jira] [Assigned] (SPARK-13771) Eliminate child of project if the project with no references to its child

2016-03-09 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13771?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13771: Assignee: (was: Apache Spark) > Eliminate child of project if the project with no

[jira] [Assigned] (SPARK-13771) Eliminate child of project if the project with no references to its child

2016-03-09 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13771?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13771: Assignee: Apache Spark > Eliminate child of project if the project with no references to

[jira] [Assigned] (SPARK-13568) Create feature transformer to impute missing values

2016-03-09 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13568?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13568: Assignee: Apache Spark > Create feature transformer to impute missing values >

[jira] [Assigned] (SPARK-13766) Inconsistent file extensions and omitted file extensions written by CSV, TEXT and JSON data sources

2016-03-09 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13766?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13766: Assignee: Apache Spark > Inconsistent file extensions and omitted file extensions written

[jira] [Assigned] (SPARK-13766) Inconsistent file extensions and omitted file extensions written by CSV, TEXT and JSON data sources

2016-03-09 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13766?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13766: Assignee: (was: Apache Spark) > Inconsistent file extensions and omitted file

[jira] [Comment Edited] (SPARK-13766) Inconsistent file extensions and omitted file extensions written by CSV, TEXT and JSON data sources

2016-03-09 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13766?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15186872#comment-15186872 ] Hyukjin Kwon edited comment on SPARK-13766 at 3/9/16 9:59 AM: -- Firstly,

[jira] [Commented] (SPARK-13767) py4j.protocol.Py4JNetworkError: An error occurred while trying to connect to the Java server

2016-03-09 Thread Poonam Agrawal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13767?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15186881#comment-15186881 ] Poonam Agrawal commented on SPARK-13767: Master is working I have started the master with

[jira] [Commented] (SPARK-13030) Change OneHotEncoder to Estimator

2016-03-09 Thread Wojciech Jurczyk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13030?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15186722#comment-15186722 ] Wojciech Jurczyk commented on SPARK-13030: -- I am not sure if I get you correctly. Are you

[jira] [Commented] (SPARK-13682) Finalize the public API for FileFormat

2016-03-09 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13682?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15186766#comment-15186766 ] Reynold Xin commented on SPARK-13682: - Traits are just bad for binary compatibility. Adding a method

[jira] [Commented] (SPARK-13393) Column mismatch issue in left_outer join using Spark DataFrame

2016-03-09 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13393?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15186775#comment-15186775 ] Wenchen Fan commented on SPARK-13393: - It's a known issue and there are already a lot of JIRAs

[jira] [Commented] (SPARK-12343) Remove YARN Client / ClientArguments

2016-03-09 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12343?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15186795#comment-15186795 ] Apache Spark commented on SPARK-12343: -- User 'jerryshao' has created a pull request for this issue:

[jira] [Commented] (SPARK-13767) py4j.protocol.Py4JNetworkError: An error occurred while trying to connect to the Java server

2016-03-09 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13767?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15186851#comment-15186851 ] Sean Owen commented on SPARK-13767: --- This sounds like a local problem, like, your master is not

[jira] [Commented] (SPARK-13766) Inconsistent file extensions and omitted file extensions written by CSV, TEXT and JSON data sources

2016-03-09 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13766?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15186872#comment-15186872 ] Hyukjin Kwon commented on SPARK-13766: -- Firstly, sorry, I just checked this after creating a PR. I

[jira] [Commented] (SPARK-13766) Inconsistent file extensions and omitted file extensions written by CSV, TEXT and JSON data sources

2016-03-09 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13766?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15186890#comment-15186890 ] Hyukjin Kwon commented on SPARK-13766: -- Are there any possibility that the "part-*" files could be

[jira] [Commented] (SPARK-13744) Dataframe RDD caching increases the input size for subsequent stages

2016-03-09 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13744?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15186848#comment-15186848 ] Sean Owen commented on SPARK-13744: --- Caching happens asynchronously. It is not completed after cache()

[jira] [Assigned] (SPARK-13568) Create feature transformer to impute missing values

2016-03-09 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13568?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13568: Assignee: (was: Apache Spark) > Create feature transformer to impute missing values >

[jira] [Commented] (SPARK-13568) Create feature transformer to impute missing values

2016-03-09 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13568?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15186714#comment-15186714 ] Apache Spark commented on SPARK-13568: -- User 'hhbyyh' has created a pull request for this issue:

[jira] [Updated] (SPARK-13706) Python Example for Train Validation Split Missing

2016-03-09 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13706?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nick Pentreath updated SPARK-13706: --- Assignee: Jeremy > Python Example for Train Validation Split Missing >

<    1   2   3