[jira] [Resolved] (SPARK-11500) Not deterministic order of columns when using merging schemas.

2015-11-11 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11500?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian resolved SPARK-11500. Resolution: Fixed Fix Version/s: 1.7.0 Issue resolved by pull request 9517

[jira] [Commented] (SPARK-6726) Model export/import for spark.ml: LogisticRegression

2015-11-11 Thread Earthson Lu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6726?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15000192#comment-15000192 ] Earthson Lu commented on SPARK-6726: Is the API ready for subtasks? I can do some work:) > Model

[jira] [Comment Edited] (SPARK-11583) Make MapStatus use less memory uage

2015-11-11 Thread Kent Yao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11583?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15000187#comment-15000187 ] Kent Yao edited comment on SPARK-11583 at 11/11/15 10:21 AM: - [~imranr]

[jira] [Commented] (SPARK-10113) Support for unsigned Parquet logical types

2015-11-11 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10113?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15000203#comment-15000203 ] Cheng Lian commented on SPARK-10113: I think emitting a clear error message is more reasonable since

[jira] [Updated] (SPARK-11651) LinearRegressionSummary should support get residuals by type

2015-11-11 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11651?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-11651: Description: LinearRegressionSummary should support get residuals by type like R glm: {code}

[jira] [Created] (SPARK-11651) LinearRegressionSummary should support get residuals by type

2015-11-11 Thread Yanbo Liang (JIRA)
Yanbo Liang created SPARK-11651: --- Summary: LinearRegressionSummary should support get residuals by type Key: SPARK-11651 URL: https://issues.apache.org/jira/browse/SPARK-11651 Project: Spark

[jira] [Created] (SPARK-11652) Remote code execution with InvokerTransformer

2015-11-11 Thread Daniel Darabos (JIRA)
Daniel Darabos created SPARK-11652: -- Summary: Remote code execution with InvokerTransformer Key: SPARK-11652 URL: https://issues.apache.org/jira/browse/SPARK-11652 Project: Spark Issue

[jira] [Commented] (SPARK-6789) Model export/import for spark.ml: ALS

2015-11-11 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6789?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15000179#comment-15000179 ] Yanbo Liang commented on SPARK-6789: I will work on it. > Model export/import for spark.ml: ALS >

[jira] [Resolved] (SPARK-11594) Cannot create UDAF in REPL

2015-11-11 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11594?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Herman van Hovell resolved SPARK-11594. --- Resolution: Not A Problem > Cannot create UDAF in REPL > --

[jira] [Commented] (SPARK-11089) Add a option for thrift-server to share a single session across all connections

2015-11-11 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11089?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15000195#comment-15000195 ] Cheng Lian commented on SPARK-11089: OK, I'm taking this. > Add a option for thrift-server to share

[jira] [Commented] (SPARK-9686) Spark hive jdbc client cannot get table from metadata store

2015-11-11 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9686?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15000215#comment-15000215 ] Cheng Lian commented on SPARK-9686: --- [~navis] [~bugg_tb] [~pin_zhang] May I ask were you all using

[jira] [Commented] (SPARK-10978) Allow PrunedFilterScan to eliminate predicates from further evaluation

2015-11-11 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10978?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15000112#comment-15000112 ] Hyukjin Kwon commented on SPARK-10978: -- I am sorry to add some comments more here but.. are you sure

[jira] [Comment Edited] (SPARK-10978) Allow PrunedFilterScan to eliminate predicates from further evaluation

2015-11-11 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10978?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15000112#comment-15000112 ] Hyukjin Kwon edited comment on SPARK-10978 at 11/11/15 8:29 AM: I am

[jira] [Updated] (SPARK-11651) LinearRegressionSummary should support get residuals by type

2015-11-11 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11651?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-11651: Description: LinearRegressionSummary should support get residuals by type like R glm: {code}

[jira] [Commented] (SPARK-11651) LinearRegressionSummary should support get residuals by type

2015-11-11 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11651?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15000163#comment-15000163 ] Apache Spark commented on SPARK-11651: -- User 'yanboliang' has created a pull request for this issue:

[jira] [Assigned] (SPARK-11651) LinearRegressionSummary should support get residuals by type

2015-11-11 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11651?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11651: Assignee: Apache Spark > LinearRegressionSummary should support get residuals by type >

[jira] [Assigned] (SPARK-11651) LinearRegressionSummary should support get residuals by type

2015-11-11 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11651?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11651: Assignee: (was: Apache Spark) > LinearRegressionSummary should support get residuals

[jira] [Assigned] (SPARK-9866) VersionsSuite is unnecessarily slow in Jenkins

2015-11-11 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9866?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-9866: --- Assignee: (was: Apache Spark) > VersionsSuite is unnecessarily slow in Jenkins >

[jira] [Assigned] (SPARK-9866) VersionsSuite is unnecessarily slow in Jenkins

2015-11-11 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9866?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-9866: --- Assignee: Apache Spark > VersionsSuite is unnecessarily slow in Jenkins >

[jira] [Commented] (SPARK-9866) VersionsSuite is unnecessarily slow in Jenkins

2015-11-11 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9866?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15000119#comment-15000119 ] Apache Spark commented on SPARK-9866: - User 'JoshRosen' has created a pull request for this issue:

[jira] [Updated] (SPARK-11500) Not deterministic order of columns when using merging schemas.

2015-11-11 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11500?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-11500: --- Fix Version/s: 1.6.0 > Not deterministic order of columns when using merging schemas. >

[jira] [Comment Edited] (SPARK-11583) Make MapStatus use less memory uage

2015-11-11 Thread Kent Yao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11583?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15000187#comment-15000187 ] Kent Yao edited comment on SPARK-11583 at 11/11/15 10:20 AM: - @lemire

[jira] [Commented] (SPARK-11583) Make MapStatus use less memory uage

2015-11-11 Thread Kent Yao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11583?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15000187#comment-15000187 ] Kent Yao commented on SPARK-11583: -- @lemire [~imranr] 1. test cases 1.1 sparse case: for each task

[jira] [Commented] (SPARK-5968) Parquet warning in spark-shell

2015-11-11 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5968?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15000200#comment-15000200 ] Cheng Lian commented on SPARK-5968: --- It had once been fixed via a quite hacky trick. Unfortunately it

[jira] [Created] (SPARK-11650) "AkkaUtilsSuite.remote fetch ssl on - untrusted server" test is very slow

2015-11-11 Thread Josh Rosen (JIRA)
Josh Rosen created SPARK-11650: -- Summary: "AkkaUtilsSuite.remote fetch ssl on - untrusted server" test is very slow Key: SPARK-11650 URL: https://issues.apache.org/jira/browse/SPARK-11650 Project: Spark

[jira] [Commented] (SPARK-11594) Cannot create UDAF in REPL

2015-11-11 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11594?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15000151#comment-15000151 ] Herman van Hovell commented on SPARK-11594: --- Move to scala 2.10.5 fixed this. > Cannot create

[jira] [Commented] (SPARK-11633) HiveContext throws TreeNode Exception : Failed to Copy Node

2015-11-11 Thread Saurabh Santhosh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11633?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15000328#comment-15000328 ] Saurabh Santhosh commented on SPARK-11633: -- {code:title=Stacktrace|borderStyle=solid}

[jira] [Assigned] (SPARK-11654) add reduce to GroupedDataset

2015-11-11 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11654?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11654: Assignee: (was: Apache Spark) > add reduce to GroupedDataset >

[jira] [Commented] (SPARK-11654) add reduce to GroupedDataset

2015-11-11 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11654?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15000382#comment-15000382 ] Apache Spark commented on SPARK-11654: -- User 'cloud-fan' has created a pull request for this issue:

[jira] [Assigned] (SPARK-11654) add reduce to GroupedDataset

2015-11-11 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11654?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11654: Assignee: Apache Spark > add reduce to GroupedDataset > > >

[jira] [Updated] (SPARK-11655) SparkLauncherBackendSuite leaks child processes

2015-11-11 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11655?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-11655: --- Description: We've been combatting an orphaned process issue on AMPLab Jenkins since October and I

[jira] [Issue Comment Deleted] (SPARK-10978) Allow PrunedFilterScan to eliminate predicates from further evaluation

2015-11-11 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10978?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-10978: - Comment: was deleted (was: Thanks the for the test! I think there is a bug.) > Allow PrunedFilterScan

[jira] [Updated] (SPARK-11659) Codegen sporadically fails with same input character

2015-11-11 Thread Catalin Alexandru Zamfir (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11659?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Catalin Alexandru Zamfir updated SPARK-11659: - Description: We pretty much have a default installation of Spark 1.5.1.

[jira] [Comment Edited] (SPARK-11553) row.getInt(i) if row[i]=null returns 0

2015-11-11 Thread Bartlomiej Alberski (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11553?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15000616#comment-15000616 ] Bartlomiej Alberski edited comment on SPARK-11553 at 11/11/15 5:10 PM:

[jira] [Resolved] (SPARK-11626) ml.feature.Word2Vec.transform() should not recompute word-vector map each time

2015-11-11 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11626?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-11626. --- Resolution: Fixed Fix Version/s: 1.6.0 Issue resolved by pull request 9592

[jira] [Commented] (SPARK-11657) Bad Dataframe data read from parquet

2015-11-11 Thread kevin yu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11657?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15000763#comment-15000763 ] kevin yu commented on SPARK-11657: -- Hello Virgil: Can you try to toDF().show()? then do toDF().take(2)?

[jira] [Updated] (SPARK-11659) Codegen sporadically fails with same input character

2015-11-11 Thread Catalin Alexandru Zamfir (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11659?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Catalin Alexandru Zamfir updated SPARK-11659: - Description: We pretty much have a default instalation of Spark 1.5.1.

[jira] [Updated] (SPARK-11601) ML 1.6 QA: API: Binary incompatible changes

2015-11-11 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11601?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-11601: -- Description: Generate a list of binary incompatible changes using MiMa and create new JIRAs

[jira] [Resolved] (SPARK-11646) WholeTextFileRDD should return Text rather than String

2015-11-11 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11646?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-11646. - Resolution: Fixed Fix Version/s: 1.6.0 > WholeTextFileRDD should return Text rather than

[jira] [Updated] (SPARK-11566) Refactoring GaussianMixtureModel.gaussians in Python

2015-11-11 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11566?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-11566: -- Assignee: Yu Ishikawa > Refactoring GaussianMixtureModel.gaussians in Python >

[jira] [Updated] (SPARK-11656) support typed aggregate in project list

2015-11-11 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11656?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-11656: -- Assignee: Wenchen Fan > support typed aggregate in project list >

[jira] [Commented] (SPARK-10978) Allow PrunedFilterScan to eliminate predicates from further evaluation

2015-11-11 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10978?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15000854#comment-15000854 ] Yin Huai commented on SPARK-10978: -- I opened https://issues.apache.org/jira/browse/SPARK-11661. > Allow

[jira] [Updated] (SPARK-11643) inserting date with leading zero inserts null example '0001-12-10'

2015-11-11 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11643?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-11643: -- Target Version/s: (was: 1.5.0, 1.5.1) [~csa...@progress.com] this can't target 1.5.0 / 1.5.1 --

[jira] [Updated] (SPARK-11659) Codegen sporadically fails with same input character

2015-11-11 Thread Catalin Alexandru Zamfir (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11659?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Catalin Alexandru Zamfir updated SPARK-11659: - Description: We pretty much have a default instalation of Spark 1.5.1.

[jira] [Commented] (SPARK-11657) Bad Dataframe data read from parquet

2015-11-11 Thread Virgil Palanciuc (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11657?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15000840#comment-15000840 ] Virgil Palanciuc commented on SPARK-11657: -- On the simple example: {code} > val df =

[jira] [Created] (SPARK-11660) Spark Thrift GetResultSetMetadata describes a VARCHAR as a STRING

2015-11-11 Thread Chip Sands (JIRA)
Chip Sands created SPARK-11660: -- Summary: Spark Thrift GetResultSetMetadata describes a VARCHAR as a STRING Key: SPARK-11660 URL: https://issues.apache.org/jira/browse/SPARK-11660 Project: Spark

[jira] [Updated] (SPARK-11567) Add Python API for corr aggregate function

2015-11-11 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11567?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-11567: -- Assignee: Felix Cheung > Add Python API for corr aggregate function >

[jira] [Commented] (SPARK-11202) Unsupported dataType

2015-11-11 Thread F Jimenez (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11202?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15000665#comment-15000665 ] F Jimenez commented on SPARK-11202: --- The problem seems to stem from calling a deprecated method. In

[jira] [Commented] (SPARK-11655) SparkLauncherBackendSuite leaks child processes

2015-11-11 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11655?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15000739#comment-15000739 ] Marcelo Vanzin commented on SPARK-11655: I'll take a look at the code. >

[jira] [Commented] (SPARK-10978) Allow PrunedFilterScan to eliminate predicates from further evaluation

2015-11-11 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10978?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15000821#comment-15000821 ] Yin Huai commented on SPARK-10978: -- Thanks the for the test! I think there is a bug. > Allow

[jira] [Commented] (SPARK-10978) Allow PrunedFilterScan to eliminate predicates from further evaluation

2015-11-11 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10978?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15000823#comment-15000823 ] Yin Huai commented on SPARK-10978: -- Thanks the for the test! I think there is a bug. > Allow

[jira] [Updated] (SPARK-11659) Codegen sporadically fails with same input character

2015-11-11 Thread Catalin Alexandru Zamfir (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11659?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Catalin Alexandru Zamfir updated SPARK-11659: - Description: We pretty much have a default installation of Spark 1.5.1.

[jira] [Resolved] (SPARK-11481) orderBy with multiple columns in WindowSpec does not work properly

2015-11-11 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11481?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu resolved SPARK-11481. Resolution: Fixed Assignee: Davies Liu Fix Version/s: 1.6.0

[jira] [Commented] (SPARK-11481) orderBy with multiple columns in WindowSpec does not work properly

2015-11-11 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11481?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15000826#comment-15000826 ] Davies Liu commented on SPARK-11481: I think this is related to

[jira] [Resolved] (SPARK-11656) support typed aggregate in project list

2015-11-11 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11656?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-11656. -- Resolution: Fixed Fix Version/s: 1.6.0 Issue resolved by pull request 9630

[jira] [Issue Comment Deleted] (SPARK-11657) Bad Dataframe data read from parquet

2015-11-11 Thread Xin Wu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11657?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xin Wu updated SPARK-11657: --- Comment: was deleted (was: I tried this sample data as local file mode. and it seems working to me. Have

[jira] [Commented] (SPARK-11553) row.getInt(i) if row[i]=null returns 0

2015-11-11 Thread Bartlomiej Alberski (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11553?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15000616#comment-15000616 ] Bartlomiej Alberski commented on SPARK-11553: - Ok. I think that I know what is the problem.

[jira] [Created] (SPARK-11659) Codegen sporadically fails with same input character

2015-11-11 Thread Catalin Alexandru Zamfir (JIRA)
Catalin Alexandru Zamfir created SPARK-11659: Summary: Codegen sporadically fails with same input character Key: SPARK-11659 URL: https://issues.apache.org/jira/browse/SPARK-11659 Project:

[jira] [Updated] (SPARK-11601) ML 1.6 QA: API: Binary incompatible changes

2015-11-11 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11601?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-11601: -- Assignee: Tim Hunter > ML 1.6 QA: API: Binary incompatible changes >

[jira] [Commented] (SPARK-11655) SparkLauncherBackendSuite leaks child processes

2015-11-11 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11655?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15000856#comment-15000856 ] Apache Spark commented on SPARK-11655: -- User 'vanzin' has created a pull request for this issue:

[jira] [Updated] (SPARK-11637) Alias do not work with udf with * parameter

2015-11-11 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11637?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-11637: -- Component/s: SQL > Alias do not work with udf with * parameter >

[jira] [Assigned] (SPARK-11655) SparkLauncherBackendSuite leaks child processes

2015-11-11 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11655?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11655: Assignee: Apache Spark > SparkLauncherBackendSuite leaks child processes >

[jira] [Updated] (SPARK-11652) Remote code execution with InvokerTransformer

2015-11-11 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11652?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-11652: -- Component/s: Spark Core > Remote code execution with InvokerTransformer >

[jira] [Updated] (SPARK-11643) inserting date with leading zero inserts null example '0001-12-10'

2015-11-11 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11643?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-11643: -- Component/s: SQL > inserting date with leading zero inserts null example '0001-12-10' >

[jira] [Assigned] (SPARK-9866) VersionsSuite is unnecessarily slow in Jenkins

2015-11-11 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9866?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen reassigned SPARK-9866: - Assignee: Josh Rosen > VersionsSuite is unnecessarily slow in Jenkins >

[jira] [Commented] (SPARK-11658) simplify documentation for PySpark combineByKey

2015-11-11 Thread chris snow (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11658?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15000565#comment-15000565 ] chris snow commented on SPARK-11658: Pull request - https://github.com/apache/spark/pull/9631 >

[jira] [Issue Comment Deleted] (SPARK-11658) simplify documentation for PySpark combineByKey

2015-11-11 Thread chris snow (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11658?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] chris snow updated SPARK-11658: --- Comment: was deleted (was: Pull request - https://github.com/apache/spark/pull/9631) > simplify

[jira] [Commented] (SPARK-11657) Bad Dataframe data read from parquet

2015-11-11 Thread Virgil Palanciuc (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11657?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15000599#comment-15000599 ] Virgil Palanciuc commented on SPARK-11657: -- I can't reproduce this on standalone; I can

[jira] [Commented] (SPARK-11657) Bad Dataframe data read from parquet

2015-11-11 Thread Xin Wu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11657?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15000607#comment-15000607 ] Xin Wu commented on SPARK-11657: I tried this sample data as local file mode. and it seems working to me.

[jira] [Updated] (SPARK-11657) Bad data read using dataframes

2015-11-11 Thread Virgil Palanciuc (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11657?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Virgil Palanciuc updated SPARK-11657: - Attachment: sample.tgz Sample directory, use to reproduce the problem > Bad data read

[jira] [Updated] (SPARK-11657) Bad data read using dataframes

2015-11-11 Thread Virgil Palanciuc (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11657?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Virgil Palanciuc updated SPARK-11657: - Description: I get strange behaviour when reading parquet data: {code} scala> val data

[jira] [Created] (SPARK-11658) simplify documentation for PySpark combineByKey

2015-11-11 Thread chris snow (JIRA)
chris snow created SPARK-11658: -- Summary: simplify documentation for PySpark combineByKey Key: SPARK-11658 URL: https://issues.apache.org/jira/browse/SPARK-11658 Project: Spark Issue Type:

[jira] [Created] (SPARK-11653) Would be very useful if spark-daemon.sh supported foreground operations

2015-11-11 Thread Adrian Bridgett (JIRA)
Adrian Bridgett created SPARK-11653: --- Summary: Would be very useful if spark-daemon.sh supported foreground operations Key: SPARK-11653 URL: https://issues.apache.org/jira/browse/SPARK-11653

[jira] [Commented] (SPARK-11653) Would be very useful if spark-daemon.sh supported foreground operations

2015-11-11 Thread Adrian Bridgett (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11653?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15000390#comment-15000390 ] Adrian Bridgett commented on SPARK-11653: - Thanks Sean, I took a slightly different approach -

[jira] [Commented] (SPARK-11655) SparkLauncherBackendSuite leaks child processes

2015-11-11 Thread shane knapp (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11655?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15000424#comment-15000424 ] shane knapp commented on SPARK-11655: - actually, this started back in mid-may, but the impact

[jira] [Commented] (SPARK-11154) make specificaition spark.yarn.executor.memoryOverhead consistent with typical JVM options

2015-11-11 Thread Thomas Graves (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11154?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15000426#comment-15000426 ] Thomas Graves commented on SPARK-11154: --- It seems unnecessary to me to add new configs just to

[jira] [Created] (SPARK-11657) Bad data read using dataframes

2015-11-11 Thread Virgil Palanciuc (JIRA)
Virgil Palanciuc created SPARK-11657: Summary: Bad data read using dataframes Key: SPARK-11657 URL: https://issues.apache.org/jira/browse/SPARK-11657 Project: Spark Issue Type: Bug

[jira] [Assigned] (SPARK-11658) simplify documentation for PySpark combineByKey

2015-11-11 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11658?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11658: Assignee: Apache Spark > simplify documentation for PySpark combineByKey >

[jira] [Assigned] (SPARK-11658) simplify documentation for PySpark combineByKey

2015-11-11 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11658?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11658: Assignee: (was: Apache Spark) > simplify documentation for PySpark combineByKey >

[jira] [Commented] (SPARK-11658) simplify documentation for PySpark combineByKey

2015-11-11 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11658?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15000563#comment-15000563 ] Apache Spark commented on SPARK-11658: -- User 'snowch' has created a pull request for this issue:

[jira] [Commented] (SPARK-5682) Add encrypted shuffle in spark

2015-11-11 Thread Ferdinand Xu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5682?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15000526#comment-15000526 ] Ferdinand Xu commented on SPARK-5682: - Thank you for your question. The key is generated by key gen

[jira] [Updated] (SPARK-11657) Bad Dataframe data read from parquet

2015-11-11 Thread Virgil Palanciuc (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11657?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Virgil Palanciuc updated SPARK-11657: - Summary: Bad Dataframe data read from parquet (was: Bad data read using dataframes) >

[jira] [Commented] (SPARK-10954) Parquet version in the "created_by" metadata field of Parquet files written by Spark 1.5 and 1.6 is wrong

2015-11-11 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10954?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15000222#comment-15000222 ] Cheng Lian commented on SPARK-10954: Figured out the reason why {{created_by}} is wrong in Spark

[jira] [Assigned] (SPARK-11656) support typed aggregate in project list

2015-11-11 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11656?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11656: Assignee: Apache Spark > support typed aggregate in project list >

[jira] [Commented] (SPARK-11656) support typed aggregate in project list

2015-11-11 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11656?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15000421#comment-15000421 ] Apache Spark commented on SPARK-11656: -- User 'cloud-fan' has created a pull request for this issue:

[jira] [Assigned] (SPARK-11656) support typed aggregate in project list

2015-11-11 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11656?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11656: Assignee: (was: Apache Spark) > support typed aggregate in project list >

[jira] [Commented] (SPARK-11154) make specificaition spark.yarn.executor.memoryOverhead consistent with typical JVM options

2015-11-11 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11154?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15000441#comment-15000441 ] Sean Owen commented on SPARK-11154: --- I think we'd have to make new properties to maintain

[jira] [Created] (SPARK-11655) SparkLauncherBackendSuite leaks child processes

2015-11-11 Thread Josh Rosen (JIRA)
Josh Rosen created SPARK-11655: -- Summary: SparkLauncherBackendSuite leaks child processes Key: SPARK-11655 URL: https://issues.apache.org/jira/browse/SPARK-11655 Project: Spark Issue Type: Bug

[jira] [Updated] (SPARK-11655) SparkLauncherBackendSuite leaks child processes

2015-11-11 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11655?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-11655: --- Attachment: screenshot-1.png > SparkLauncherBackendSuite leaks child processes >

[jira] [Updated] (SPARK-11655) SparkLauncherBackendSuite leaks child processes

2015-11-11 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11655?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-11655: --- Description: We've been combatting an orphaned process issue on AMPLab Jenkins since October and I

[jira] [Updated] (SPARK-11655) SparkLauncherBackendSuite leaks child processes

2015-11-11 Thread shane knapp (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11655?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] shane knapp updated SPARK-11655: Attachment: year_or_doom.png month_of_doom.png > SparkLauncherBackendSuite leaks

[jira] [Created] (SPARK-11654) add reduce to GroupedDataset

2015-11-11 Thread Wenchen Fan (JIRA)
Wenchen Fan created SPARK-11654: --- Summary: add reduce to GroupedDataset Key: SPARK-11654 URL: https://issues.apache.org/jira/browse/SPARK-11654 Project: Spark Issue Type: Sub-task

[jira] [Commented] (SPARK-11154) make specificaition spark.yarn.executor.memoryOverhead consistent with typical JVM options

2015-11-11 Thread Dustin Cote (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11154?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15000419#comment-15000419 ] Dustin Cote commented on SPARK-11154: - [~Kitard] I think the naming convention and strategy makes

[jira] [Commented] (SPARK-10865) [Spark SQL] [UDF] the ceil/ceiling function got wrong return value type

2015-11-11 Thread Dominic Ricard (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10865?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15000404#comment-15000404 ] Dominic Ricard commented on SPARK-10865: Is it possible for this fix to be included in the 1.5.2

[jira] [Created] (SPARK-11656) support typed aggregate in project list

2015-11-11 Thread Wenchen Fan (JIRA)
Wenchen Fan created SPARK-11656: --- Summary: support typed aggregate in project list Key: SPARK-11656 URL: https://issues.apache.org/jira/browse/SPARK-11656 Project: Spark Issue Type: Sub-task

[jira] [Commented] (SPARK-11648) IllegalReferenceCountException in Spark workloads

2015-11-11 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11648?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15000288#comment-15000288 ] Sean Owen commented on SPARK-11648: --- Duplicate of SPARK-11617 it seems >

[jira] [Commented] (SPARK-10371) Optimize sequential projections

2015-11-11 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10371?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15000337#comment-15000337 ] Apache Spark commented on SPARK-10371: -- User 'cloud-fan' has created a pull request for this issue:

[jira] [Commented] (SPARK-11583) Make MapStatus use less memory uage

2015-11-11 Thread Daniel Lemire (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11583?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15000339#comment-15000339 ] Daniel Lemire commented on SPARK-11583: --- [~Qin Yao] [~aimran50] If you guys want to setup a

[jira] [Commented] (SPARK-6628) ClassCastException occurs when executing sql statement "insert into" on hbase table

2015-11-11 Thread Francesco Palmiotto (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6628?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15000345#comment-15000345 ] Francesco Palmiotto commented on SPARK-6628: Spark 1.5.1 is affected. > ClassCastException

[jira] [Commented] (SPARK-11653) Would be very useful if spark-daemon.sh supported foreground operations

2015-11-11 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11653?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15000373#comment-15000373 ] Sean Owen commented on SPARK-11653: --- To avoid duplication, it's better to reopen the original issue
