[jira] [Resolved] (SPARK-5966) Spark-submit deploy-mode incorrectly affecting submission when master = local[4]

2015-10-26 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5966?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-5966. -- Resolution: Fixed Fix Version/s: 1.6.0 1.5.3 Issue resolved by pull request

[jira] [Assigned] (SPARK-9858) Introduce an ExchangeCoordinator to estimate the number of post-shuffle partitions.

2015-10-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9858?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-9858: --- Assignee: Apache Spark (was: Yin Huai) > Introduce an ExchangeCoordinator to estimate the

[jira] [Assigned] (SPARK-9861) Join: Determine the number of reducers used by a shuffle join operator at runtime

2015-10-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9861?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-9861: --- Assignee: Apache Spark (was: Yin Huai) > Join: Determine the number of reducers used by a

[jira] [Commented] (SPARK-9265) Dataframe.limit joined with another dataframe can be non-deterministic

2015-10-26 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9265?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14973829#comment-14973829 ] Yanbo Liang commented on SPARK-9265: @Tathagata Das @Andrew Or [~rxin] Could you tell me how did you

[jira] [Issue Comment Deleted] (SPARK-11303) sample (without replacement) + filter returns wrong results in DataFrame

2015-10-26 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11303?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-11303: Comment: was deleted (was: I think the reason of this bug is the same as SPARK-4963, I will send a

[jira] [Comment Edited] (SPARK-11303) sample (without replacement) + filter returns wrong results in DataFrame

2015-10-26 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11303?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14973993#comment-14973993 ] Yanbo Liang edited comment on SPARK-11303 at 10/26/15 10:29 AM: It looks

[jira] [Commented] (SPARK-11303) sample (without replacement) + filter returns wrong results in DataFrame

2015-10-26 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11303?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14973993#comment-14973993 ] Yanbo Liang commented on SPARK-11303: - It looks like this bug caused by mutable row copy related

[jira] [Commented] (SPARK-11253) reset all accumulators in physical operators before execute an action

2015-10-26 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11253?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14973786#comment-14973786 ] Wenchen Fan commented on SPARK-11253: - There is one more issue about the SQL metric: We use -1 as

[jira] [Comment Edited] (SPARK-9265) Dataframe.limit joined with another dataframe can be non-deterministic

2015-10-26 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9265?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14973829#comment-14973829 ] Yanbo Liang edited comment on SPARK-9265 at 10/26/15 6:59 AM: -- [~tdas]

[jira] [Assigned] (SPARK-11311) spark cannot describe temporary functions

2015-10-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11311?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11311: Assignee: Apache Spark > spark cannot describe temporary functions >

[jira] [Assigned] (SPARK-11311) spark cannot describe temporary functions

2015-10-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11311?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11311: Assignee: (was: Apache Spark) > spark cannot describe temporary functions >

[jira] [Commented] (SPARK-11311) spark cannot describe temporary functions

2015-10-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11311?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14973831#comment-14973831 ] Apache Spark commented on SPARK-11311: -- User 'adrian-wang' has created a pull request for this

[jira] [Commented] (SPARK-11312) Cannot drop temporary function

2015-10-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11312?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14973845#comment-14973845 ] Apache Spark commented on SPARK-11312: -- User 'adrian-wang' has created a pull request for this

[jira] [Assigned] (SPARK-11312) Cannot drop temporary function

2015-10-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11312?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11312: Assignee: (was: Apache Spark) > Cannot drop temporary function >

[jira] [Assigned] (SPARK-11312) Cannot drop temporary function

2015-10-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11312?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11312: Assignee: Apache Spark > Cannot drop temporary function > --

[jira] [Commented] (SPARK-9859) Aggregation: Determine the number of reducers at runtime

2015-10-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9859?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14973783#comment-14973783 ] Apache Spark commented on SPARK-9859: - User 'yhuai' has created a pull request for this issue:

[jira] [Created] (SPARK-11312) Cannot drop temporary function

2015-10-26 Thread Adrian Wang (JIRA)
Adrian Wang created SPARK-11312: --- Summary: Cannot drop temporary function Key: SPARK-11312 URL: https://issues.apache.org/jira/browse/SPARK-11312 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-11303) sample (without replacement) + filter returns wrong results in DataFrame

2015-10-26 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11303?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14973896#comment-14973896 ] Yanbo Liang commented on SPARK-11303: - I think the reason of this bug is the same as SPARK-4963, I

[jira] [Commented] (SPARK-11250) Generate different alias for columns with same name during join

2015-10-26 Thread Narine Kokhlikyan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11250?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14973801#comment-14973801 ] Narine Kokhlikyan commented on SPARK-11250: --- we can add aliases for the columns which are not

[jira] [Resolved] (SPARK-11312) Cannot drop temporary function

2015-10-26 Thread Adrian Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11312?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Adrian Wang resolved SPARK-11312. - Resolution: Duplicate > Cannot drop temporary function > -- > >

[jira] [Resolved] (SPARK-11310) only build spark core,Modify spark pom file:delete graphx

2015-10-26 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11310?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-11310. --- Resolution: Invalid It's not clear what you're trying to ask, but this is not the place anyway. Ask

[jira] [Commented] (SPARK-11305) Remove Third-Party Hadoop Distributions Doc Page

2015-10-26 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11305?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14973908#comment-14973908 ] Sean Owen commented on SPARK-11305: --- I support this and would tack on a few more reasons: - the Hadoop

[jira] [Comment Edited] (SPARK-7106) Support model save/load in Python's FPGrowth

2015-10-26 Thread Kai Jiang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7106?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14973525#comment-14973525 ] Kai Jiang edited comment on SPARK-7106 at 10/26/15 7:36 PM: I would like to

[jira] [Commented] (SPARK-11317) YARN HBase token code shouldn't swallow invocation target exceptions

2015-10-26 Thread Steve Loughran (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11317?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14974905#comment-14974905 ] Steve Loughran commented on SPARK-11317: I'll do this as soon as SPARK-11265 is in; I've factored

[jira] [Created] (SPARK-11324) Flag to close Write Ahead Log after writing

2015-10-26 Thread Burak Yavuz (JIRA)
Burak Yavuz created SPARK-11324: --- Summary: Flag to close Write Ahead Log after writing Key: SPARK-11324 URL: https://issues.apache.org/jira/browse/SPARK-11324 Project: Spark Issue Type:

[jira] [Created] (SPARK-11326) Split networking in standalone mode

2015-10-26 Thread Jacek Lewandowski (JIRA)
Jacek Lewandowski created SPARK-11326: - Summary: Split networking in standalone mode Key: SPARK-11326 URL: https://issues.apache.org/jira/browse/SPARK-11326 Project: Spark Issue Type:

[jira] [Assigned] (SPARK-11325) Alias alias in Scala's DataFrame to as to match python

2015-10-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11325?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11325: Assignee: (was: Apache Spark) > Alias alias in Scala's DataFrame to as to match

[jira] [Commented] (SPARK-11325) Alias alias in Scala's DataFrame to as to match python

2015-10-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11325?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14975305#comment-14975305 ] Apache Spark commented on SPARK-11325: -- User 'nongli' has created a pull request for this issue:

[jira] [Assigned] (SPARK-11325) Alias alias in Scala's DataFrame to as to match python

2015-10-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11325?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11325: Assignee: Apache Spark > Alias alias in Scala's DataFrame to as to match python >

[jira] [Commented] (SPARK-11328) Correctly propagate error message in the case of failures when writing parquet

2015-10-26 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11328?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14975361#comment-14975361 ] Yin Huai commented on SPARK-11328: -- The file already exists error was thrown from [this line |

[jira] [Updated] (SPARK-11330) Filter operation on StringType after groupBy PERSISTED brings no results

2015-10-26 Thread Saif Addin Ellafi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11330?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Saif Addin Ellafi updated SPARK-11330: -- Summary: Filter operation on StringType after groupBy PERSISTED brings no results

[jira] [Created] (SPARK-11325) Alias alias in Scala's DataFrame to as to match python

2015-10-26 Thread Yin Huai (JIRA)
Yin Huai created SPARK-11325: Summary: Alias alias in Scala's DataFrame to as to match python Key: SPARK-11325 URL: https://issues.apache.org/jira/browse/SPARK-11325 Project: Spark Issue Type:

[jira] [Commented] (SPARK-11325) Alias alias in Scala's DataFrame to as to match python

2015-10-26 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11325?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14975145#comment-14975145 ] Yin Huai commented on SPARK-11325: -- @nongli > Alias alias in Scala's DataFrame to as to match python >

[jira] [Comment Edited] (SPARK-11325) Alias alias in Scala's DataFrame to as to match python

2015-10-26 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11325?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14975145#comment-14975145 ] Yin Huai edited comment on SPARK-11325 at 10/26/15 9:38 PM: cc [~nongli]

[jira] [Comment Edited] (SPARK-11325) Alias alias in Scala's DataFrame to as to match python

2015-10-26 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11325?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14975145#comment-14975145 ] Yin Huai edited comment on SPARK-11325 at 10/26/15 9:38 PM: [~nongli] was

[jira] [Comment Edited] (SPARK-4751) Support dynamic allocation for standalone mode

2015-10-26 Thread Matthias Niehoff (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4751?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14974119#comment-14974119 ] Matthias Niehoff edited comment on SPARK-4751 at 10/26/15 8:05 PM: --- The

[jira] [Commented] (SPARK-11302) Multivariate Gaussian Model with Covariance matrix return zero always

2015-10-26 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11302?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14974982#comment-14974982 ] Sean Owen commented on SPARK-11302: --- OK I reproduced all of this, thank you. This is roughly the code

[jira] [Assigned] (SPARK-11324) Flag to close Write Ahead Log after writing

2015-10-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11324?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11324: Assignee: (was: Apache Spark) > Flag to close Write Ahead Log after writing >

[jira] [Updated] (SPARK-11316) isEmpty before coalesce seems to cause huge performance issue in setupGroups

2015-10-26 Thread Thomas Graves (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11316?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Thomas Graves updated SPARK-11316: -- Description: So I haven't fully debugged this yet but reporting what I'm seeing and think

[jira] [Commented] (SPARK-11324) Flag to close Write Ahead Log after writing

2015-10-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11324?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14974993#comment-14974993 ] Apache Spark commented on SPARK-11324: -- User 'brkyvz' has created a pull request for this issue:

[jira] [Assigned] (SPARK-11324) Flag to close Write Ahead Log after writing

2015-10-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11324?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11324: Assignee: Apache Spark > Flag to close Write Ahead Log after writing >

[jira] [Assigned] (SPARK-11326) Split networking in standalone mode

2015-10-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11326?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11326: Assignee: Apache Spark > Split networking in standalone mode >

[jira] [Created] (SPARK-11330) Filter operation on StringType after groupBy brings no results when there are

2015-10-26 Thread Saif Addin Ellafi (JIRA)
Saif Addin Ellafi created SPARK-11330: - Summary: Filter operation on StringType after groupBy brings no results when there are Key: SPARK-11330 URL: https://issues.apache.org/jira/browse/SPARK-11330

[jira] [Commented] (SPARK-11289) Substitute code examples in ML features extractors with include_example

2015-10-26 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11289?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14975374#comment-14975374 ] Xiangrui Meng commented on SPARK-11289: --- New example files should work. Eventually, we should try

[jira] [Updated] (SPARK-11289) Substitute code examples in ML features extractors with include_example

2015-10-26 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11289?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-11289: -- Shepherd: Xiangrui Meng Target Version/s: 1.6.0 > Substitute code examples in ML

[jira] [Updated] (SPARK-11330) Filter operation on StringType after groupBy brings no results when there are

2015-10-26 Thread Saif Addin Ellafi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11330?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Saif Addin Ellafi updated SPARK-11330: -- Environment: Stand alone Cluster of five servers (happens as well in local mode).

[jira] [Updated] (SPARK-11330) Filter operation on StringType after groupBy PERSISTED brings no results when there are

2015-10-26 Thread Saif Addin Ellafi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11330?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Saif Addin Ellafi updated SPARK-11330: -- Description: ONLY HAPPENS WHEN PERSIST() IS CALLED val data =

[jira] [Commented] (SPARK-4751) Support dynamic allocation for standalone mode

2015-10-26 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4751?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14974961#comment-14974961 ] Andrew Or commented on SPARK-4751: -- it is available, sorry we will update the documentation soon to

[jira] [Updated] (SPARK-11327) spark-dispatcher doesn't pass along some spark properties

2015-10-26 Thread Alan Braithwaite (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11327?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alan Braithwaite updated SPARK-11327: - Description: I haven't figured out exactly what's going on yet, but there's something in

[jira] [Updated] (SPARK-11258) Converting a Spark DataFrame into an R data.frame is slow / requires a lot of memory

2015-10-26 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11258?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shivaram Venkataraman updated SPARK-11258: -- Assignee: Frank Rosner > Converting a Spark DataFrame into an R data.frame is

[jira] [Updated] (SPARK-11297) code example generated by include_example is not exactly the same with {% highlight %}

2015-10-26 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11297?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-11297: -- Assignee: Xusen Yin > code example generated by include_example is not exactly the same with

[jira] [Commented] (SPARK-11332) WeightedLeastSquares should use ml features generic Instance class instead of private

2015-10-26 Thread Nakul Jindal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11332?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14975436#comment-14975436 ] Nakul Jindal commented on SPARK-11332: -- I'll be working on this. > WeightedLeastSquares should use

[jira] [Updated] (SPARK-11331) Kryo serializer broken with StringTypes

2015-10-26 Thread Saif Addin Ellafi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11331?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Saif Addin Ellafi updated SPARK-11331: -- Description: When using --driver-java-options

[jira] [Created] (SPARK-11332) WeightedLeastSquares should use ml features generic Instance class instead of private

2015-10-26 Thread holdenk (JIRA)
holdenk created SPARK-11332: --- Summary: WeightedLeastSquares should use ml features generic Instance class instead of private Key: SPARK-11332 URL: https://issues.apache.org/jira/browse/SPARK-11332 Project:

[jira] [Updated] (SPARK-11330) Filter operation on StringType after groupBy PERSISTED brings no results when there are

2015-10-26 Thread Saif Addin Ellafi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11330?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Saif Addin Ellafi updated SPARK-11330: -- Description: ONLY HAPPENS WHEN PERSIST() IS CALLED val data =

[jira] [Updated] (SPARK-11330) Filter operation on StringType after groupBy PERSISTED brings no results when there are

2015-10-26 Thread Saif Addin Ellafi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11330?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Saif Addin Ellafi updated SPARK-11330: -- Description: ONLY HAPPENS WHEN PERSIST() IS CALLED val data =

[jira] [Commented] (SPARK-11325) Alias alias in Scala's DataFrame to as to match python

2015-10-26 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11325?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14975485#comment-14975485 ] Yin Huai commented on SPARK-11325: -- [~andrewor14] Can you add [~nongli] to our developer or contributor

[jira] [Resolved] (SPARK-11325) Alias alias in Scala's DataFrame to as to match python

2015-10-26 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11325?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai resolved SPARK-11325. -- Resolution: Fixed Fix Version/s: 1.6.0 Issue resolved by pull request 9286

[jira] [Updated] (SPARK-11327) spark-dispatcher doesn't pass along some spark properties

2015-10-26 Thread Alan Braithwaite (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11327?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alan Braithwaite updated SPARK-11327: - Description: I haven't figured out exactly what's going on yet, but there's something in

[jira] [Updated] (SPARK-11327) spark-dispatcher doesn't pass along some spark properties

2015-10-26 Thread Alan Braithwaite (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11327?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alan Braithwaite updated SPARK-11327: - Description: I haven't figured out exactly what's going on yet, but there's something in

[jira] [Updated] (SPARK-11330) Filter operation on StringType after groupBy PERSISTED brings no results when there are

2015-10-26 Thread Saif Addin Ellafi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11330?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Saif Addin Ellafi updated SPARK-11330: -- Summary: Filter operation on StringType after groupBy PERSISTED brings no results when

[jira] [Updated] (SPARK-11330) Filter operation on StringType after groupBy PERSISTED brings no results when there are

2015-10-26 Thread Saif Addin Ellafi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11330?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Saif Addin Ellafi updated SPARK-11330: -- Description: ONLY HAPPENS WHEN PERSIST() IS CALLED val data =

[jira] [Created] (SPARK-11329) Expand Star when creating a struct

2015-10-26 Thread Yin Huai (JIRA)
Yin Huai created SPARK-11329: Summary: Expand Star when creating a struct Key: SPARK-11329 URL: https://issues.apache.org/jira/browse/SPARK-11329 Project: Spark Issue Type: Improvement

[jira] [Updated] (SPARK-11330) Filter operation on StringType after groupBy PERSISTED brings no results when there are

2015-10-26 Thread Saif Addin Ellafi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11330?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Saif Addin Ellafi updated SPARK-11330: -- Description: ONLY HAPPENS WHEN PERSIST() IS CALLED val data =

[jira] [Created] (SPARK-11331) Kryo serializer broken with StringTypes

2015-10-26 Thread Saif Addin Ellafi (JIRA)
Saif Addin Ellafi created SPARK-11331: - Summary: Kryo serializer broken with StringTypes Key: SPARK-11331 URL: https://issues.apache.org/jira/browse/SPARK-11331 Project: Spark Issue

[jira] [Commented] (SPARK-4105) FAILED_TO_UNCOMPRESS(5) errors when fetching shuffle data with sort-based shuffle

2015-10-26 Thread yangwnejia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4105?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14975447#comment-14975447 ] yangwnejia commented on SPARK-4105: --- I Have the same problem in Spark 1.4.0, it gives me these error: 1.

[jira] [Updated] (SPARK-10979) SparkR: Add merge to DataFrame

2015-10-26 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10979?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shivaram Venkataraman updated SPARK-10979: -- Assignee: Narine Kokhlikyan > SparkR: Add merge to DataFrame >

[jira] [Resolved] (SPARK-10979) SparkR: Add merge to DataFrame

2015-10-26 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10979?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shivaram Venkataraman resolved SPARK-10979. --- Resolution: Fixed Fix Version/s: 1.6.0 Issue resolved by pull request

[jira] [Created] (SPARK-11327) spark-dispatcher doesn't pass along some spark properties

2015-10-26 Thread Alan Braithwaite (JIRA)
Alan Braithwaite created SPARK-11327: Summary: spark-dispatcher doesn't pass along some spark properties Key: SPARK-11327 URL: https://issues.apache.org/jira/browse/SPARK-11327 Project: Spark

[jira] [Updated] (SPARK-11327) spark-dispatcher doesn't pass along some spark properties

2015-10-26 Thread Alan Braithwaite (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11327?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alan Braithwaite updated SPARK-11327: - Description: I haven't figured out exactly what's going on yet, but there's something in

[jira] [Resolved] (SPARK-11258) Converting a Spark DataFrame into an R data.frame is slow / requires a lot of memory

2015-10-26 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11258?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shivaram Venkataraman resolved SPARK-11258. --- Resolution: Fixed Fix Version/s: 1.6.0 Issue resolved by pull request

[jira] [Commented] (SPARK-11327) spark-dispatcher doesn't pass along some spark properties

2015-10-26 Thread Alan Braithwaite (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11327?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14975280#comment-14975280 ] Alan Braithwaite commented on SPARK-11327: -- This may warrant opening a separate issue, but

[jira] [Assigned] (SPARK-11326) Split networking in standalone mode

2015-10-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11326?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11326: Assignee: (was: Apache Spark) > Split networking in standalone mode >

[jira] [Commented] (SPARK-11326) Split networking in standalone mode

2015-10-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11326?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14975309#comment-14975309 ] Apache Spark commented on SPARK-11326: -- User 'jacek-lewandowski' has created a pull request for this

[jira] [Created] (SPARK-11328) Correctly propagate error message in the case of failures when writing parquet

2015-10-26 Thread Yin Huai (JIRA)
Yin Huai created SPARK-11328: Summary: Correctly propagate error message in the case of failures when writing parquet Key: SPARK-11328 URL: https://issues.apache.org/jira/browse/SPARK-11328 Project:

[jira] [Updated] (SPARK-11289) Substitute code examples in ML features extractors with include_example

2015-10-26 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11289?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-11289: -- Assignee: Xusen Yin > Substitute code examples in ML features extractors with include_example

[jira] [Updated] (SPARK-11330) Filter operation on StringType after groupBy (2 columns) brings no results when there are

2015-10-26 Thread Saif Addin Ellafi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11330?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Saif Addin Ellafi updated SPARK-11330: -- Summary: Filter operation on StringType after groupBy (2 columns) brings no results

[jira] [Updated] (SPARK-11330) Filter operation on StringType after groupBy (2+ columns) brings no results when there are

2015-10-26 Thread Saif Addin Ellafi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11330?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Saif Addin Ellafi updated SPARK-11330: -- Summary: Filter operation on StringType after groupBy (2+ columns) brings no results

[jira] [Updated] (SPARK-11330) Filter operation on StringType after groupBy PERSISTED brings no results when there are

2015-10-26 Thread Saif Addin Ellafi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11330?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Saif Addin Ellafi updated SPARK-11330: -- Description: ONLY HAPPENS WHEN PERSIST() IS CALLED val data =

[jira] [Commented] (SPARK-6234) 10% Performance regression with Breeze upgrade

2015-10-26 Thread Nishkam Ravi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6234?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14975542#comment-14975542 ] Nishkam Ravi commented on SPARK-6234: - The regression in breeze has been fixed:

[jira] [Assigned] (SPARK-11335) Update documentation on accessing Kafka offsets from Python

2015-10-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11335?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11335: Assignee: (was: Apache Spark) > Update documentation on accessing Kafka offsets from

[jira] [Assigned] (SPARK-11335) Update documentation on accessing Kafka offsets from Python

2015-10-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11335?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11335: Assignee: Apache Spark > Update documentation on accessing Kafka offsets from Python >

[jira] [Updated] (SPARK-11337) Make example code in user guide testable

2015-10-26 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11337?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-11337: -- Description: The example code in the user guide is embedded in the markdown and hence it is

[jira] [Resolved] (SPARK-10562) .partitionBy() creates the metastore partition columns in all lowercase, but persists the data path as MixedCase resulting in an error when the data is later attempted

2015-10-26 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10562?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai resolved SPARK-10562. -- Resolution: Fixed Fix Version/s: 1.6.0 Issue resolved by pull request 9226

[jira] [Commented] (SPARK-11337) Make example code in user guide testable

2015-10-26 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11337?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14975695#comment-14975695 ] Xiangrui Meng commented on SPARK-11337: --- [~yinxusen] I re-organized the JIRAs. Please create

[jira] [Created] (SPARK-11338) HistoryPage not multi-tenancy enabled (app links not prefixed with APPLICATION_WEB_PROXY_BASE)

2015-10-26 Thread Christian Kadner (JIRA)
Christian Kadner created SPARK-11338: Summary: HistoryPage not multi-tenancy enabled (app links not prefixed with APPLICATION_WEB_PROXY_BASE) Key: SPARK-11338 URL:

[jira] [Commented] (SPARK-11255) R Test build should run on R 3.1.1

2015-10-26 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11255?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14975715#comment-14975715 ] Shivaram Venkataraman commented on SPARK-11255: --- Yeah I think the point about having 3.1.1

[jira] [Commented] (SPARK-11338) HistoryPage not multi-tenancy enabled (app links not prefixed with APPLICATION_WEB_PROXY_BASE)

2015-10-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11338?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14975743#comment-14975743 ] Apache Spark commented on SPARK-11338: -- User 'ckadner' has created a pull request for this issue:

[jira] [Assigned] (SPARK-11334) numRunningTasks can't be less than 0, or it will affect executor allocation

2015-10-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11334?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11334: Assignee: (was: Apache Spark) > numRunningTasks can't be less than 0, or it will

[jira] [Commented] (SPARK-11334) numRunningTasks can't be less than 0, or it will affect executor allocation

2015-10-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11334?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14975602#comment-14975602 ] Apache Spark commented on SPARK-11334: -- User 'XuTingjun' has created a pull request for this issue:

[jira] [Assigned] (SPARK-11334) numRunningTasks can't be less than 0, or it will affect executor allocation

2015-10-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11334?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11334: Assignee: Apache Spark > numRunningTasks can't be less than 0, or it will affect executor

[jira] [Commented] (SPARK-11335) Update documentation on accessing Kafka offsets from Python

2015-10-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11335?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14975667#comment-14975667 ] Apache Spark commented on SPARK-11335: -- User 'manygrams' has created a pull request for this issue:

[jira] [Created] (SPARK-11335) Update documentation on accessing Kafka offsets from Python

2015-10-26 Thread Nick Evans (JIRA)
Nick Evans created SPARK-11335: -- Summary: Update documentation on accessing Kafka offsets from Python Key: SPARK-11335 URL: https://issues.apache.org/jira/browse/SPARK-11335 Project: Spark

[jira] [Commented] (SPARK-10383) Sync example code between API doc and user guide

2015-10-26 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10383?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14975687#comment-14975687 ] Xiangrui Meng commented on SPARK-10383: --- [~yinxusen] This JIRA is to sync example code between API

[jira] [Updated] (SPARK-11297) code example generated by include_example is not exactly the same with {% highlight %}

2015-10-26 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11297?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-11297: -- Shepherd: Xiangrui Meng Target Version/s: 1.6.0 > code example generated by

[jira] [Updated] (SPARK-11297) code example generated by include_example is not exactly the same with {% highlight %}

2015-10-26 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11297?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-11297: -- Issue Type: Improvement (was: Sub-task) Parent: (was: SPARK-10383) > code example

[jira] [Updated] (SPARK-11289) Substitute code examples in ML features extractors with include_example

2015-10-26 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11289?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-11289: -- Issue Type: Improvement (was: Sub-task) Parent: (was: SPARK-10383) > Substitute

[jira] [Updated] (SPARK-11289) Substitute code examples in ML features extractors with include_example

2015-10-26 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11289?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-11289: -- Issue Type: Sub-task (was: Improvement) Parent: SPARK-11337 > Substitute code

[jira] [Closed] (SPARK-11284) ALS produces predictions as floats and should be double

2015-10-26 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11284?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng closed SPARK-11284. - Resolution: Not A Problem > ALS produces predictions as floats and should be double >

[jira] [Reopened] (SPARK-11284) ALS produces predictions as floats and should be double

2015-10-26 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11284?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng reopened SPARK-11284: --- > ALS produces predictions as floats and should be double >

  1   2   3   >