[jira] [Commented] (SPARK-9301) collect_set and collect_list aggregate functions

2016-01-28 Thread Cristian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9301?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15121968#comment-15121968 ] Cristian commented on SPARK-9301: - Seconded, looks like MutableAggregationBuffer is not so mutable after

[jira] [Commented] (SPARK-11596) SQL execution very slow for nested query plans because of DataFrame.withNewExecutionId

2015-12-02 Thread Cristian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11596?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15035700#comment-15035700 ] Cristian commented on SPARK-11596: -- That's great, thank you > SQL execution very slow for nested query

[jira] [Commented] (SPARK-11596) SQL execution very slow for nested query plans because of DataFrame.withNewExecutionId

2015-12-01 Thread Cristian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11596?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15034058#comment-15034058 ] Cristian commented on SPARK-11596: -- Any chance this can be fixed soon ? It looks like a straightforward

[jira] [Created] (SPARK-11879) Checkpoint support for DataFrame

2015-11-20 Thread Cristian (JIRA)
Cristian created SPARK-11879: Summary: Checkpoint support for DataFrame Key: SPARK-11879 URL: https://issues.apache.org/jira/browse/SPARK-11879 Project: Spark Issue Type: Improvement

[jira] [Commented] (SPARK-11596) SQL execution very slow for nested query plans because of DataFrame.withNewExecutionId

2015-11-20 Thread Cristian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11596?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15019158#comment-15019158 ] Cristian commented on SPARK-11596: -- Although you are right that this does not reproduce without caching.

[jira] [Updated] (SPARK-11596) SQL execution very slow for nested query plans because of DataFrame.withNewExecutionId

2015-11-20 Thread Cristian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11596?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cristian updated SPARK-11596: - Description: For nested query plans like a recursive unionAll, withExecutionId is extremely slow,

[jira] [Commented] (SPARK-11596) SQL execution very slow for nested query plans because of DataFrame.withNewExecutionId

2015-11-20 Thread Cristian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11596?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15019096#comment-15019096 ] Cristian commented on SPARK-11596: -- Sorry, my repro code was missing a cache() statement. I added it now

[jira] [Commented] (SPARK-11596) SQL execution very slow for nested query plans because of DataFrame.withNewExecutionId

2015-11-20 Thread Cristian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11596?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15019124#comment-15019124 ] Cristian commented on SPARK-11596: -- An easy way to check this is to run the code in local mode, and when

[jira] [Commented] (SPARK-11596) SQL execution very slow for nested query plans because of DataFrame.withNewExecutionId

2015-11-20 Thread Cristian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11596?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15019241#comment-15019241 ] Cristian commented on SPARK-11596: -- Ok, I found a much simpler repro. Note the below does not actually

[jira] [Commented] (SPARK-11596) SQL execution very slow for nested query plans because of DataFrame.withNewExecutionId

2015-11-20 Thread Cristian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11596?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15019267#comment-15019267 ] Cristian commented on SPARK-11596: -- Looks like the problem is here:

[jira] [Issue Comment Deleted] (SPARK-11596) SQL execution very slow for nested query plans because of DataFrame.withNewExecutionId

2015-11-20 Thread Cristian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11596?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cristian updated SPARK-11596: - Comment: was deleted (was: An easy way to check this is to run the code in local mode, and when it

[jira] [Commented] (SPARK-11596) SQL execution very slow for nested query plans because of DataFrame.withNewExecutionId

2015-11-18 Thread Cristian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11596?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15012386#comment-15012386 ] Cristian commented on SPARK-11596: -- This is not what this is about. It's very useful to have UnionAll

[jira] [Updated] (SPARK-11596) SQL execution very slow for nested query plans because of DataFrame.withNewExecutionId

2015-11-09 Thread Cristian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11596?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cristian updated SPARK-11596: - Description: For nested query plans like a recursive unionAll, withExecutionId is extremely slow,

[jira] [Created] (SPARK-11596) SQL execution very slow for nested query plans because of DataFrame.withNewExecutionId

2015-11-09 Thread Cristian (JIRA)
Cristian created SPARK-11596: Summary: SQL execution very slow for nested query plans because of DataFrame.withNewExecutionId Key: SPARK-11596 URL: https://issues.apache.org/jira/browse/SPARK-11596

[jira] [Updated] (SPARK-11596) SQL execution very slow for nested query plans because of DataFrame.withNewExecutionId

2015-11-09 Thread Cristian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11596?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cristian updated SPARK-11596: - Description: For nested query plans like a recursive unionAll, withExecutionId is extremely slow,

[jira] [Commented] (SPARK-4849) Pass partitioning information (distribute by) to In-memory caching

2015-07-20 Thread Cristian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4849?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14634316#comment-14634316 ] Cristian commented on SPARK-4849: - I would argue that the priority for this is not Minor

[jira] [Created] (SPARK-8435) Cannot create tables in an specific database using a provider

2015-06-18 Thread Cristian (JIRA)
Cristian created SPARK-8435: --- Summary: Cannot create tables in an specific database using a provider Key: SPARK-8435 URL: https://issues.apache.org/jira/browse/SPARK-8435 Project: Spark Issue

[jira] [Commented] (SPARK-5863) Improve performance of convertToScala codepath.

2015-03-22 Thread Cristian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5863?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14375243#comment-14375243 ] Cristian commented on SPARK-5863: - I'm a bit confused. The original jira refers to a very