[jira] [Commented] (SPARK-6848) Update outdated links of documents from .html to .md

2015-04-10 Thread Wang Haihua (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6848?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14490768#comment-14490768 ] Wang Haihua commented on SPARK-6848: OK, saw it, and forgive for my newbie behavior.

[jira] [Updated] (SPARK-5680) Sum function on all null values, should return zero

2015-04-10 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5680?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-5680: Assignee: Venkata Ramana G Sum function on all null values, should return zero

[jira] [Updated] (SPARK-6479) Create external block store API

2015-04-10 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6479?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-6479: --- Summary: Create external block store API (was: Create off-heap block storage API (internal))

[jira] [Updated] (SPARK-6479) Create external block store API

2015-04-10 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6479?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-6479: --- Description: Would be great to create APIs for external block stores, rather than doing a bunch of

[jira] [Created] (SPARK-6851) Wrong answers for self joins of converted parquet relations

2015-04-10 Thread Michael Armbrust (JIRA)
Michael Armbrust created SPARK-6851: --- Summary: Wrong answers for self joins of converted parquet relations Key: SPARK-6851 URL: https://issues.apache.org/jira/browse/SPARK-6851 Project: Spark

[jira] [Commented] (SPARK-6479) Create external block store API

2015-04-10 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6479?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14490320#comment-14490320 ] Reynold Xin commented on SPARK-6479: [~zhanzhang] I thought about this more -- can you

[jira] [Commented] (SPARK-6851) Wrong answers for self joins of converted parquet relations

2015-04-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6851?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14490339#comment-14490339 ] Apache Spark commented on SPARK-6851: - User 'marmbrus' has created a pull request for

[jira] [Assigned] (SPARK-6851) Wrong answers for self joins of converted parquet relations

2015-04-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6851?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6851: --- Assignee: Michael Armbrust (was: Apache Spark) Wrong answers for self joins of converted

[jira] [Assigned] (SPARK-6851) Wrong answers for self joins of converted parquet relations

2015-04-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6851?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6851: --- Assignee: Apache Spark (was: Michael Armbrust) Wrong answers for self joins of converted

[jira] [Updated] (SPARK-5969) The pyspark.rdd.sortByKey always fills only two partitions when ascending=False.

2015-04-10 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5969?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-5969: -- Assignee: Milan Straka The pyspark.rdd.sortByKey always fills only two partitions when

[jira] [Resolved] (SPARK-5969) The pyspark.rdd.sortByKey always fills only two partitions when ascending=False.

2015-04-10 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5969?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen resolved SPARK-5969. --- Resolution: Fixed Fix Version/s: 1.4.0 Issue resolved by pull request 4761

[jira] [Updated] (SPARK-6529) Word2Vec transformer

2015-04-10 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6529?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-6529: - Assignee: Xusen Yin Word2Vec transformer Key:

[jira] [Commented] (SPARK-6529) Word2Vec transformer

2015-04-10 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6529?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14490352#comment-14490352 ] Joseph K. Bradley commented on SPARK-6529: -- Sure, I'll assign it to you. Thanks!

[jira] [Assigned] (SPARK-6850) SparkR flaky unit tests when run on Jenkins

2015-04-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6850?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6850: --- Assignee: Apache Spark (was: Davies Liu) SparkR flaky unit tests when run on Jenkins

[jira] [Assigned] (SPARK-6850) SparkR flaky unit tests when run on Jenkins

2015-04-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6850?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6850: --- Assignee: Davies Liu (was: Apache Spark) SparkR flaky unit tests when run on Jenkins

[jira] [Resolved] (SPARK-6216) Check Python version in worker before run PySpark job

2015-04-10 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6216?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen resolved SPARK-6216. --- Resolution: Fixed Fix Version/s: 1.4.0 Issue resolved by pull request 5404

[jira] [Commented] (SPARK-6850) SparkR flaky unit tests when run on Jenkins

2015-04-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6850?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14490366#comment-14490366 ] Apache Spark commented on SPARK-6850: - User 'davies' has created a pull request for

[jira] [Resolved] (SPARK-6709) SparkSQL cannot parse sql correctly when the table contains count column.

2015-04-10 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6709?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-6709. - Resolution: Won't Fix Use `backticks` for identifiers that are reserved words: {{SELECT

[jira] [Created] (SPARK-6852) Accept numeric as numPartitions in SparkR

2015-04-10 Thread Davies Liu (JIRA)
Davies Liu created SPARK-6852: - Summary: Accept numeric as numPartitions in SparkR Key: SPARK-6852 URL: https://issues.apache.org/jira/browse/SPARK-6852 Project: Spark Issue Type: Improvement

[jira] [Updated] (SPARK-5969) The pyspark.rdd.sortByKey always fills only two partitions when ascending=False.

2015-04-10 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5969?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-5969: -- Fix Version/s: 1.2.3 1.3.2 The pyspark.rdd.sortByKey always fills only two

[jira] [Updated] (SPARK-5969) The pyspark.rdd.sortByKey always fills only two partitions when ascending=False.

2015-04-10 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5969?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-5969: -- Affects Version/s: 1.3.1 1.0.2 1.1.1 The

[jira] [Resolved] (SPARK-6850) SparkR flaky unit tests when run on Jenkins

2015-04-10 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6850?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shivaram Venkataraman resolved SPARK-6850. -- Resolution: Fixed Fix Version/s: 1.4.0 Issue resolved by pull request

[jira] [Resolved] (SPARK-6851) Wrong answers for self joins of converted parquet relations

2015-04-10 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6851?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-6851. - Resolution: Fixed Fix Version/s: 1.4.0 1.3.1 Issue resolved by

[jira] [Created] (SPARK-6853) Contents of .globalenv in workers

2015-04-10 Thread Antonio Piccolboni (JIRA)
Antonio Piccolboni created SPARK-6853: - Summary: Contents of .globalenv in workers Key: SPARK-6853 URL: https://issues.apache.org/jira/browse/SPARK-6853 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-5808) Assembly generated by sbt does not contain pyspark

2015-04-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5808?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14490536#comment-14490536 ] Apache Spark commented on SPARK-5808: - User 'vanzin' has created a pull request for

[jira] [Assigned] (SPARK-5808) Assembly generated by sbt does not contain pyspark

2015-04-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5808?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-5808: --- Assignee: (was: Apache Spark) Assembly generated by sbt does not contain pyspark

[jira] [Assigned] (SPARK-5808) Assembly generated by sbt does not contain pyspark

2015-04-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5808?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-5808: --- Assignee: Apache Spark Assembly generated by sbt does not contain pyspark

[jira] [Resolved] (SPARK-6620) Speed up toDF() and rdd() functions by constructing converters in ScalaReflection

2015-04-10 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6620?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-6620. - Resolution: Fixed Fix Version/s: 1.4.0 Issue resolved by pull request 5279

[jira] [Created] (SPARK-6854) No support for get and eval-quote

2015-04-10 Thread Antonio Piccolboni (JIRA)
Antonio Piccolboni created SPARK-6854: - Summary: No support for get and eval-quote Key: SPARK-6854 URL: https://issues.apache.org/jira/browse/SPARK-6854 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-6839) BlockManager.dataDeserialize leaks resources on user exceptions

2015-04-10 Thread Ilya Ganelin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6839?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14490545#comment-14490545 ] Ilya Ganelin commented on SPARK-6839: - Imran - I can knock this out. Thanks!

[jira] [Created] (SPARK-6855) Set R includes in each file to get right collate order

2015-04-10 Thread Shivaram Venkataraman (JIRA)
Shivaram Venkataraman created SPARK-6855: Summary: Set R includes in each file to get right collate order Key: SPARK-6855 URL: https://issues.apache.org/jira/browse/SPARK-6855 Project: Spark

[jira] [Assigned] (SPARK-6855) Set R includes in each file to get right collate order

2015-04-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6855?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6855: --- Assignee: Apache Spark (was: Shivaram Venkataraman) Set R includes in each file to get

[jira] [Commented] (SPARK-6855) Set R includes in each file to get right collate order

2015-04-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6855?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14490554#comment-14490554 ] Apache Spark commented on SPARK-6855: - User 'shivaram' has created a pull request for

[jira] [Assigned] (SPARK-6855) Set R includes in each file to get right collate order

2015-04-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6855?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6855: --- Assignee: Shivaram Venkataraman (was: Apache Spark) Set R includes in each file to get

[jira] [Created] (SPARK-6856) Make RDD information more useful in SparkR

2015-04-10 Thread Shivaram Venkataraman (JIRA)
Shivaram Venkataraman created SPARK-6856: Summary: Make RDD information more useful in SparkR Key: SPARK-6856 URL: https://issues.apache.org/jira/browse/SPARK-6856 Project: Spark

[jira] [Commented] (SPARK-6839) BlockManager.dataDeserialize leaks resources on user exceptions

2015-04-10 Thread Ilya Ganelin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6839?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14490576#comment-14490576 ] Ilya Ganelin commented on SPARK-6839: - The obvious solution won't work. Adding a

[jira] [Comment Edited] (SPARK-6839) BlockManager.dataDeserialize leaks resources on user exceptions

2015-04-10 Thread Ilya Ganelin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6839?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14490576#comment-14490576 ] Ilya Ganelin edited comment on SPARK-6839 at 4/11/15 12:07 AM:

[jira] [Comment Edited] (SPARK-6839) BlockManager.dataDeserialize leaks resources on user exceptions

2015-04-10 Thread Ilya Ganelin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6839?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14490576#comment-14490576 ] Ilya Ganelin edited comment on SPARK-6839 at 4/11/15 12:09 AM:

[jira] [Commented] (SPARK-6839) BlockManager.dataDeserialize leaks resources on user exceptions

2015-04-10 Thread Imran Rashid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6839?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14490636#comment-14490636 ] Imran Rashid commented on SPARK-6839: - [~ilganeli] sorry I am already on it! I should

[jira] [Assigned] (SPARK-6839) BlockManager.dataDeserialize leaks resources on user exceptions

2015-04-10 Thread Imran Rashid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6839?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Imran Rashid reassigned SPARK-6839: --- Assignee: Imran Rashid BlockManager.dataDeserialize leaks resources on user exceptions

[jira] [Commented] (SPARK-6849) The constructor of GradientDescent should be public

2015-04-10 Thread Guoqiang Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6849?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14490642#comment-14490642 ] Guoqiang Li commented on SPARK-6849: [~srowen] This should be a bug. The

[jira] [Created] (SPARK-6857) Python SQL schema inference should support numpy types

2015-04-10 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-6857: Summary: Python SQL schema inference should support numpy types Key: SPARK-6857 URL: https://issues.apache.org/jira/browse/SPARK-6857 Project: Spark

[jira] [Assigned] (SPARK-5095) Support launching multiple mesos executors in coarse grained mesos mode

2015-04-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5095?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-5095: --- Assignee: Timothy Chen (was: Apache Spark) Support launching multiple mesos executors in

[jira] [Assigned] (SPARK-5095) Support launching multiple mesos executors in coarse grained mesos mode

2015-04-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5095?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-5095: --- Assignee: Apache Spark (was: Timothy Chen) Support launching multiple mesos executors in

[jira] [Assigned] (SPARK-6839) BlockManager.dataDeserialize leaks resources on user exceptions

2015-04-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6839?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6839: --- Assignee: Apache Spark (was: Imran Rashid) BlockManager.dataDeserialize leaks resources on

[jira] [Assigned] (SPARK-6839) BlockManager.dataDeserialize leaks resources on user exceptions

2015-04-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6839?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6839: --- Assignee: Imran Rashid (was: Apache Spark) BlockManager.dataDeserialize leaks resources on

[jira] [Commented] (SPARK-6839) BlockManager.dataDeserialize leaks resources on user exceptions

2015-04-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6839?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14490684#comment-14490684 ] Apache Spark commented on SPARK-6839: - User 'squito' has created a pull request for

[jira] [Assigned] (SPARK-6710) Wrong initial bias in GraphX SVDPlusPlus

2015-04-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6710?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6710: --- Assignee: Apache Spark Wrong initial bias in GraphX SVDPlusPlus

[jira] [Commented] (SPARK-6710) Wrong initial bias in GraphX SVDPlusPlus

2015-04-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6710?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14490692#comment-14490692 ] Apache Spark commented on SPARK-6710: - User 'michaelmalak' has created a pull request

[jira] [Assigned] (SPARK-6710) Wrong initial bias in GraphX SVDPlusPlus

2015-04-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6710?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6710: --- Assignee: (was: Apache Spark) Wrong initial bias in GraphX SVDPlusPlus

[jira] [Commented] (SPARK-6244) Implement VectorSpace to easy create a complicated feature vector

2015-04-10 Thread Kirill A. Korinskiy (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6244?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14490714#comment-14490714 ] Kirill A. Korinskiy commented on SPARK-6244: Sean, sorry for long response.