[jira] [Commented] (SPARK-13300) Spark examples page gives errors : Liquid error: pygments

2016-02-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13300?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15145857#comment-15145857 ] Apache Spark commented on SPARK-13300: -- User 'amitdev' has created a pull request fo

[jira] [Assigned] (SPARK-13300) Spark examples page gives errors : Liquid error: pygments

2016-02-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13300?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13300: Assignee: Apache Spark > Spark examples page gives errors : Liquid error: pygments >

[jira] [Assigned] (SPARK-13300) Spark examples page gives errors : Liquid error: pygments

2016-02-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13300?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13300: Assignee: (was: Apache Spark) > Spark examples page gives errors : Liquid error: pygme

[jira] [Updated] (SPARK-12772) Better error message for syntax error in the SQL parser

2016-02-12 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12772?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-12772: Summary: Better error message for syntax error in the SQL parser (was: Better error message for sy

[jira] [Commented] (SPARK-12772) Better error message for parsing failure?

2016-02-12 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12772?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15145842#comment-15145842 ] Reynold Xin commented on SPARK-12772: - [~hvanhovell] would you have time to look into

[jira] [Updated] (SPARK-12772) Better error message for syntax error in the parser

2016-02-12 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12772?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-12772: Summary: Better error message for syntax error in the parser (was: Better error message for parsin

[jira] [Issue Comment Deleted] (SPARK-12772) Better error message for parsing failure?

2016-02-12 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12772?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-12772: Comment: was deleted (was: cc [~hvanhovell] / [~viirya] any idea about this one? ) > Better error

[jira] [Closed] (SPARK-6763) CountMinSketch

2016-02-12 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6763?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin closed SPARK-6763. -- Resolution: Duplicate Assignee: Reynold Xin (was: Liang-Chi Hsieh) Fix Version/s: 2.0.0

[jira] [Updated] (SPARK-13307) TPCDS query 66 degraded by 30% in 1.6.0 compared to 1.4.1

2016-02-12 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13307?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-13307: Labels: (was: spark, sql,) > TPCDS query 66 degraded by 30% in 1.6.0 compared to 1.4.1 >

[jira] [Commented] (SPARK-13307) TPCDS query 66 degraded by 30% in 1.6.0 compared to 1.4.1

2016-02-12 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13307?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15145829#comment-15145829 ] Xiao Li commented on SPARK-13307: - Please use explain(true). It will be much easier to an

[jira] [Commented] (SPARK-7367) spark-submit CLI --help -h overrides the application arguments

2016-02-12 Thread bimal tandel (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7367?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15145766#comment-15145766 ] bimal tandel commented on SPARK-7367: - I can't reproduce this on the latest release. Th

[jira] [Commented] (SPARK-12154) Upgrade to Jersey 2

2016-02-12 Thread Matt Cheah (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12154?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15145752#comment-15145752 ] Matt Cheah commented on SPARK-12154: Sorry this had to be pushed back - but I'll work

[jira] [Commented] (SPARK-12154) Upgrade to Jersey 2

2016-02-12 Thread Milad Khajavi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12154?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15145750#comment-15145750 ] Milad Khajavi commented on SPARK-12154: --- Hmm, Good point for changing pom version a

[jira] [Commented] (SPARK-12154) Upgrade to Jersey 2

2016-02-12 Thread Andrew Ash (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12154?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15145746#comment-15145746 ] Andrew Ash commented on SPARK-12154: [~khajavi] would you please give it a go? [~mch

[jira] [Updated] (SPARK-12154) Upgrade to Jersey 2

2016-02-12 Thread Andrew Ash (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12154?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Ash updated SPARK-12154: --- Issue Type: Sub-task (was: Improvement) Parent: SPARK-11806 > Upgrade to Jersey 2 >

[jira] [Assigned] (SPARK-13308) ManagedBuffers passed to OneToOneStreamManager need to be freed in non-error cases

2016-02-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13308?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13308: Assignee: Apache Spark (was: Josh Rosen) > ManagedBuffers passed to OneToOneStreamManager

[jira] [Commented] (SPARK-13308) ManagedBuffers passed to OneToOneStreamManager need to be freed in non-error cases

2016-02-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13308?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15145727#comment-15145727 ] Apache Spark commented on SPARK-13308: -- User 'JoshRosen' has created a pull request

[jira] [Assigned] (SPARK-13308) ManagedBuffers passed to OneToOneStreamManager need to be freed in non-error cases

2016-02-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13308?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13308: Assignee: Josh Rosen (was: Apache Spark) > ManagedBuffers passed to OneToOneStreamManager

[jira] [Created] (SPARK-13308) ManagedBuffers passed to OneToOneStreamManager need to be freed in non-error cases

2016-02-12 Thread Josh Rosen (JIRA)
Josh Rosen created SPARK-13308: -- Summary: ManagedBuffers passed to OneToOneStreamManager need to be freed in non-error cases Key: SPARK-13308 URL: https://issues.apache.org/jira/browse/SPARK-13308 Projec

[jira] [Commented] (SPARK-13257) Refine naive Bayes example code

2016-02-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13257?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15145720#comment-15145720 ] Apache Spark commented on SPARK-13257: -- User 'movelikeriver' has created a pull requ

[jira] [Resolved] (SPARK-13293) Generate code for Expand

2016-02-12 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13293?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-13293. - Resolution: Fixed Fix Version/s: 2.0.0 > Generate code for Expand > --

[jira] [Commented] (SPARK-7367) spark-submit CLI --help -h overrides the application arguments

2016-02-12 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7367?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15145651#comment-15145651 ] Marcelo Vanzin commented on SPARK-7367: --- I think this bug has been fixed since 1.4

[jira] [Comment Edited] (SPARK-10086) Flaky StreamingKMeans test in PySpark

2016-02-12 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10086?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15145628#comment-15145628 ] Bryan Cutler edited comment on SPARK-10086 at 2/13/16 12:44 AM: ---

[jira] [Updated] (SPARK-10086) Flaky StreamingKMeans test in PySpark

2016-02-12 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10086?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bryan Cutler updated SPARK-10086: - Attachment: flakyRepro.py Simple script with similar operations to this StreamingKMeans test, use

[jira] [Commented] (SPARK-10086) Flaky StreamingKMeans test in PySpark

2016-02-12 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10086?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15145625#comment-15145625 ] Bryan Cutler commented on SPARK-10086: -- I was able to track down the cause of these

[jira] [Commented] (SPARK-12544) Support window functions in SQLContext

2016-02-12 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12544?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15145581#comment-15145581 ] Davies Liu commented on SPARK-12544: We are retiring HiveContext in 2.0, we may updat

[jira] [Commented] (SPARK-13297) [SQL] Backticks cannot be escaped in column names

2016-02-12 Thread Xiu (Joe) Guo (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13297?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15145566#comment-15145566 ] Xiu (Joe) Guo commented on SPARK-13297: --- Looks like in the current [master branch|

[jira] [Created] (SPARK-13307) TPCDS query 66 degraded by 35% in 1.6.0 compared to 1.4.1

2016-02-12 Thread JESSE CHEN (JIRA)
JESSE CHEN created SPARK-13307: -- Summary: TPCDS query 66 degraded by 35% in 1.6.0 compared to 1.4.1 Key: SPARK-13307 URL: https://issues.apache.org/jira/browse/SPARK-13307 Project: Spark Issue T

[jira] [Updated] (SPARK-13307) TPCDS query 66 degraded by 30% in 1.6.0 compared to 1.4.1

2016-02-12 Thread JESSE CHEN (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13307?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] JESSE CHEN updated SPARK-13307: --- Summary: TPCDS query 66 degraded by 30% in 1.6.0 compared to 1.4.1 (was: TPCDS query 66 degraded by

[jira] [Assigned] (SPARK-7367) spark-submit CLI --help -h overrides the application arguments

2016-02-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7367?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7367: --- Assignee: (was: Apache Spark) > spark-submit CLI --help -h overrides the application argu

[jira] [Commented] (SPARK-7367) spark-submit CLI --help -h overrides the application arguments

2016-02-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7367?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15145520#comment-15145520 ] Apache Spark commented on SPARK-7367: - User 'BimalTandel' has created a pull request f

[jira] [Assigned] (SPARK-7367) spark-submit CLI --help -h overrides the application arguments

2016-02-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7367?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7367: --- Assignee: Apache Spark > spark-submit CLI --help -h overrides the application arguments > ---

[jira] [Commented] (SPARK-13305) With SPARK_WORKER_WEBUI_PORT and --webui-port set for start-slave.sh script, --webui-port is used twice

2016-02-12 Thread Jacek Laskowski (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13305?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15145512#comment-15145512 ] Jacek Laskowski commented on SPARK-13305: - I was explicit about the different way

[jira] [Commented] (SPARK-7367) spark-submit CLI --help -h overrides the application arguments

2016-02-12 Thread bimal tandel (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7367?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15145490#comment-15145490 ] bimal tandel commented on SPARK-7367: - I had the same problem today and I wrote the patc

[jira] [Updated] (SPARK-13304) Broadcast join with two ints could be very slow

2016-02-12 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13304?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu updated SPARK-13304: --- Component/s: SQL > Broadcast join with two ints could be very slow >

[jira] [Updated] (SPARK-13306) Uncorrelated scalar subquery

2016-02-12 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13306?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu updated SPARK-13306: --- Component/s: SQL > Uncorrelated scalar subquery > > > Ke

[jira] [Commented] (SPARK-12917) Add DML support to Spark SQL for HIVE

2016-02-12 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12917?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15145479#comment-15145479 ] Herman van Hovell commented on SPARK-12917: --- It is currently not planned, and w

[jira] [Commented] (SPARK-13306) Uncorrelated scalar subquery

2016-02-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13306?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15145476#comment-15145476 ] Apache Spark commented on SPARK-13306: -- User 'davies' has created a pull request for

[jira] [Assigned] (SPARK-13306) Uncorrelated scalar subquery

2016-02-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13306?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13306: Assignee: Apache Spark (was: Davies Liu) > Uncorrelated scalar subquery > ---

[jira] [Assigned] (SPARK-13306) Uncorrelated scalar subquery

2016-02-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13306?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13306: Assignee: Davies Liu (was: Apache Spark) > Uncorrelated scalar subquery > ---

[jira] [Commented] (SPARK-12544) Support window functions in SQLContext

2016-02-12 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12544?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15145467#comment-15145467 ] Herman van Hovell commented on SPARK-12544: --- [~davies] No, they only require a

[jira] [Created] (SPARK-13306) Uncorrelated scalar subquery

2016-02-12 Thread Davies Liu (JIRA)
Davies Liu created SPARK-13306: -- Summary: Uncorrelated scalar subquery Key: SPARK-13306 URL: https://issues.apache.org/jira/browse/SPARK-13306 Project: Spark Issue Type: New Feature

[jira] [Commented] (SPARK-12544) Support window functions in SQLContext

2016-02-12 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12544?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15145442#comment-15145442 ] Davies Liu commented on SPARK-12544: [~hvanhovell] Do window functions still require

[jira] [Resolved] (SPARK-12544) Support window functions in SQLContext

2016-02-12 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12544?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu resolved SPARK-12544. Resolution: Fixed Assignee: Herman van Hovell Fix Version/s: 2.0.0 Since we updated

[jira] [Commented] (SPARK-13080) Implementation of the internal catalog API using Hive

2016-02-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13080?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15145431#comment-15145431 ] Apache Spark commented on SPARK-13080: -- User 'andrewor14' has created a pull request

[jira] [Resolved] (SPARK-12630) Make Parameter Descriptions Consistent for PySpark MLlib Classification

2016-02-12 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12630?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-12630. --- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 11183 [https://g

[jira] [Updated] (SPARK-6166) Limit number of in flight outbound requests for shuffle fetch

2016-02-12 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6166?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu updated SPARK-6166: Assignee: Sanket Reddy > Limit number of in flight outbound requests for shuffle fetch > ---

[jira] [Commented] (SPARK-13305) With SPARK_WORKER_WEBUI_PORT and --webui-port set for start-slave.sh script, --webui-port is used twice

2016-02-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13305?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15145396#comment-15145396 ] Sean Owen commented on SPARK-13305: --- That looks like what I'd expect it to do. You set

[jira] [Created] (SPARK-13305) With SPARK_WORKER_WEBUI_PORT and --webui-port set for start-slave.sh script, --webui-port is used twice

2016-02-12 Thread Jacek Laskowski (JIRA)
Jacek Laskowski created SPARK-13305: --- Summary: With SPARK_WORKER_WEBUI_PORT and --webui-port set for start-slave.sh script, --webui-port is used twice Key: SPARK-13305 URL: https://issues.apache.org/jira/browse/

[jira] [Created] (SPARK-13304) Broadcast join with two ints could be very slow

2016-02-12 Thread Davies Liu (JIRA)
Davies Liu created SPARK-13304: -- Summary: Broadcast join with two ints could be very slow Key: SPARK-13304 URL: https://issues.apache.org/jira/browse/SPARK-13304 Project: Spark Issue Type: Bug

[jira] [Resolved] (SPARK-12962) PySpark support covar_samp and covar_pop

2016-02-12 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12962?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu resolved SPARK-12962. Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 10876 [https://github.

[jira] [Commented] (SPARK-13287) Standalone REST API throttling?

2016-02-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15145254#comment-15145254 ] Sean Owen commented on SPARK-13287: --- Yeah, I was guessing/hoping that wasn't quite righ

[jira] [Commented] (SPARK-5095) Support launching multiple mesos executors in coarse grained mesos mode

2016-02-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5095?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15145239#comment-15145239 ] Apache Spark commented on SPARK-5095: - User 'mgummelt' has created a pull request for

[jira] [Commented] (SPARK-9763) Minimize exposure of internal SQL classes

2016-02-12 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9763?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15145229#comment-15145229 ] Reynold Xin commented on SPARK-9763: [~flysjy] is that caused by this ticket? > Mini

[jira] [Commented] (SPARK-5095) Support launching multiple mesos executors in coarse grained mesos mode

2016-02-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5095?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15145211#comment-15145211 ] Apache Spark commented on SPARK-5095: - User 'mgummelt' has created a pull request for

[jira] [Commented] (SPARK-12632) Make Parameter Descriptions Consistent for PySpark MLlib FPM and Recommendation

2016-02-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12632?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15145207#comment-15145207 ] Apache Spark commented on SPARK-12632: -- User 'BryanCutler' has created a pull reques

[jira] [Resolved] (SPARK-13260) count(*) does not work with CSV data source

2016-02-12 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13260?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-13260. - Resolution: Fixed Assignee: Hyukjin Kwon Fix Version/s: 2.0.0 > count(*) does not

[jira] [Commented] (SPARK-13288) [1.6.0] Memory leak in Spark streaming

2016-02-12 Thread JESSE CHEN (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13288?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15145174#comment-15145174 ] JESSE CHEN commented on SPARK-13288: I have the heapdumps from 1.5 and 1.6. They are

[jira] [Commented] (SPARK-13301) PySpark Dataframe return wrong results with custom UDF

2016-02-12 Thread Xiu (Joe) Guo (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13301?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15145173#comment-15145173 ] Xiu (Joe) Guo commented on SPARK-13301: --- Hi Simone: How long is the string length

[jira] [Commented] (SPARK-13288) [1.6.0] Memory leak in Spark streaming

2016-02-12 Thread JESSE CHEN (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13288?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15145148#comment-15145148 ] JESSE CHEN commented on SPARK-13288: Maybe "heap exhaustion" is a better term to call th

[jira] [Assigned] (SPARK-13253) Error aliasing array columns.

2016-02-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13253?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13253: Assignee: (was: Apache Spark) > Error aliasing array columns. > --

[jira] [Assigned] (SPARK-13253) Error aliasing array columns.

2016-02-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13253?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13253: Assignee: Apache Spark > Error aliasing array columns. > - > >

[jira] [Commented] (SPARK-13253) Error aliasing array columns.

2016-02-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13253?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15145088#comment-15145088 ] Apache Spark commented on SPARK-13253: -- User 'kevinyu98' has created a pull request

[jira] [Commented] (SPARK-10777) order by fails when column is aliased and projection includes windowed aggregate

2016-02-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10777?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15145084#comment-15145084 ] Apache Spark commented on SPARK-10777: -- User 'kevinyu98' has created a pull request

[jira] [Commented] (SPARK-12261) pyspark crash for large dataset

2016-02-12 Thread Niall McCarroll (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12261?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15145077#comment-15145077 ] Niall McCarroll commented on SPARK-12261: - In various windows environments I've t

[jira] [Commented] (SPARK-12917) Add DML support to Spark SQL for HIVE

2016-02-12 Thread Hemang Nagar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12917?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15145057#comment-15145057 ] Hemang Nagar commented on SPARK-12917: -- Yes it is a transaction table feature, and s

[jira] [Commented] (SPARK-13287) Standalone REST API throttling?

2016-02-12 Thread Rares Vernica (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15144995#comment-15144995 ] Rares Vernica commented on SPARK-13287: --- See the description: {quote} The response

[jira] [Commented] (SPARK-12251) Document Spark 1.6's off-heap memory configurations and add config validation

2016-02-12 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12251?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15144988#comment-15144988 ] Josh Rosen commented on SPARK-12251: It's off by default; the 1.6.0 documentation was

[jira] [Commented] (SPARK-12251) Document Spark 1.6's off-heap memory configurations and add config validation

2016-02-12 Thread Ovidiu Marcu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12251?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15144986#comment-15144986 ] Ovidiu Marcu commented on SPARK-12251: -- Reading through the latest documentation for

[jira] [Commented] (SPARK-12630) Make Parameter Descriptions Consistent for PySpark MLlib Classification

2016-02-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12630?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15144981#comment-15144981 ] Apache Spark commented on SPARK-12630: -- User 'BryanCutler' has created a pull reques

[jira] [Resolved] (SPARK-13282) LogicalPlan toSql should just return a String rather than Option[String]

2016-02-12 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13282?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-13282. - Resolution: Fixed Fix Version/s: 2.0.0 > LogicalPlan toSql should just return a String rat

[jira] [Commented] (SPARK-13303) Spark fails with pandas import error when pandas is not explicitly imported by user

2016-02-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13303?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15144974#comment-15144974 ] Sean Owen commented on SPARK-13303: --- Agree, I think this is one of those big "known iss

[jira] [Created] (SPARK-13303) Spark fails with pandas import error when pandas is not explicitly imported by user

2016-02-12 Thread Juliet Hougland (JIRA)
Juliet Hougland created SPARK-13303: --- Summary: Spark fails with pandas import error when pandas is not explicitly imported by user Key: SPARK-13303 URL: https://issues.apache.org/jira/browse/SPARK-13303

[jira] [Resolved] (SPARK-12705) Sorting column can't be resolved if it's not in projection

2016-02-12 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12705?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu resolved SPARK-12705. Resolution: Fixed Issue resolved by pull request 11153 [https://github.com/apache/spark/pull/11153]

[jira] [Commented] (SPARK-12261) pyspark crash for large dataset

2016-02-12 Thread Christopher Bourez (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12261?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15144901#comment-15144901 ] Christopher Bourez commented on SPARK-12261: Sean, how can I get the executor

[jira] [Commented] (SPARK-12261) pyspark crash for large dataset

2016-02-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12261?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15144880#comment-15144880 ] Sean Owen commented on SPARK-12261: --- This is still just the driver log. > pyspark cras

[jira] [Commented] (SPARK-12261) pyspark crash for large dataset

2016-02-12 Thread Christopher Bourez (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12261?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15144873#comment-15144873 ] Christopher Bourez commented on SPARK-12261: Here is what I see when I activa

[jira] [Commented] (SPARK-12261) pyspark crash for large dataset

2016-02-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12261?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15144857#comment-15144857 ] Sean Owen commented on SPARK-12261: --- The change above is definitely not correct in gene

[jira] [Commented] (SPARK-12261) pyspark crash for large dataset

2016-02-12 Thread Christopher Bourez (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12261?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15144844#comment-15144844 ] Christopher Bourez commented on SPARK-12261: Sean Owen, do you reconsider the

[jira] [Created] (SPARK-13302) Cleanup Docstests in ml/clustering.py

2016-02-12 Thread holdenk (JIRA)
holdenk created SPARK-13302: --- Summary: Cleanup Docstests in ml/clustering.py Key: SPARK-13302 URL: https://issues.apache.org/jira/browse/SPARK-13302 Project: Spark Issue Type: Test Compon

[jira] [Commented] (SPARK-12261) pyspark crash for large dataset

2016-02-12 Thread Christopher Bourez (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12261?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15144793#comment-15144793 ] Christopher Bourez commented on SPARK-12261: Dear Niall Your solution works v

[jira] [Comment Edited] (SPARK-12261) pyspark crash for large dataset

2016-02-12 Thread Niall McCarroll (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12261?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15144734#comment-15144734 ] Niall McCarroll edited comment on SPARK-12261 at 2/12/16 3:53 PM: -

[jira] [Commented] (SPARK-12261) pyspark crash for large dataset

2016-02-12 Thread Niall McCarroll (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12261?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15144734#comment-15144734 ] Niall McCarroll commented on SPARK-12261: - As a workaround you might try the foll

[jira] [Closed] (SPARK-13290) wholeTextFile and binaryFiles are really slow

2016-02-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13290?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen closed SPARK-13290. - > wholeTextFile and binaryFiles are really slow > - > >

[jira] [Resolved] (SPARK-13290) wholeTextFile and binaryFiles are really slow

2016-02-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13290?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-13290. --- Resolution: Not A Problem Yes, just reading a file length locally is going to be much much faster tha

[jira] [Commented] (SPARK-13300) Spark examples page gives errors : Liquid error: pygments

2016-02-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13300?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15144723#comment-15144723 ] Sean Owen commented on SPARK-13300: --- Yes, it's irrelevant -- you can see this in the so

[jira] [Commented] (SPARK-13300) Spark examples page gives errors : Liquid error: pygments

2016-02-12 Thread stefan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13300?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15144717#comment-15144717 ] stefan commented on SPARK-13300: Happening on Windows 7 now. Chrome. Internet Explorer 11

[jira] [Updated] (SPARK-13301) PySpark Dataframe return wrong results with custom UDF

2016-02-12 Thread Simone (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13301?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Simone updated SPARK-13301: --- Environment: PySpark in yarn-client mode - CDH 5.5.1 (was: PySpark - CDH 5.5.1) > PySpark Dataframe return

[jira] [Updated] (SPARK-13301) PySpark Dataframe return wrong results with custom UDF

2016-02-12 Thread Simone (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13301?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Simone updated SPARK-13301: --- Description: Using a User Defined Function in PySpark inside the withColumn() method of Dataframe, gives wro

[jira] [Updated] (SPARK-13301) PySpark Dataframe return wrong results with custom UDF

2016-02-12 Thread Simone (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13301?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Simone updated SPARK-13301: --- Description: Using a User Defined Function in PySpark inside the withColumn() method of Dataframe, gives wro

[jira] [Updated] (SPARK-13301) PySpark Dataframe return wrong results with custom UDF

2016-02-12 Thread Simone (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13301?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Simone updated SPARK-13301: --- Description: Using a User Defined Function in PySpark inside the withColumn() method of Dataframe, gives wro

[jira] [Updated] (SPARK-13301) PySpark Dataframe return wrong results with custom UDF

2016-02-12 Thread Simone (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13301?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Simone updated SPARK-13301: --- Description: Using a User Defined Function in PySpark inside the withColumn() method of Dataframe, gives wro

[jira] [Created] (SPARK-13301) PySpark Dataframe return wrong results with custom UDF

2016-02-12 Thread Simone (JIRA)
Simone created SPARK-13301: -- Summary: PySpark Dataframe return wrong results with custom UDF Key: SPARK-13301 URL: https://issues.apache.org/jira/browse/SPARK-13301 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-13289) Word2Vec generate infinite distances when numIterations>5

2016-02-12 Thread Qi Dai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13289?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15144685#comment-15144685 ] Qi Dai commented on SPARK-13289: I'm using the "One Billion Words Language Modeling" data

[jira] [Reopened] (SPARK-13290) wholeTextFile and binaryFiles are really slow

2016-02-12 Thread mathieu longtin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13290?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] mathieu longtin reopened SPARK-13290: - Slow relative to reading the exact same file on a local disk on the same machine. Python wil

[jira] [Commented] (SPARK-13300) Spark examples page gives errors : Liquid error: pygments

2016-02-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13300?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15144626#comment-15144626 ] Sean Owen commented on SPARK-13300: --- Hm, I see that on all browsers on OS X too. I won

[jira] [Commented] (SPARK-12583) spark shuffle fails with mesos after 2mins

2016-02-12 Thread Adrian Bridgett (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12583?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15144624#comment-15144624 ] Adrian Bridgett commented on SPARK-12583: - Phew - thought maybe it was a bit odd

[jira] [Commented] (SPARK-13300) Spark examples page gives errors : Liquid error: pygments

2016-02-12 Thread stefan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13300?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15144621#comment-15144621 ] stefan commented on SPARK-13300: jekyll has never been installed on this machine. is it n

[jira] [Created] (SPARK-13300) Spark examples page gives errors : Liquid error: pygments

2016-02-12 Thread stefan (JIRA)
stefan created SPARK-13300: -- Summary: Spark examples page gives errors : Liquid error: pygments Key: SPARK-13300 URL: https://issues.apache.org/jira/browse/SPARK-13300 Project: Spark Issue Type: Qu

[jira] [Resolved] (SPARK-13299) DataFrame limit operation is not consistent

2016-02-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13299?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-13299. --- Resolution: Not A Problem Unless your DataFrame has a defined ordering, I don't think you'd expect th
