[jira] [Commented] (SPARK-12111) need upgrade instruction

2015-12-02 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12111?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15036863#comment-15036863 ] Sean Owen commented on SPARK-12111: --- Are you asking about spark-ec2 specifically? this generally is not

[jira] [Commented] (SPARK-12111) need upgrade instruction

2015-12-02 Thread Andrew Davidson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12111?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15036888#comment-15036888 ] Andrew Davidson commented on SPARK-12111: - This is where someone that knows the details of how

[jira] [Commented] (SPARK-12111) need upgrade instruction

2015-12-02 Thread Andrew Davidson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12111?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15036887#comment-15036887 ] Andrew Davidson commented on SPARK-12111: - Hi Sean I understand I will need to stop by cluster

[jira] [Commented] (SPARK-10878) Race condition when resolving Maven coordinates via Ivy

2015-12-02 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10878?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15036896#comment-15036896 ] Josh Rosen commented on SPARK-10878: Also, ping [~brkyvz] > Race condition when resolving Maven

[jira] [Reopened] (SPARK-12111) need upgrade instruction

2015-12-02 Thread Andrew Davidson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12111?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Davidson reopened SPARK-12111: - Hi Sean It must be possible for customers to upgrade installations. Given Spark is written

[jira] [Commented] (SPARK-10878) Race condition when resolving Maven coordinates via Ivy

2015-12-02 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10878?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15036893#comment-15036893 ] Josh Rosen commented on SPARK-10878: I think this is because Spark's use of the Ivy cache and Ivy

[jira] [Assigned] (SPARK-12082) NettyBlockTransferSecuritySuite "security mismatch auth off on client" test is flaky

2015-12-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12082?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12082: Assignee: Apache Spark > NettyBlockTransferSecuritySuite "security mismatch auth off on

[jira] [Assigned] (SPARK-12082) NettyBlockTransferSecuritySuite "security mismatch auth off on client" test is flaky

2015-12-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12082?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12082: Assignee: (was: Apache Spark) > NettyBlockTransferSecuritySuite "security mismatch

[jira] [Assigned] (SPARK-12082) NettyBlockTransferSecuritySuite "security mismatch auth off on client" test is flaky

2015-12-02 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12082?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen reassigned SPARK-12082: -- Assignee: Josh Rosen > NettyBlockTransferSecuritySuite "security mismatch auth off on client"

[jira] [Assigned] (SPARK-12094) Better format for query plan tree string

2015-12-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12094?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12094: Assignee: Cheng Lian (was: Apache Spark) > Better format for query plan tree string >

[jira] [Assigned] (SPARK-12048) JDBCRDD calls close() twice - SQLite then throws an exception

2015-12-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12048?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12048: Assignee: (was: Apache Spark) > JDBCRDD calls close() twice - SQLite then throws an

[jira] [Commented] (SPARK-11605) ML 1.6 QA: API: Java compatibility, docs

2015-12-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11605?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15035605#comment-15035605 ] Apache Spark commented on SPARK-11605: -- User 'hhbyyh' has created a pull request for this issue:

[jira] [Assigned] (SPARK-11605) ML 1.6 QA: API: Java compatibility, docs

2015-12-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11605?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11605: Assignee: Apache Spark (was: yuhao yang) > ML 1.6 QA: API: Java compatibility, docs >

[jira] [Assigned] (SPARK-12048) JDBCRDD calls close() twice - SQLite then throws an exception

2015-12-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12048?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12048: Assignee: Apache Spark > JDBCRDD calls close() twice - SQLite then throws an exception >

[jira] [Assigned] (SPARK-11605) ML 1.6 QA: API: Java compatibility, docs

2015-12-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11605?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11605: Assignee: yuhao yang (was: Apache Spark) > ML 1.6 QA: API: Java compatibility, docs >

[jira] [Updated] (SPARK-11905) [SQL] Support Persist/Cache and Unpersist in Dataset APIs

2015-12-02 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11905?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-11905: -- Assignee: Xiao Li > [SQL] Support Persist/Cache and Unpersist in Dataset APIs >

[jira] [Updated] (SPARK-12084) Fix codes that uses ByteBuffer.array incorrectly

2015-12-02 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12084?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-12084: -- Component/s: Spark Core > Fix codes that uses ByteBuffer.array incorrectly >

[jira] [Updated] (SPARK-12068) use a single column in Dataset.groupBy and count will fail

2015-12-02 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12068?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-12068: -- Assignee: Wenchen Fan > use a single column in Dataset.groupBy and count will fail >

[jira] [Commented] (SPARK-11638) Apache Spark in Docker with Bridge networking / run Spark on Mesos, in Docker with Bridge networking

2015-12-02 Thread Radoslaw Gruchalski (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11638?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15035632#comment-15035632 ] Radoslaw Gruchalski commented on SPARK-11638: - No problem, wasn't aware. "New guy" syndrome.

[jira] [Commented] (SPARK-12016) word2vec load model can't use findSynonyms to get words

2015-12-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12016?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15035580#comment-15035580 ] Apache Spark commented on SPARK-12016: -- User 'viirya' has created a pull request for this issue:

[jira] [Updated] (SPARK-10436) spark-submit overwrites spark.files defaults with the job script filename

2015-12-02 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10436?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-10436: -- Target Version/s: (was: 1.6.0) > spark-submit overwrites spark.files defaults with the job script

[jira] [Commented] (SPARK-12094) Better format for query plan tree string

2015-12-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12094?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15035539#comment-15035539 ] Apache Spark commented on SPARK-12094: -- User 'liancheng' has created a pull request for this issue:

[jira] [Assigned] (SPARK-12094) Better format for query plan tree string

2015-12-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12094?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12094: Assignee: Apache Spark (was: Cheng Lian) > Better format for query plan tree string >

[jira] [Commented] (SPARK-11939) PySpark support model export/import for Pipeline API

2015-12-02 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11939?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15035600#comment-15035600 ] Yanbo Liang commented on SPARK-11939: - OK, I will make an umbrella for this feature. > PySpark

[jira] [Commented] (SPARK-12048) JDBCRDD calls close() twice - SQLite then throws an exception

2015-12-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12048?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15035604#comment-15035604 ] Apache Spark commented on SPARK-12048: -- User 'rh99' has created a pull request for this issue:

[jira] [Updated] (SPARK-11638) Apache Spark in Docker with Bridge networking / run Spark on Mesos, in Docker with Bridge networking

2015-12-02 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11638?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-11638: -- Target Version/s: (was: 1.4.0, 1.4.1, 1.5.0, 1.5.1, 1.5.2, 1.6.0) [~radekg] don't set Target version

[jira] [Updated] (SPARK-10043) Add window functions into SparkR

2015-12-02 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10043?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-10043: -- Target Version/s: (was: 1.6.0) > Add window functions into SparkR >

[jira] [Updated] (SPARK-9857) Add expression functions into SparkR which conflict with the existing R's generic

2015-12-02 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9857?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-9857: - Target Version/s: (was: 1.6.0) > Add expression functions into SparkR which conflict with the existing

[jira] [Resolved] (SPARK-3580) Add Consistent Method To Get Number of RDD Partitions Across Different Languages

2015-12-02 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3580?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-3580. -- Resolution: Fixed Assignee: Jeroen Schot Fix Version/s: 1.6.0 Resolved by

[jira] [Updated] (SPARK-11822) Add docs for new Netty RPC configuration

2015-12-02 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11822?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-11822: -- Target Version/s: (was: 1.6.0) Priority: Minor (was: Major) > Add docs for new Netty

[jira] [Updated] (SPARK-9972) Add `struct`, `encode` and `decode` function in SparkR

2015-12-02 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9972?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-9972: - Target Version/s: (was: 1.6.0) > Add `struct`, `encode` and `decode` function in SparkR >

[jira] [Commented] (SPARK-7889) Jobs progress of apps on complete page of HistoryServer shows uncompleted

2015-12-02 Thread Steve Loughran (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7889?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15035678#comment-15035678 ] Steve Loughran commented on SPARK-7889: --- While I work on this, I suspect one of the issues is the

[jira] [Updated] (SPARK-8360) Streaming DataFrames

2015-12-02 Thread Cheng Hao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8360?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Hao updated SPARK-8360: - Attachment: StreamingDataFrameProposal.pdf This is a proposal for streaming dataframes that we were

[jira] [Comment Edited] (SPARK-8360) Streaming DataFrames

2015-12-02 Thread Cheng Hao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8360?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15035335#comment-15035335 ] Cheng Hao edited comment on SPARK-8360 at 12/2/15 12:14 PM: Remove the google

[jira] [Commented] (SPARK-4117) Spark on Yarn handle AM being told command from RM

2015-12-02 Thread Devaraj K (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4117?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15035721#comment-15035721 ] Devaraj K commented on SPARK-4117: -- Thanks [~tgraves] for the pointer. I will provide PR for this to

[jira] [Commented] (SPARK-11596) SQL execution very slow for nested query plans because of DataFrame.withNewExecutionId

2015-12-02 Thread Cristian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11596?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15035700#comment-15035700 ] Cristian commented on SPARK-11596: -- That's great, thank you > SQL execution very slow for nested query

[jira] [Comment Edited] (SPARK-10911) Executors should System.exit on clean shutdown

2015-12-02 Thread Jaromir Vanek (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10911?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15035752#comment-15035752 ] Jaromir Vanek edited comment on SPARK-10911 at 12/2/15 1:04 PM: {quote}

[jira] [Resolved] (SPARK-11065) IOException thrown at job submit shutdown

2015-12-02 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11065?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-11065. --- Resolution: Cannot Reproduce Fix Version/s: (was: 1.6.0) > IOException thrown at job

[jira] [Reopened] (SPARK-11065) IOException thrown at job submit shutdown

2015-12-02 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11065?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen reopened SPARK-11065: --- Assignee: Jean-Baptiste Onofré > IOException thrown at job submit shutdown >

[jira] [Commented] (SPARK-10911) Executors should System.exit on clean shutdown

2015-12-02 Thread Jaromir Vanek (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10911?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15035752#comment-15035752 ] Jaromir Vanek commented on SPARK-10911: --- {{quote}} why isn't YARN killing the executors? {{quote}}

[jira] [Resolved] (SPARK-11065) IOException thrown at job submit shutdown

2015-12-02 Thread Jean-Baptiste Onofré (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11065?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jean-Baptiste Onofré resolved SPARK-11065. -- Resolution: Fixed Fix Version/s: 1.6.0 > IOException thrown at job

[jira] [Commented] (SPARK-11065) IOException thrown at job submit shutdown

2015-12-02 Thread Jean-Baptiste Onofré (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11065?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15035773#comment-15035773 ] Jean-Baptiste Onofré commented on SPARK-11065: -- I just tested and it's now fixed. Can you

[jira] [Commented] (SPARK-12103) KafkaUtils createStream with multiple topics -- does not work as expected

2015-12-02 Thread Dan Dutrow (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15036625#comment-15036625 ] Dan Dutrow commented on SPARK-12103: After digging into the Kafka code some more (specifically

[jira] [Created] (SPARK-12106) Flaky Test: BatchedWriteAheadLog - name log with aggregated entries with the timestamp of last entry

2015-12-02 Thread Burak Yavuz (JIRA)
Burak Yavuz created SPARK-12106: --- Summary: Flaky Test: BatchedWriteAheadLog - name log with aggregated entries with the timestamp of last entry Key: SPARK-12106 URL:

[jira] [Updated] (SPARK-12106) Flaky Test: BatchedWriteAheadLog - name log with aggregated entries with the timestamp of last entry

2015-12-02 Thread Burak Yavuz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12106?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Burak Yavuz updated SPARK-12106: Description: This test is still transiently flaky, because async methods can finish out of order,

[jira] [Commented] (SPARK-12107) Update spark-ec2 versions

2015-12-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12107?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15036672#comment-15036672 ] Apache Spark commented on SPARK-12107: -- User 'nchammas' has created a pull request for this issue:

[jira] [Assigned] (SPARK-12107) Update spark-ec2 versions

2015-12-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12107?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12107: Assignee: Apache Spark > Update spark-ec2 versions > - > >

[jira] [Assigned] (SPARK-12107) Update spark-ec2 versions

2015-12-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12107?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12107: Assignee: (was: Apache Spark) > Update spark-ec2 versions > -

[jira] [Created] (SPARK-12107) Update spark-ec2 versions

2015-12-02 Thread Nicholas Chammas (JIRA)
Nicholas Chammas created SPARK-12107: Summary: Update spark-ec2 versions Key: SPARK-12107 URL: https://issues.apache.org/jira/browse/SPARK-12107 Project: Spark Issue Type: Bug

[jira] [Updated] (SPARK-8517) Improve the organization and style of MLlib's user guide

2015-12-02 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8517?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-8517: - Shepherd: Xiangrui Meng > Improve the organization and style of MLlib's user guide >

[jira] [Commented] (SPARK-8517) Improve the organization and style of MLlib's user guide

2015-12-02 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8517?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15036704#comment-15036704 ] Xiangrui Meng commented on SPARK-8517: -- * We should only mention MLlib specific types, like vectors

[jira] [Updated] (SPARK-12001) StreamingContext cannot be completely stopped if the stop() call is interrupted

2015-12-02 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12001?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-12001: --- Labels: backport-needed (was: ) > StreamingContext cannot be completely stopped if the stop() call

[jira] [Resolved] (SPARK-12001) StreamingContext cannot be completely stopped if the stop() call is interrupted

2015-12-02 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12001?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen resolved SPARK-12001. Resolution: Fixed Fix Version/s: 1.7.0 Issue resolved by pull request 9982

[jira] [Created] (SPARK-12108) Event logs are much bigger in 1.6 than in 1.5

2015-12-02 Thread Andrew Or (JIRA)
Andrew Or created SPARK-12108: - Summary: Event logs are much bigger in 1.6 than in 1.5 Key: SPARK-12108 URL: https://issues.apache.org/jira/browse/SPARK-12108 Project: Spark Issue Type: Bug

[jira] [Created] (SPARK-12109) Expressions's simpleString should delegate to its toString

2015-12-02 Thread Yin Huai (JIRA)
Yin Huai created SPARK-12109: Summary: Expressions's simpleString should delegate to its toString Key: SPARK-12109 URL: https://issues.apache.org/jira/browse/SPARK-12109 Project: Spark Issue

[jira] [Comment Edited] (SPARK-12103) KafkaUtils createStream with multiple topics -- does not work as expected

2015-12-02 Thread Dan Dutrow (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15036625#comment-15036625 ] Dan Dutrow edited comment on SPARK-12103 at 12/2/15 10:38 PM: -- After digging

[jira] [Created] (SPARK-12111) need upgrade instruction

2015-12-02 Thread Andrew Davidson (JIRA)
Andrew Davidson created SPARK-12111: --- Summary: need upgrade instruction Key: SPARK-12111 URL: https://issues.apache.org/jira/browse/SPARK-12111 Project: Spark Issue Type: Documentation

[jira] [Commented] (SPARK-12072) python dataframe ._jdf.schema().json() breaks on large metadata dataframes

2015-12-02 Thread Rares Mirica (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12072?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15035823#comment-15035823 ] Rares Mirica commented on SPARK-12072: -- My set is in the millions of parameters. I believe you are

[jira] [Created] (SPARK-12095) Window function rowsBetween throws exception

2015-12-02 Thread Irakli Machabeli (JIRA)
Irakli Machabeli created SPARK-12095: Summary: Window function rowsBetween throws exception Key: SPARK-12095 URL: https://issues.apache.org/jira/browse/SPARK-12095 Project: Spark Issue

[jira] [Commented] (SPARK-11944) Python API for mllib.clustering.BisectingKMeans

2015-12-02 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11944?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15035844#comment-15035844 ] holdenk commented on SPARK-11944: - I'll start working on this. > Python API for

[jira] [Created] (SPARK-12096) remove the old constraint in word2vec

2015-12-02 Thread yuhao yang (JIRA)
yuhao yang created SPARK-12096: -- Summary: remove the old constraint in word2vec Key: SPARK-12096 URL: https://issues.apache.org/jira/browse/SPARK-12096 Project: Spark Issue Type: Improvement

[jira] [Commented] (SPARK-11081) Make spark-core pull in Jersey and javax.ws.rs dependencies separately for easier overriding

2015-12-02 Thread Matt Cheah (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11081?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15037040#comment-15037040 ] Matt Cheah commented on SPARK-11081: Quick question as I develop this as I'm fairly new to Maven -

[jira] [Commented] (SPARK-12089) java.lang.NegativeArraySizeException when growing BufferHolder

2015-12-02 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12089?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15037018#comment-15037018 ] Yin Huai commented on SPARK-12089: -- The stacktrace I have is {code} 15/12/02 01:10:43 ERROR

[jira] [Commented] (SPARK-12110) spark-1.5.1-bin-hadoop2.6; pyspark.ml.feature Exception: ("You must build Spark with Hive

2015-12-02 Thread Andrew Davidson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12110?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15037017#comment-15037017 ] Andrew Davidson commented on SPARK-12110: - Hi Patrick Here is how I start my notebook on my

[jira] [Commented] (SPARK-12114) ColumnPruning rule fails in case of "Project <- Filter <- Join"

2015-12-02 Thread Min Qiu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12114?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15037023#comment-15037023 ] Min Qiu commented on SPARK-12114: - The [pull request|https://github.com/apache/spark/pull/10087] is

[jira] [Commented] (SPARK-12110) spark-1.5.1-bin-hadoop2.6; pyspark.ml.feature Exception: ("You must build Spark with Hive

2015-12-02 Thread Andrew Davidson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12110?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15037022#comment-15037022 ] Andrew Davidson commented on SPARK-12110: - Hi Patrick when I run the same example code on my

[jira] [Updated] (SPARK-12114) ColumnPruning rule fails in case of "Project <- Filter <- Join"

2015-12-02 Thread Min Qiu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12114?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Min Qiu updated SPARK-12114: External issue URL: https://github.com/apache/spark/pull/10087 > ColumnPruning rule fails in case of

[jira] [Created] (SPARK-12115) Change numPartitions() in RDD to be "getNumPartitions" to be consistent with pyspark/scala

2015-12-02 Thread Sun Rui (JIRA)
Sun Rui created SPARK-12115: --- Summary: Change numPartitions() in RDD to be "getNumPartitions" to be consistent with pyspark/scala Key: SPARK-12115 URL: https://issues.apache.org/jira/browse/SPARK-12115

[jira] [Commented] (SPARK-12104) collect() does not handle multiple columns with same name

2015-12-02 Thread Sun Rui (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12104?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15037071#comment-15037071 ] Sun Rui commented on SPARK-12104: - I will investigate it > collect() does not handle multiple columns
