[jira] [Commented] (SPARK-7502) DAG visualization: handle removed stages gracefully

2015-05-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7502?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14542711#comment-14542711 ] Apache Spark commented on SPARK-7502: - User 'andrewor14' has created a pull request

[jira] [Assigned] (SPARK-7502) DAG visualization: handle removed stages gracefully

2015-05-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7502?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7502: --- Assignee: Andrew Or (was: Apache Spark) DAG visualization: handle removed stages

[jira] [Assigned] (SPARK-7502) DAG visualization: handle removed stages gracefully

2015-05-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7502?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7502: --- Assignee: Apache Spark (was: Andrew Or) DAG visualization: handle removed stages

[jira] [Created] (SPARK-7615) WordVector divided by Euclidean Norm equals to zero

2015-05-13 Thread Eric Li (JIRA)
Eric Li created SPARK-7615: -- Summary: WordVector divided by Euclidean Norm equals to zero Key: SPARK-7615 URL: https://issues.apache.org/jira/browse/SPARK-7615 Project: Spark Issue Type: Bug

[jira] [Assigned] (SPARK-6837) SparkR failure in processClosure

2015-05-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6837?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6837: --- Assignee: (was: Apache Spark) SparkR failure in processClosure

[jira] [Commented] (SPARK-6837) SparkR failure in processClosure

2015-05-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6837?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14542780#comment-14542780 ] Apache Spark commented on SPARK-6837: - User 'hlin09' has created a pull request for

[jira] [Comment Edited] (SPARK-6289) PySpark doesn't maintain SQL date Types

2015-05-13 Thread Michael Nazario (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6289?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14542807#comment-14542807 ] Michael Nazario edited comment on SPARK-6289 at 5/13/15 9:59 PM:

[jira] [Created] (SPARK-7619) autodoc_docstring_signature doesn't work for some classes/methods under spark.ml

2015-05-13 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-7619: Summary: autodoc_docstring_signature doesn't work for some classes/methods under spark.ml Key: SPARK-7619 URL: https://issues.apache.org/jira/browse/SPARK-7619

[jira] [Commented] (SPARK-7511) PySpark ML seed Param should be random by default

2015-05-13 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7511?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14542875#comment-14542875 ] Joseph K. Bradley commented on SPARK-7511: -- Sure, thanks! :) The generated

[jira] [Updated] (SPARK-7511) PySpark ML seed Param should be random by default

2015-05-13 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7511?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-7511: - Description: Currently, Scala's HasSeed mix-in uses a random Long as the default value

[jira] [Resolved] (SPARK-7303) push down project if possible when the child is sort

2015-05-13 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7303?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-7303. - Resolution: Fixed Issue resolved by pull request 5838

[jira] [Closed] (SPARK-7502) DAG visualization: handle removed stages gracefully

2015-05-13 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7502?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or closed SPARK-7502. Resolution: Fixed Fix Version/s: 1.4.0 DAG visualization: handle removed stages gracefully

[jira] [Closed] (SPARK-7399) Master fails on 2.11 with compilation error

2015-05-13 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7399?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or closed SPARK-7399. Resolution: Pending Closed Target Version/s: 1.4.0 Master fails on 2.11 with compilation error

[jira] [Closed] (SPARK-7464) DAG visualization: highlight the same RDDs on hover

2015-05-13 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7464?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or closed SPARK-7464. Resolution: Fixed Fix Version/s: 1.4.0 DAG visualization: highlight the same RDDs on hover

[jira] [Created] (SPARK-7620) Removed calling size, length in while condition to avoid extra JVM call

2015-05-13 Thread DB Tsai (JIRA)
DB Tsai created SPARK-7620: -- Summary: Removed calling size, length in while condition to avoid extra JVM call Key: SPARK-7620 URL: https://issues.apache.org/jira/browse/SPARK-7620 Project: Spark

[jira] [Commented] (SPARK-7511) PySpark ML seed Param should be random by default

2015-05-13 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7511?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14542864#comment-14542864 ] holdenk commented on SPARK-7511: I can do this :) PySpark ML seed Param should be random

[jira] [Closed] (SPARK-7356) Flaky test: o.a.s.streaming.flume.FlumePollingStreamSuite

2015-05-13 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7356?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or closed SPARK-7356. Resolution: Fixed Fix Version/s: 1.4.0 Flaky test: o.a.s.streaming.flume.FlumePollingStreamSuite

[jira] [Resolved] (SPARK-7601) Support Insert into JDBC Datasource

2015-05-13 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7601?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-7601. - Resolution: Fixed Issue resolved by pull request 6121

[jira] [Comment Edited] (SPARK-6289) PySpark doesn't maintain SQL date Types

2015-05-13 Thread Michael Nazario (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6289?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14542807#comment-14542807 ] Michael Nazario edited comment on SPARK-6289 at 5/13/15 10:02 PM:

[jira] [Commented] (SPARK-2883) Spark Support for ORCFile format

2015-05-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2883?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14542903#comment-14542903 ] Apache Spark commented on SPARK-2883: - User 'zhzhan' has created a pull request for

[jira] [Resolved] (SPARK-7081) Faster sort-based shuffle path using binary processing cache-aware sort

2015-05-13 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7081?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-7081. Resolution: Fixed Fix Version/s: 1.4.0 Faster sort-based shuffle path using binary

[jira] [Assigned] (SPARK-7620) Removed calling size, length in while condition to avoid extra JVM call

2015-05-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7620?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7620: --- Assignee: Apache Spark Removed calling size, length in while condition to avoid extra JVM

[jira] [Assigned] (SPARK-7620) Removed calling size, length in while condition to avoid extra JVM call

2015-05-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7620?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7620: --- Assignee: (was: Apache Spark) Removed calling size, length in while condition to avoid

[jira] [Commented] (SPARK-7620) Removed calling size, length in while condition to avoid extra JVM call

2015-05-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7620?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14542956#comment-14542956 ] Apache Spark commented on SPARK-7620: - User 'dbtsai' has created a pull request for

[jira] [Resolved] (SPARK-7382) Python API for ml.classification

2015-05-13 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7382?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-7382. -- Resolution: Fixed Issue resolved by pull request 6106

[jira] [Commented] (SPARK-7356) Flaky test: o.a.s.streaming.flume.FlumePollingStreamSuite

2015-05-13 Thread Hari Shreedharan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7356?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14542971#comment-14542971 ] Hari Shreedharan commented on SPARK-7356: - Thanks [~andrewor14]. If this test

[jira] [Resolved] (SPARK-7602) add lz4 decompressor for streamContext.fileStream

2015-05-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7602?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-7602. -- Resolution: Invalid Fix Version/s: (was: 1.3.1) Target Version/s: (was: 1.4.0)

[jira] [Commented] (SPARK-6261) Python MLlib API missing items: Feature

2015-05-13 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6261?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14542122#comment-14542122 ] Yanbo Liang commented on SPARK-6261: [~josephkb] I think all the subtasks of this

[jira] [Created] (SPARK-7604) Python API for PCA and PCAModel

2015-05-13 Thread Yanbo Liang (JIRA)
Yanbo Liang created SPARK-7604: -- Summary: Python API for PCA and PCAModel Key: SPARK-7604 URL: https://issues.apache.org/jira/browse/SPARK-7604 Project: Spark Issue Type: New Feature

[jira] [Updated] (SPARK-7536) Audit MLlib Python API for 1.4

2015-05-13 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7536?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-7536: --- Description: For new public APIs added to MLlib, we need to check the generated HTML doc and compare

[jira] [Created] (SPARK-7606) Document all PySpark SQL/DataFrame public methods with @since tag

2015-05-13 Thread Nicholas Chammas (JIRA)
Nicholas Chammas created SPARK-7606: --- Summary: Document all PySpark SQL/DataFrame public methods with @since tag Key: SPARK-7606 URL: https://issues.apache.org/jira/browse/SPARK-7606 Project: Spark

[jira] [Commented] (SPARK-7606) Document all PySpark SQL/DataFrame public methods with @since tag

2015-05-13 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7606?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14542176#comment-14542176 ] Nicholas Chammas commented on SPARK-7606: - (I just cloned SPARK-7588.) Dunno what

[jira] [Resolved] (SPARK-6734) Support GenericUDTF.close for Generate

2015-05-13 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6734?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian resolved SPARK-6734. --- Resolution: Fixed Issue resolved by pull request 5383 [https://github.com/apache/spark/pull/5383]

[jira] [Updated] (SPARK-7601) Support Insert into JDBC Datasource

2015-05-13 Thread Venkata Ramana G (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7601?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Venkata Ramana G updated SPARK-7601: Description: Support Insert into JDBCDataSource. Following are usage examples {code}

[jira] [Created] (SPARK-7603) Crash of thrift server when doing SQL without limit

2015-05-13 Thread Ihor Bobak (JIRA)
Ihor Bobak created SPARK-7603: - Summary: Crash of thrift server when doing SQL without limit Key: SPARK-7603 URL: https://issues.apache.org/jira/browse/SPARK-7603 Project: Spark Issue Type: Bug

[jira] [Updated] (SPARK-7536) Audit MLlib Python API for 1.4

2015-05-13 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7536?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-7536: --- Description: For new public APIs added to MLlib, we need to check the generated HTML doc and compare

[jira] [Updated] (SPARK-7536) Audit MLlib Python API for 1.4

2015-05-13 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7536?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-7536: --- Description: For new public APIs added to MLlib, we need to check the generated HTML doc and compare

[jira] [Created] (SPARK-7605) Python API for ElementwiseProduct

2015-05-13 Thread Yanbo Liang (JIRA)
Yanbo Liang created SPARK-7605: -- Summary: Python API for ElementwiseProduct Key: SPARK-7605 URL: https://issues.apache.org/jira/browse/SPARK-7605 Project: Spark Issue Type: New Feature

[jira] [Updated] (SPARK-7536) Audit MLlib Python API for 1.4

2015-05-13 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7536?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-7536: --- Description: For new public APIs added to MLlib, we need to check the generated HTML doc and compare

[jira] [Commented] (SPARK-7507) pyspark.sql.types.StructType and Row should implement __iter__()

2015-05-13 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7507?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14542175#comment-14542175 ] Nicholas Chammas commented on SPARK-7507: - cc [~davies] On a related note to our

[jira] [Updated] (SPARK-7536) Audit MLlib Python API for 1.4

2015-05-13 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7536?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-7536: --- Description: For new public APIs added to MLlib, we need to check the generated HTML doc and compare

[jira] [Updated] (SPARK-7576) User guide update for spark.ml ElementwiseProduct

2015-05-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7576?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-7576: - Component/s: ML User guide update for spark.ml ElementwiseProduct

[jira] [Commented] (SPARK-3056) Sort-based Aggregation

2015-05-13 Thread Jianshi Huang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3056?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14542148#comment-14542148 ] Jianshi Huang commented on SPARK-3056: -- Will [SPARK-2926] alone enough for this

[jira] [Commented] (SPARK-7603) Crash of thrift server when doing SQL without limit

2015-05-13 Thread Ihor Bobak (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7603?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14542153#comment-14542153 ] Ihor Bobak commented on SPARK-7603: --- I've just downloaded 1.2.2 and configured exactly

[jira] [Commented] (SPARK-5081) Shuffle write increases

2015-05-13 Thread Dr. Christian Betz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5081?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14542152#comment-14542152 ] Dr. Christian Betz commented on SPARK-5081: --- Yes, I think so too. Just tell me

[jira] [Updated] (SPARK-7505) Update PySpark DataFrame docs: encourage __getitem__, mark as experimental, etc.

2015-05-13 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7505?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nicholas Chammas updated SPARK-7505: Description: The PySpark docs for DataFrame need the following fixes and improvements: #

[jira] [Created] (SPARK-7622) Test Jira

2015-05-13 Thread Patrick Wendell (JIRA)
Patrick Wendell created SPARK-7622: -- Summary: Test Jira Key: SPARK-7622 URL: https://issues.apache.org/jira/browse/SPARK-7622 Project: Spark Issue Type: Bug Reporter: Patrick

[jira] [Reopened] (SPARK-7622) Test Jira

2015-05-13 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7622?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell reopened SPARK-7622: Test Jira - Key: SPARK-7622 URL:

[jira] [Closed] (SPARK-7622) Test Jira

2015-05-13 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7622?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell closed SPARK-7622. -- Resolution: Invalid Test Jira - Key: SPARK-7622

[jira] [Resolved] (SPARK-7622) Test Jira

2015-05-13 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7622?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-7622. - Resolution: Fixed Fix Version/s: 1.4.0 Issue resolved by pull request 6136

[jira] [Created] (SPARK-7625) Actions on DataFrame created from HIVE table with newly added column throw NPE

2015-05-13 Thread Zhiyang Guo (JIRA)
Zhiyang Guo created SPARK-7625: -- Summary: Actions on DataFrame created from HIVE table with newly added column throw NPE Key: SPARK-7625 URL: https://issues.apache.org/jira/browse/SPARK-7625 Project:

[jira] [Updated] (SPARK-7626) Actions on DataFrame created from HIVE table with newly added column throw NPE

2015-05-13 Thread Zhiyang Guo (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7626?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhiyang Guo updated SPARK-7626: --- Affects Version/s: 1.3.1 Actions on DataFrame created from HIVE table with newly added column throw

[jira] [Resolved] (SPARK-6752) Allow StreamingContext to be recreated from checkpoint and existing SparkContext

2015-05-13 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6752?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das resolved SPARK-6752. -- Resolution: Fixed Allow StreamingContext to be recreated from checkpoint and existing

[jira] [Created] (SPARK-7621) Report KafkaReceiver MessageHandler errors so StreamingListeners can take action

2015-05-13 Thread Jeremy A. Lucas (JIRA)
Jeremy A. Lucas created SPARK-7621: -- Summary: Report KafkaReceiver MessageHandler errors so StreamingListeners can take action Key: SPARK-7621 URL: https://issues.apache.org/jira/browse/SPARK-7621

[jira] [Comment Edited] (SPARK-7621) Report KafkaReceiver MessageHandler errors so StreamingListeners can take action

2015-05-13 Thread Jeremy A. Lucas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7621?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14542998#comment-14542998 ] Jeremy A. Lucas edited comment on SPARK-7621 at 5/14/15 12:48 AM:

[jira] [Updated] (SPARK-7621) Report KafkaReceiver MessageHandler errors so StreamingListeners can take action

2015-05-13 Thread Jeremy A. Lucas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7621?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeremy A. Lucas updated SPARK-7621: --- Fix Version/s: 1.3.1 Report KafkaReceiver MessageHandler errors so StreamingListeners can

[jira] [Updated] (SPARK-7621) Report KafkaReceiver MessageHandler errors so StreamingListeners can take action

2015-05-13 Thread Jeremy A. Lucas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7621?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeremy A. Lucas updated SPARK-7621: --- Affects Version/s: 1.3.0 Report KafkaReceiver MessageHandler errors so StreamingListeners

[jira] [Updated] (SPARK-7621) Report KafkaReceiver MessageHandler errors so StreamingListeners can take action

2015-05-13 Thread Jeremy A. Lucas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7621?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeremy A. Lucas updated SPARK-7621: --- Flags: Patch Report KafkaReceiver MessageHandler errors so StreamingListeners can take

[jira] [Comment Edited] (SPARK-7621) Report KafkaReceiver MessageHandler errors so StreamingListeners can take action

2015-05-13 Thread Jeremy A. Lucas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7621?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14542998#comment-14542998 ] Jeremy A. Lucas edited comment on SPARK-7621 at 5/14/15 12:48 AM:

[jira] [Resolved] (SPARK-7622) Test Jira

2015-05-13 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7622?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-7622. Resolution: Invalid Test Jira - Key: SPARK-7622

[jira] [Reopened] (SPARK-7622) Test Jira

2015-05-13 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7622?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell reopened SPARK-7622: Test Jira - Key: SPARK-7622 URL:

[jira] [Updated] (SPARK-7531) Install GPG on Jenkins machines

2015-05-13 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7531?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-7531: --- Fix Version/s: 1.4.0 Install GPG on Jenkins machines ---

[jira] [Updated] (SPARK-7621) Report KafkaReceiver MessageHandler errors so StreamingListeners can take action

2015-05-13 Thread Jeremy A. Lucas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7621?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeremy A. Lucas updated SPARK-7621: --- Attachment: SPARK-7621.patch Attached is a small patch to simply call {{reportError}} instead

[jira] [Updated] (SPARK-7622) Test Jira

2015-05-13 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7622?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-7622: --- Fix Version/s: (was: 1.6.0) Test Jira - Key: SPARK-7622

[jira] [Reopened] (SPARK-7622) Test Jira

2015-05-13 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7622?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell reopened SPARK-7622: Test Jira - Key: SPARK-7622 URL:

[jira] [Updated] (SPARK-6568) spark-shell.cmd --jars option does not accept the jar that has space in its path

2015-05-13 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6568?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-6568: --- Fix Version/s: 1.4.0 spark-shell.cmd --jars option does not accept the jar that has space in

[jira] [Reopened] (SPARK-7561) Install Junit Attachment Plugin on Jenkins

2015-05-13 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7561?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell reopened SPARK-7561: Install Junit Attachment Plugin on Jenkins --

[jira] [Updated] (SPARK-7526) Specify ip of RBackend, MonitorServer and RRDD Socket server

2015-05-13 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7526?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-7526: --- Fix Version/s: 1.4.0 Specify ip of RBackend, MonitorServer and RRDD Socket server

[jira] [Commented] (SPARK-7455) Perf test for LDA (EM/online)

2015-05-13 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7455?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14543065#comment-14543065 ] yuhao yang commented on SPARK-7455: --- I'll start to work on this. Any help or suggestion

[jira] [Commented] (SPARK-7511) PySpark ML seed Param should be random by default

2015-05-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7511?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14543113#comment-14543113 ] Apache Spark commented on SPARK-7511: - User 'holdenk' has created a pull request for

[jira] [Assigned] (SPARK-7511) PySpark ML seed Param should be random by default

2015-05-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7511?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7511: --- Assignee: (was: Apache Spark) PySpark ML seed Param should be random by default

[jira] [Assigned] (SPARK-7511) PySpark ML seed Param should be random by default

2015-05-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7511?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7511: --- Assignee: Apache Spark PySpark ML seed Param should be random by default

[jira] [Updated] (SPARK-7624) Task scheduler delay is increasing time over time in spark local mode

2015-05-13 Thread Jack Hu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7624?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jack Hu updated SPARK-7624: --- Description: I am running a simple spark streaming program with spark 1.3.1 in local mode, it receives json

[jira] [Created] (SPARK-7624) Task scheduler delay is increasing time over time in spark local mode

2015-05-13 Thread Jack Hu (JIRA)
Jack Hu created SPARK-7624: -- Summary: Task scheduler delay is increasing time over time in spark local mode Key: SPARK-7624 URL: https://issues.apache.org/jira/browse/SPARK-7624 Project: Spark

[jira] [Updated] (SPARK-7303) push down project if possible when the child is sort

2015-05-13 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7303?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-7303: --- Fix Version/s: 1.4.0 push down project if possible when the child is sort

[jira] [Updated] (SPARK-7601) Support Insert into JDBC Datasource

2015-05-13 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7601?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-7601: --- Fix Version/s: 1.4.0 Support Insert into JDBC Datasource

[jira] [Updated] (SPARK-7482) Rename some DataFrame API methods in SparkR to match their counterparts in Scala

2015-05-13 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7482?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-7482: --- Fix Version/s: 1.4.0 Rename some DataFrame API methods in SparkR to match their counterparts

[jira] [Created] (SPARK-7623) Spark prints SelectChannelConnector@0.0.0.0:4040: java.net.BindException: Address already in use java.net.BindException: Address already in use when run 2 spark in par

2015-05-13 Thread Yi Zhou (JIRA)
Yi Zhou created SPARK-7623: -- Summary: Spark prints SelectChannelConnector@0.0.0.0:4040: java.net.BindException: Address already in use java.net.BindException: Address already in use when run 2 spark in parallel Key: SPARK-7623

[jira] [Commented] (SPARK-7399) Master fails on 2.11 with compilation error

2015-05-13 Thread Tijo Thomas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7399?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14543124#comment-14543124 ] Tijo Thomas commented on SPARK-7399: Jira Resolution status changed from Fixed to

[jira] [Updated] (SPARK-7626) Actions on DataFrame created from HIVE table with newly added column throw NPE

2015-05-13 Thread Zhiyang Guo (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7626?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhiyang Guo updated SPARK-7626: --- Component/s: Spark Core Actions on DataFrame created from HIVE table with newly added column throw

[jira] [Created] (SPARK-7626) Actions on DataFrame created from HIVE table with newly added column throw NPE

2015-05-13 Thread Zhiyang Guo (JIRA)
Zhiyang Guo created SPARK-7626: -- Summary: Actions on DataFrame created from HIVE table with newly added column throw NPE Key: SPARK-7626 URL: https://issues.apache.org/jira/browse/SPARK-7626 Project:

[jira] [Commented] (SPARK-7183) Memory leak in netty shuffle with spark standalone cluster

2015-05-13 Thread Aaron Davidson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7183?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14543226#comment-14543226 ] Aaron Davidson commented on SPARK-7183: --- Sorry for delay, this should be fine to

[jira] [Updated] (SPARK-7620) Removed calling size, length in while condition to avoid extra JVM call

2015-05-13 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7620?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-7620: - Assignee: DB Tsai Removed calling size, length in while condition to avoid extra JVM call

[jira] [Resolved] (SPARK-7620) Removed calling size, length in while condition to avoid extra JVM call

2015-05-13 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7620?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-7620. -- Resolution: Fixed Fix Version/s: 1.4.0 Issue resolved by pull request 6137

[jira] [Comment Edited] (SPARK-7012) Add support for NOT NULL modifier for column definitions on DDLParser

2015-05-13 Thread Akhil Thatipamula (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7012?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14543208#comment-14543208 ] Akhil Thatipamula edited comment on SPARK-7012 at 5/14/15 5:18 AM:

[jira] [Commented] (SPARK-7012) Add support for NOT NULL modifier for column definitions on DDLParser

2015-05-13 Thread Akhil Thatipamula (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7012?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14543208#comment-14543208 ] Akhil Thatipamula commented on SPARK-7012: -- Can any one eloberate on this issue??

[jira] [Resolved] (SPARK-7589) Make Input Rate in the Streaming page consistent with other pages

2015-05-13 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7589?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das resolved SPARK-7589. -- Resolution: Fixed Fix Version/s: 1.4.0 Assignee: Shixiong Zhu Make Input Rate

[jira] [Assigned] (SPARK-7593) Python API for Bucketizer

2015-05-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7593?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7593: --- Assignee: Apache Spark (was: Burak Yavuz) Python API for Bucketizer

[jira] [Assigned] (SPARK-7593) Python API for Bucketizer

2015-05-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7593?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7593: --- Assignee: Burak Yavuz (was: Apache Spark) Python API for Bucketizer

[jira] [Created] (SPARK-7607) Spark SQL prog guide code error

2015-05-13 Thread Diana Carroll (JIRA)
Diana Carroll created SPARK-7607: Summary: Spark SQL prog guide code error Key: SPARK-7607 URL: https://issues.apache.org/jira/browse/SPARK-7607 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-7607) Spark SQL prog guide code error

2015-05-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7607?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14542250#comment-14542250 ] Sean Owen commented on SPARK-7607: -- Looks like a good catch, feel free to open a PR.

[jira] [Created] (SPARK-7608) Memory leak in RDDOperationGraphListener

2015-05-13 Thread Shixiong Zhu (JIRA)
Shixiong Zhu created SPARK-7608: --- Summary: Memory leak in RDDOperationGraphListener Key: SPARK-7608 URL: https://issues.apache.org/jira/browse/SPARK-7608 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-6613) Starting stream from checkpoint causes Streaming tab to throw error

2015-05-13 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6613?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14542258#comment-14542258 ] Tathagata Das commented on SPARK-6613: -- Could you try the current master branch of

[jira] [Commented] (SPARK-7608) Memory leak in RDDOperationGraphListener

2015-05-13 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7608?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14542267#comment-14542267 ] Shixiong Zhu commented on SPARK-7608: - [~andrewor14] could you take a look at this

[jira] [Created] (SPARK-7602) add lz4 decompressor for streamContext.fileStream

2015-05-13 Thread lisendong (JIRA)
lisendong created SPARK-7602: Summary: add lz4 decompressor for streamContext.fileStream Key: SPARK-7602 URL: https://issues.apache.org/jira/browse/SPARK-7602 Project: Spark Issue Type:

[jira] [Issue Comment Deleted] (SPARK-6261) Python MLlib API missing items: Feature

2015-05-13 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6261?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-6261: --- Comment: was deleted (was: [~josephkb] I think all the subtasks of this issue are solved. Could we

[jira] [Closed] (SPARK-7607) Spark SQL prog guide code error

2015-05-13 Thread Diana Carroll (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7607?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Diana Carroll closed SPARK-7607. Resolution: Duplicate Woops, duplicate of SPARK-6383 (already fixed) Spark SQL prog guide code

[jira] [Commented] (SPARK-7579) User guide update for OneHotEncoder

2015-05-13 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7579?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14542304#comment-14542304 ] Joseph K. Bradley commented on SPARK-7579: -- Yep, that should be sufficient for

[jira] [Resolved] (SPARK-7567) Migrating Parquet data source to FSBasedRelation

2015-05-13 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7567?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-7567. - Resolution: Fixed Migrating Parquet data source to FSBasedRelation

[jira] [Commented] (SPARK-6819) Support nested types in SparkR DataFrame

2015-05-13 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6819?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14542301#comment-14542301 ] Shivaram Venkataraman commented on SPARK-6819: -- Moving this to 1.5 as this is

<    1   2   3   >