[jira] [Updated] (SPARK-3623) Graph should support the checkpoint operation

2014-12-06 Thread Ankur Dave (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3623?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ankur Dave updated SPARK-3623: -- Assignee: Guoqiang Li Graph should support the checkpoint operation

[jira] [Commented] (SPARK-4659) Implement K-core decomposition algorithm

2014-12-06 Thread Ankur Dave (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4659?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14236695#comment-14236695 ] Ankur Dave commented on SPARK-4659: --- Great! Submit it it as a PR when you get the

[jira] [Commented] (SPARK-4659) Implement K-core decomposition algorithm

2014-12-06 Thread Xiaoming Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4659?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14236711#comment-14236711 ] Xiaoming Li commented on SPARK-4659: Thanks for your reply.I saw your code, and I’ll

[jira] [Created] (SPARK-4776) Spark IO Messages are difficult to Debug

2014-12-06 Thread Kevin Mader (JIRA)
Kevin Mader created SPARK-4776: -- Summary: Spark IO Messages are difficult to Debug Key: SPARK-4776 URL: https://issues.apache.org/jira/browse/SPARK-4776 Project: Spark Issue Type: Improvement

[jira] [Created] (SPARK-4777) Some block memory after unrollSafely not count into used memory(memoryStore.entrys or unrollMemory)

2014-12-06 Thread SuYan (JIRA)
SuYan created SPARK-4777: Summary: Some block memory after unrollSafely not count into used memory(memoryStore.entrys or unrollMemory) Key: SPARK-4777 URL: https://issues.apache.org/jira/browse/SPARK-4777

[jira] [Commented] (SPARK-4777) Some block memory after unrollSafely not count into used memory(memoryStore.entrys or unrollMemory)

2014-12-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4777?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14236765#comment-14236765 ] Apache Spark commented on SPARK-4777: - User 'suyanNone' has created a pull request for

[jira] [Commented] (SPARK-4740) Netty's network throughput is about 1/2 of NIO's in spark-perf sortByKey

2014-12-06 Thread Saisai Shao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4740?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14236777#comment-14236777 ] Saisai Shao commented on SPARK-4740: Hi Reynold, I just tested your patch with

[jira] [Commented] (SPARK-4740) Netty's network throughput is about 1/2 of NIO's in spark-perf sortByKey

2014-12-06 Thread Zhang, Liye (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4740?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14236895#comment-14236895 ] Zhang, Liye commented on SPARK-4740: Hi [~rxin], on my 4 node cluster, I just tested

[jira] [Created] (SPARK-4778) PySpark Json and groupByKey broken

2014-12-06 Thread Brad Willard (JIRA)
Brad Willard created SPARK-4778: --- Summary: PySpark Json and groupByKey broken Key: SPARK-4778 URL: https://issues.apache.org/jira/browse/SPARK-4778 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-4740) Netty's network throughput is about 1/2 of NIO's in spark-perf sortByKey

2014-12-06 Thread Aaron Davidson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4740?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14236898#comment-14236898 ] Aaron Davidson commented on SPARK-4740: --- Thanks for testing out the patch. Could we

[jira] [Commented] (SPARK-4063) Add the ability to send messages to Kafka in the stream

2014-12-06 Thread Helena Edelson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4063?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14236899#comment-14236899 ] Helena Edelson commented on SPARK-4063: --- Obviously. But since I created the ticket

[jira] [Closed] (SPARK-4063) Add the ability to send messages to Kafka in the stream

2014-12-06 Thread Helena Edelson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4063?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Helena Edelson closed SPARK-4063. - Resolution: Unresolved A PR has been submitted since I created the ticket Add the ability to

[jira] [Commented] (SPARK-4775) Possible problem in a simple join? Getting duplicate rows and missing rows

2014-12-06 Thread Stephen Boesch (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4775?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14236919#comment-14236919 ] Stephen Boesch commented on SPARK-4775: --- Two small tweaks to the testing class have

[jira] [Created] (SPARK-4779) PySpark Shuffle Fails Looking for Files that Don't Exist when low on Memory

2014-12-06 Thread Brad Willard (JIRA)
Brad Willard created SPARK-4779: --- Summary: PySpark Shuffle Fails Looking for Files that Don't Exist when low on Memory Key: SPARK-4779 URL: https://issues.apache.org/jira/browse/SPARK-4779 Project:

[jira] [Commented] (SPARK-4740) Netty's network throughput is about 1/2 of NIO's in spark-perf sortByKey

2014-12-06 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4740?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14236999#comment-14236999 ] Reynold Xin commented on SPARK-4740: BTW guys, would it be possible for us to ssh onto

[jira] [Updated] (SPARK-4740) Netty's network throughput is about 1/2 of NIO's in spark-perf sortByKey

2014-12-06 Thread Zhang, Liye (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4740?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhang, Liye updated SPARK-4740: --- Attachment: (rxin patch normal executor)TestRunner sort-by-key - Thread dump for executor 0

[jira] [Updated] (SPARK-4740) Netty's network throughput is about 1/2 of NIO's in spark-perf sortByKey

2014-12-06 Thread Zhang, Liye (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4740?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhang, Liye updated SPARK-4740: --- Attachment: (rxin patch better executor)TestRunner sort-by-key - Thread dump for executor

[jira] [Commented] (SPARK-4740) Netty's network throughput is about 1/2 of NIO's in spark-perf sortByKey

2014-12-06 Thread Zhang, Liye (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4740?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14237015#comment-14237015 ] Zhang, Liye commented on SPARK-4740: Hi [~adav], I attached the jstack info for both

[jira] [Commented] (SPARK-4740) Netty's network throughput is about 1/2 of NIO's in spark-perf sortByKey

2014-12-06 Thread Aaron Davidson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4740?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14237018#comment-14237018 ] Aaron Davidson commented on SPARK-4740: --- Very interesting -- good executor is using

[jira] [Commented] (SPARK-4740) Netty's network throughput is about 1/2 of NIO's in spark-perf sortByKey

2014-12-06 Thread Zhang, Liye (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4740?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14237019#comment-14237019 ] Zhang, Liye commented on SPARK-4740: Hi [~adav],[~rxin] for the better executor, in

[jira] [Commented] (SPARK-4740) Netty's network throughput is about 1/2 of NIO's in spark-perf sortByKey

2014-12-06 Thread Aaron Davidson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4740?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14237026#comment-14237026 ] Aaron Davidson commented on SPARK-4740: --- Could we get logs from the good/bad

[jira] [Commented] (SPARK-4607) Add random seed to GradientBoostedTrees

2014-12-06 Thread Kai Sasaki (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4607?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14237029#comment-14237029 ] Kai Sasaki commented on SPARK-4607: --- [~josephkb] I think each trees in iterations of

[jira] [Comment Edited] (SPARK-4607) Add random seed to GradientBoostedTrees

2014-12-06 Thread Kai Sasaki (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4607?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14237029#comment-14237029 ] Kai Sasaki edited comment on SPARK-4607 at 12/7/14 2:34 AM:

[jira] [Commented] (SPARK-4740) Netty's network throughput is about 1/2 of NIO's in spark-perf sortByKey

2014-12-06 Thread Aaron Davidson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4740?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14237034#comment-14237034 ] Aaron Davidson commented on SPARK-4740: --- Do you have speculation enabled, by the

[jira] [Created] (SPARK-4780) Support executing multiple statements in sql(...)

2014-12-06 Thread Jianshi Huang (JIRA)
Jianshi Huang created SPARK-4780: Summary: Support executing multiple statements in sql(...) Key: SPARK-4780 URL: https://issues.apache.org/jira/browse/SPARK-4780 Project: Spark Issue Type:

[jira] [Commented] (SPARK-4769) CTAS does not work when reading from temporary tables

2014-12-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4769?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14237041#comment-14237041 ] Apache Spark commented on SPARK-4769: - User 'chenghao-intel' has created a pull

[jira] [Created] (SPARK-4781) Column values become all NULL after doing ALTER TABLE CHANGE for renaming column names (Parquet external table in HiveContext)

2014-12-06 Thread Jianshi Huang (JIRA)
Jianshi Huang created SPARK-4781: Summary: Column values become all NULL after doing ALTER TABLE CHANGE for renaming column names (Parquet external table in HiveContext) Key: SPARK-4781 URL:

[jira] [Updated] (SPARK-4781) Column values become all NULL after doing ALTER TABLE CHANGE for renaming column names (Parquet external table in HiveContext)

2014-12-06 Thread Jianshi Huang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4781?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jianshi Huang updated SPARK-4781: - Description: I have a table say created like follows: CREATE EXTERNAL TABLE pmt (

[jira] [Updated] (SPARK-4781) Column values become all NULL after doing ALTER TABLE CHANGE for renaming column names (Parquet external table in HiveContext)

2014-12-06 Thread Jianshi Huang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4781?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jianshi Huang updated SPARK-4781: - Description: I have a table say created like follows: {code} CREATE EXTERNAL TABLE pmt (

[jira] [Commented] (SPARK-4740) Netty's network throughput is about 1/2 of NIO's in spark-perf sortByKey

2014-12-06 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4740?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14237075#comment-14237075 ] Patrick Wendell commented on SPARK-4740: [~terrymanu] - is it possible that

[jira] [Comment Edited] (SPARK-4740) Netty's network throughput is about 1/2 of NIO's in spark-perf sortByKey

2014-12-06 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4740?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14237075#comment-14237075 ] Patrick Wendell edited comment on SPARK-4740 at 12/7/14 6:39 AM:

[jira] [Created] (SPARK-4782) Add inferSchema support for RDD[Map[String, Any]]

2014-12-06 Thread Jianshi Huang (JIRA)
Jianshi Huang created SPARK-4782: Summary: Add inferSchema support for RDD[Map[String, Any]] Key: SPARK-4782 URL: https://issues.apache.org/jira/browse/SPARK-4782 Project: Spark Issue Type:

[jira] [Commented] (SPARK-4607) Add random seed to GradientBoostedTrees

2014-12-06 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4607?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14237086#comment-14237086 ] Joseph K. Bradley commented on SPARK-4607: -- [~lewuathe] Good point, that might be