[GitHub] spark issue #18840: [SPARK-21565] Propagate metadata in attribute replacemen...

2017-08-07 Thread zsxwing
Github user zsxwing commented on the issue: https://github.com/apache/spark/pull/18840 @joseph-torres could you change the PR title to "[SPARK-21565]**[SS]** Propagate metadata in attribute replacement"? We usually put the module name in the PR title.

[GitHub] spark issue #18840: [SPARK-21565] Propagate metadata in attribute replacemen...

2017-08-07 Thread zsxwing
Github user zsxwing commented on the issue: https://github.com/apache/spark/pull/18840 LGTM pending tests. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so

[GitHub] spark pull request #18848: [SPARK-21374][CORE] Fix reading globbed paths fro...

2017-08-04 Thread zsxwing
Github user zsxwing commented on a diff in the pull request: https://github.com/apache/spark/pull/18848#discussion_r131486596 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/DataSource.scala --- @@ -638,4 +625,28 @@ object DataSource extends Logging

[GitHub] spark issue #18848: [SPARK-21374][CORE] Fix reading globbed paths from S3 in...

2017-08-04 Thread zsxwing
Github user zsxwing commented on the issue: https://github.com/apache/spark/pull/18848 cc @gatorsmile

[GitHub] spark pull request #18848: [SPARK-21374][CORE] Fix reading globbed paths fro...

2017-08-04 Thread zsxwing
GitHub user zsxwing opened a pull request: https://github.com/apache/spark/pull/18848 [SPARK-21374][CORE] Fix reading globbed paths from S3 into DF with disabled FS cache ## What changes were proposed in this pull request? This PR replaces #18623 to do some clean up

[GitHub] spark pull request #18840: [SPARK-21565] Propagate metadata in attribute rep...

2017-08-03 Thread zsxwing
Github user zsxwing commented on a diff in the pull request: https://github.com/apache/spark/pull/18840#discussion_r131309601 --- Diff: sql/core/src/test/scala/org/apache/spark/sql/streaming/EventTimeWatermarkSuite.scala --- @@ -391,6 +391,30 @@ class EventTimeWatermarkSuite

[GitHub] spark issue #18840: [SPARK-21565] Propagate metadata in attribute replacemen...

2017-08-03 Thread zsxwing
Github user zsxwing commented on the issue: https://github.com/apache/spark/pull/18840 ok to test

[GitHub] spark pull request #18623: [SPARK-21374][CORE] Fix reading globbed paths fro...

2017-08-03 Thread zsxwing
Github user zsxwing commented on a diff in the pull request: https://github.com/apache/spark/pull/18623#discussion_r131283705 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/DataSource.scala --- @@ -132,7 +132,7 @@ case class DataSource

[GitHub] spark issue #18822: [SPARK-21546][SS] dropDuplicates should ignore watermark...

2017-08-02 Thread zsxwing
Github user zsxwing commented on the issue: https://github.com/apache/spark/pull/18822 Thanks! Merging to master and 2.2.

[GitHub] spark pull request #18822: [SPARK-21546] dropDuplicates should ignore waterm...

2017-08-02 Thread zsxwing
GitHub user zsxwing opened a pull request: https://github.com/apache/spark/pull/18822 [SPARK-21546] dropDuplicates should ignore watermark when it's not a key ## What changes were proposed in this pull request? When the watermark is not a column of `dropDuplicates`, right

[GitHub] spark issue #18803: [SPARK-21597][SS]Fix a potential overflow issue in Event...

2017-08-02 Thread zsxwing
Github user zsxwing commented on the issue: https://github.com/apache/spark/pull/18803 Thanks! Merging to master and 2.2.

[GitHub] spark issue #18803: [SPARK-21597][SS]Fix a potential overflow issue in Event...

2017-08-01 Thread zsxwing
Github user zsxwing commented on the issue: https://github.com/apache/spark/pull/18803 retest this please

[GitHub] spark issue #18796: [CORE] [MINOR] Improve the error message of checkpoint R...

2017-08-01 Thread zsxwing
Github user zsxwing commented on the issue: https://github.com/apache/spark/pull/18796 Thanks! Merging to master

[GitHub] spark issue #18796: [CORE] [MINOR] Improve the error message of checkpoint R...

2017-08-01 Thread zsxwing
Github user zsxwing commented on the issue: https://github.com/apache/spark/pull/18796 LGTM

[GitHub] spark pull request #18803: [SPARK-21597][SS]Fix a potential overflow issue i...

2017-08-01 Thread zsxwing
GitHub user zsxwing opened a pull request: https://github.com/apache/spark/pull/18803 [SPARK-21597][SS]Fix a potential overflow issue in EventTimeStats ## What changes were proposed in this pull request? This PR fixed a potential overflow issue in EventTimeStats
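
The snippet above is cut off before it describes the fix, so the following is only a sketch of one common way to avoid this kind of overflow, not necessarily what the PR does. The assumption is that the overflow risk comes from summing many millisecond timestamps in a 64-bit long; keeping an incremental mean avoids holding a running sum at all. `RunningTimeStats`, `summarize`, and the field names are hypothetical and are not Spark's API.

```python
from dataclasses import dataclass

INT64_MAX = 2**63 - 1  # largest value a signed 64-bit long can hold
INT64_MIN = -(2**63)


@dataclass
class RunningTimeStats:
    """Hypothetical accumulator; not Spark's actual EventTimeStats class."""

    max_ms: int = INT64_MIN
    min_ms: int = INT64_MAX
    avg_ms: float = 0.0
    count: int = 0

    def add(self, event_time_ms: int) -> None:
        """Fold one timestamp into the stats without keeping a running sum."""
        self.max_ms = max(self.max_ms, event_time_ms)
        self.min_ms = min(self.min_ms, event_time_ms)
        self.count += 1
        # Incremental mean: move the average toward the new value by 1/count.
        self.avg_ms += (event_time_ms - self.avg_ms) / self.count


def summarize(event_times_ms):
    """Build stats for an iterable of millisecond timestamps."""
    stats = RunningTimeStats()
    for t in event_times_ms:
        stats.add(t)
    return stats
```

Calling `summarize` on a list of timestamps returns the minimum, maximum, mean, and count. The mean is a float, so it trades exactness for range; whether that trade is acceptable depends on how the statistic is used.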

[GitHub] spark pull request #18799: [SPARK-21596][SS]Ensure places calling HDFSMetada...

2017-08-01 Thread zsxwing
GitHub user zsxwing opened a pull request: https://github.com/apache/spark/pull/18799 [SPARK-21596][SS]Ensure places calling HDFSMetadataLog.get check the return value ## What changes were proposed in this pull request? When I was investigating a flaky test, I realized

[GitHub] spark pull request #18789: SPARK-20433 Bump jackson from 2.6.5 to 2.6.7.1

2017-08-01 Thread zsxwing
Github user zsxwing commented on a diff in the pull request: https://github.com/apache/spark/pull/18789#discussion_r130686972 --- Diff: sql/core/pom.xml --- @@ -101,7 +101,7 @@ com.fasterxml.jackson.core jackson-databind

[GitHub] spark issue #18790: [SPARK-21587][SS] Added pushdown through watermarks.

2017-07-31 Thread zsxwing
Github user zsxwing commented on the issue: https://github.com/apache/spark/pull/18790 ok to test

[GitHub] spark issue #18723: [SPARK-21517][CORE] Avoid copying memory when transfer c...

2017-07-25 Thread zsxwing
Github user zsxwing commented on the issue: https://github.com/apache/spark/pull/18723 Thanks! Merging to master.

[GitHub] spark issue #18723: [SPARK-21517][CORE] Avoid copying memory when transfer c...

2017-07-25 Thread zsxwing
Github user zsxwing commented on the issue: https://github.com/apache/spark/pull/18723 LGTM

[GitHub] spark issue #18723: [SPARK-21517][CORE] Avoid copying memory when transfer c...

2017-07-25 Thread zsxwing
Github user zsxwing commented on the issue: https://github.com/apache/spark/pull/18723 ok to test

[GitHub] spark issue #18388: [SPARK-21175] Reject OpenBlocks when memory shortage on ...

2017-07-22 Thread zsxwing
Github user zsxwing commented on the issue: https://github.com/apache/spark/pull/18388 @jinxing64 Sorry, I forgot to mention one request. Could you add a unit test? Right now it's disabled so the new code is not tested. It will help avoid some obvious mistakes, such as the missing

[GitHub] spark pull request #18388: [SPARK-21175] Reject OpenBlocks when memory short...

2017-07-21 Thread zsxwing
Github user zsxwing commented on a diff in the pull request: https://github.com/apache/spark/pull/18388#discussion_r128882147 --- Diff: common/network-common/src/main/java/org/apache/spark/network/server/TransportRequestHandler.java --- @@ -130,11 +143,25 @@ private void

[GitHub] spark pull request #18388: [SPARK-21175] Reject OpenBlocks when memory short...

2017-07-21 Thread zsxwing
Github user zsxwing commented on a diff in the pull request: https://github.com/apache/spark/pull/18388#discussion_r128881079 --- Diff: common/network-common/src/main/java/org/apache/spark/network/server/TransportRequestHandler.java --- @@ -130,11 +143,25 @@ private void

[GitHub] spark pull request #18388: [SPARK-21175] Reject OpenBlocks when memory short...

2017-07-21 Thread zsxwing
Github user zsxwing commented on a diff in the pull request: https://github.com/apache/spark/pull/18388#discussion_r128879751 --- Diff: common/network-common/src/main/java/org/apache/spark/network/server/TransportRequestHandler.java --- @@ -130,11 +143,25 @@ private void

[GitHub] spark pull request #18388: [SPARK-21175] Reject OpenBlocks when memory short...

2017-07-21 Thread zsxwing
Github user zsxwing commented on a diff in the pull request: https://github.com/apache/spark/pull/18388#discussion_r128879607 --- Diff: common/network-common/src/main/java/org/apache/spark/network/server/OneForOneStreamManager.java --- @@ -96,18 +103,23 @@ public ManagedBuffer

[GitHub] spark pull request #18388: [SPARK-21175] Reject OpenBlocks when memory short...

2017-07-21 Thread zsxwing
Github user zsxwing commented on a diff in the pull request: https://github.com/apache/spark/pull/18388#discussion_r128879502 --- Diff: common/network-common/src/main/java/org/apache/spark/network/server/OneForOneStreamManager.java --- @@ -122,6 +134,7 @@ public void

[GitHub] spark pull request #18388: [SPARK-21175] Reject OpenBlocks when memory short...

2017-07-21 Thread zsxwing
Github user zsxwing commented on a diff in the pull request: https://github.com/apache/spark/pull/18388#discussion_r128879315 --- Diff: common/network-common/src/main/java/org/apache/spark/network/server/OneForOneStreamManager.java --- @@ -53,9 +56,13

[GitHub] spark pull request #18388: [SPARK-21175] Reject OpenBlocks when memory short...

2017-07-21 Thread zsxwing
Github user zsxwing commented on a diff in the pull request: https://github.com/apache/spark/pull/18388#discussion_r128879045 --- Diff: docs/configuration.md --- @@ -1809,6 +1809,14 @@ Apart from these, the following properties are also available, and may be useful

[GitHub] spark pull request #18388: [SPARK-21175] Reject OpenBlocks when memory short...

2017-07-21 Thread zsxwing
Github user zsxwing commented on a diff in the pull request: https://github.com/apache/spark/pull/18388#discussion_r128878875 --- Diff: docs/configuration.md --- @@ -1809,6 +1809,14 @@ Apart from these, the following properties are also available, and may be useful

[GitHub] spark pull request #18388: [SPARK-21175] Reject OpenBlocks when memory short...

2017-07-21 Thread zsxwing
Github user zsxwing commented on a diff in the pull request: https://github.com/apache/spark/pull/18388#discussion_r128878596 --- Diff: common/network-common/src/main/java/org/apache/spark/network/server/TransportRequestHandler.java --- @@ -130,11 +143,25 @@ private void

[GitHub] spark pull request #18388: [SPARK-21175] Reject OpenBlocks when memory short...

2017-07-21 Thread zsxwing
Github user zsxwing commented on a diff in the pull request: https://github.com/apache/spark/pull/18388#discussion_r128878556 --- Diff: common/network-common/src/main/java/org/apache/spark/network/server/TransportRequestHandler.java --- @@ -118,6 +124,13 @@ private void

[GitHub] spark pull request #18388: [SPARK-21175] Reject OpenBlocks when memory short...

2017-07-21 Thread zsxwing
Github user zsxwing commented on a diff in the pull request: https://github.com/apache/spark/pull/18388#discussion_r128878335 --- Diff: common/network-common/src/main/java/org/apache/spark/network/util/TransportConf.java --- @@ -257,4 +257,7 @@ public Properties cryptoConf

[GitHub] spark pull request #18707: [SPARK-21503][UI]: Spark UI shows incorrect task ...

2017-07-21 Thread zsxwing
Github user zsxwing commented on a diff in the pull request: https://github.com/apache/spark/pull/18707#discussion_r128876674 --- Diff: core/src/main/scala/org/apache/spark/ui/exec/ExecutorsTab.scala --- @@ -140,6 +140,8 @@ class ExecutorsListener(storageStatusListener

[GitHub] spark pull request #18388: [SPARK-21175] Reject OpenBlocks when memory short...

2017-07-20 Thread zsxwing
Github user zsxwing commented on a diff in the pull request: https://github.com/apache/spark/pull/18388#discussion_r128688973 --- Diff: common/network-common/src/main/java/org/apache/spark/network/util/TransportConf.java --- @@ -257,4 +257,11 @@ public Properties cryptoConf

[GitHub] spark pull request #18642: [MINOR][REFACTORING] KeyValueGroupedDataset.mapGr...

2017-07-20 Thread zsxwing
Github user zsxwing commented on a diff in the pull request: https://github.com/apache/spark/pull/18642#discussion_r128632530 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/KeyValueGroupedDataset.scala --- @@ -242,16 +242,7 @@ class KeyValueGroupedDataset[K, V] private[sql

[GitHub] spark issue #18676: [SPARK-21463] Allow userSpecifiedSchema to override part...

2017-07-19 Thread zsxwing
Github user zsxwing commented on the issue: https://github.com/apache/spark/pull/18676 LGTM

[GitHub] spark issue #18623: [SPARK-21374][CORE] Fix reading globbed paths from S3 in...

2017-07-19 Thread zsxwing
Github user zsxwing commented on the issue: https://github.com/apache/spark/pull/18623 ok to test

[GitHub] spark issue #18670: [SPARK-21455][CORE]RpcFailure should be call on RpcRespo...

2017-07-18 Thread zsxwing
Github user zsxwing commented on the issue: https://github.com/apache/spark/pull/18670 @ConeyLiu the network layer doesn't know how to serialize Throwable, or in other words, it cannot use JavaSerializer in Spark core.

[GitHub] spark pull request #18675: [SPARK-21462][SS] Added batchId to StreamingQuery...

2017-07-18 Thread zsxwing
Github user zsxwing commented on a diff in the pull request: https://github.com/apache/spark/pull/18675#discussion_r128091870 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/streaming/statefulOperators.scala --- @@ -79,6 +79,7 @@ trait StateStoreWriter extends

[GitHub] spark issue #18639: [SPARK-21408][core] Better default number of RPC dispatc...

2017-07-18 Thread zsxwing
Github user zsxwing commented on the issue: https://github.com/apache/spark/pull/18639 LGTM

[GitHub] spark issue #18670: [SPARK-21455][CORE]RpcFailure should be call on RpcRespo...

2017-07-18 Thread zsxwing
Github user zsxwing commented on the issue: https://github.com/apache/spark/pull/18670 As @vanzin pointed out, this is done on purpose. RpcResponseCallback and RpcCallContext are in different modules. RpcResponseCallback is a low-level API, and RpcCallContext is on top

[GitHub] spark issue #18660: [SPARK-21445] Make IntWrapper and LongWrapper in UTF8Str...

2017-07-17 Thread zsxwing
Github user zsxwing commented on the issue: https://github.com/apache/spark/pull/18660 > I don't think we're actually trying to ship these values anywhere. I see. They are static classes.

[GitHub] spark issue #18661: [SPARK-21409][SS] Follow up PR to allow different types ...

2017-07-17 Thread zsxwing
Github user zsxwing commented on the issue: https://github.com/apache/spark/pull/18661 LGTM pending tests

[GitHub] spark issue #18660: [SPARK-21445] Make IntWrapper and LongWrapper in UTF8Str...

2017-07-17 Thread zsxwing
Github user zsxwing commented on the issue: https://github.com/apache/spark/pull/18660 Is it safe to just ignore them? Maybe we should recover them in the `readExternal/read` method?

[GitHub] spark issue #18629: [SPARK-21409][SS] Expose state store memory usage in SQL...

2017-07-17 Thread zsxwing
Github user zsxwing commented on the issue: https://github.com/apache/spark/pull/18629 LGTM. Merging to master.

[GitHub] spark issue #18638: [SPARK-21421][SS]Add the query id as a local property to...

2017-07-14 Thread zsxwing
Github user zsxwing commented on the issue: https://github.com/apache/spark/pull/18638 Thanks! Merging to master.

[GitHub] spark issue #18638: [SPARK-21421][SS]Add the query id as a local property to...

2017-07-14 Thread zsxwing
Github user zsxwing commented on the issue: https://github.com/apache/spark/pull/18638 cc @marmbrus

[GitHub] spark pull request #18638: [SPARK-21421][SS]Add the query id as a local prop...

2017-07-14 Thread zsxwing
GitHub user zsxwing opened a pull request: https://github.com/apache/spark/pull/18638 [SPARK-21421][SS]Add the query id as a local property to allow source and sink using it ## What changes were proposed in this pull request? Add the query id as a local property to allow

[GitHub] spark pull request #18629: [SPARK-21409][SS] Expose state store memory usage...

2017-07-14 Thread zsxwing
Github user zsxwing commented on a diff in the pull request: https://github.com/apache/spark/pull/18629#discussion_r127519438 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/streaming/state/HDFSBackedStateStoreProvider.scala --- @@ -172,7 +172,9 @@ private[state

[GitHub] spark pull request #18388: [SPARK-21175][WIP] Reject OpenBlocks when memory ...

2017-07-12 Thread zsxwing
Github user zsxwing commented on a diff in the pull request: https://github.com/apache/spark/pull/18388#discussion_r127131801 --- Diff: common/network-common/src/main/java/org/apache/spark/network/util/TransportConf.java --- @@ -257,4 +257,31 @@ public Properties cryptoConf

[GitHub] spark pull request #18388: [SPARK-21175][WIP] Reject OpenBlocks when memory ...

2017-07-12 Thread zsxwing
Github user zsxwing commented on a diff in the pull request: https://github.com/apache/spark/pull/18388#discussion_r127131671 --- Diff: common/network-common/src/main/java/org/apache/spark/network/util/PooledByteBufAllocatorWithMetrics.java --- @@ -0,0 +1,70

[GitHub] spark issue #18357: [SPARK-21146] [CORE] Master/Worker should handle and shu...

2017-07-12 Thread zsxwing
Github user zsxwing commented on the issue: https://github.com/apache/spark/pull/18357 Thanks! Merging to master.

[GitHub] spark issue #18143: [SPARK-20919][SS] Simplificaiton of CachedKafkaConsumer ...

2017-07-11 Thread zsxwing
Github user zsxwing commented on the issue: https://github.com/apache/spark/pull/18143 @ScrapCodes I think it should be bounded by `spark.sql.kafkaConsumerCache.capacity`.

[GitHub] spark issue #18357: [SPARK-21146] [CORE] Master/Worker should handle and shu...

2017-07-11 Thread zsxwing
Github user zsxwing commented on the issue: https://github.com/apache/spark/pull/18357 retest this please

[GitHub] spark issue #18357: [SPARK-21146] [CORE] Master/Worker should handle and shu...

2017-07-11 Thread zsxwing
Github user zsxwing commented on the issue: https://github.com/apache/spark/pull/18357 @devaraj-kavali Sorry, I forgot about this PR. I will trigger a new run as master has been updated a lot. Will set a reminder for me to merge this PR :)

[GitHub] spark issue #18357: [SPARK-21146] [CORE] Master/Worker should handle and shu...

2017-07-11 Thread zsxwing
Github user zsxwing commented on the issue: https://github.com/apache/spark/pull/18357 retest this please

[GitHub] spark pull request #14718: [SPARK-16711] YarnShuffleService doesn't re-init ...

2017-07-11 Thread zsxwing
Github user zsxwing commented on a diff in the pull request: https://github.com/apache/spark/pull/14718#discussion_r126761513 --- Diff: common/network-common/pom.xml --- @@ -45,6 +45,22 @@ commons-lang3 + + org.fusesource.leveldbjni

[GitHub] spark pull request #14718: [SPARK-16711] YarnShuffleService doesn't re-init ...

2017-07-11 Thread zsxwing
Github user zsxwing commented on a diff in the pull request: https://github.com/apache/spark/pull/14718#discussion_r126761181 --- Diff: common/network-common/pom.xml --- @@ -45,6 +45,22 @@ commons-lang3 + + org.fusesource.leveldbjni

[GitHub] spark issue #18593: [SPARK-21369][Core]Don't use Scala Tuple2 in common/netw...

2017-07-10 Thread zsxwing
Github user zsxwing commented on the issue: https://github.com/apache/spark/pull/18593 @vanzin it doesn't work. I added scala-library into the test scope and changed `org.apache.spark:spark-tags` back to the compile scope, but the build didn't fail.

[GitHub] spark issue #18593: [SPARK-21369][Core]Don't use Scala Tuple2 in common/netw...

2017-07-10 Thread zsxwing
Github user zsxwing commented on the issue: https://github.com/apache/spark/pull/18593 cc @cloud-fan @jinxing64

[GitHub] spark pull request #18593: [SPARK-21369][Core]Don't use Scala Tuple2 in comm...

2017-07-10 Thread zsxwing
Github user zsxwing commented on a diff in the pull request: https://github.com/apache/spark/pull/18593#discussion_r126532474 --- Diff: common/network-common/src/main/java/org/apache/spark/network/server/OneForOneStreamManager.java --- @@ -98,21 +96,16 @@ public ManagedBuffer

[GitHub] spark pull request #18593: [SPARK-21369][Core]Don't use Scala Tuple2 in comm...

2017-07-10 Thread zsxwing
Github user zsxwing commented on a diff in the pull request: https://github.com/apache/spark/pull/18593#discussion_r126532371 --- Diff: common/network-common/pom.xml --- @@ -90,7 +90,8 @@ org.apache.spark spark-tags_${scala.binary.version

[GitHub] spark pull request #18593: [SPARK-21369][Core]Don't use Scala Tuple in commo...

2017-07-10 Thread zsxwing
GitHub user zsxwing opened a pull request: https://github.com/apache/spark/pull/18593 [SPARK-21369][Core]Don't use Scala Tuple in common/network-* ## What changes were proposed in this pull request? Remove all usages of Scala Tuple from common/network-* projects

[GitHub] spark pull request #14718: [SPARK-16711] YarnShuffleService doesn't re-init ...

2017-07-10 Thread zsxwing
Github user zsxwing commented on a diff in the pull request: https://github.com/apache/spark/pull/14718#discussion_r126521226 --- Diff: common/network-common/pom.xml --- @@ -45,6 +45,22 @@ commons-lang3 + + org.fusesource.leveldbjni

[GitHub] spark issue #18565: [SPARK-21342] Fix DownloadCallback to work well with Ret...

2017-07-09 Thread zsxwing
Github user zsxwing commented on the issue: https://github.com/apache/spark/pull/18565 @jinxing64 I submitted https://github.com/jinxing64/spark/pull/1 to your repo to fix a potential file leak. Otherwise, this looks good to me.

[GitHub] spark pull request #18565: [SPARK-21342] Fix DownloadCallback to work well w...

2017-07-08 Thread zsxwing
Github user zsxwing commented on a diff in the pull request: https://github.com/apache/spark/pull/18565#discussion_r126291505 --- Diff: core/src/main/scala/org/apache/spark/network/netty/NettyBlockTransferService.scala --- @@ -53,6 +53,7 @@ private[spark] class

[GitHub] spark issue #18562: [SPARK-21069][SS][DOCS] Add rate source to programming g...

2017-07-08 Thread zsxwing
Github user zsxwing commented on the issue: https://github.com/apache/spark/pull/18562 Thanks! LGTM. Merging to master and 2.2.

[GitHub] spark pull request #18562: [SPARK-21069][SS][DOCS] Add rate source to progra...

2017-07-07 Thread zsxwing
Github user zsxwing commented on a diff in the pull request: https://github.com/apache/spark/pull/18562#discussion_r126264346 --- Diff: docs/structured-streaming-programming-guide.md --- @@ -547,6 +549,19 @@ Here are the details of all the sources in Spark

[GitHub] spark issue #18388: [SPARK-21175] Reject OpenBlocks when memory shortage on ...

2017-07-07 Thread zsxwing
Github user zsxwing commented on the issue: https://github.com/apache/spark/pull/18388 @jinxing64 Since [ExternalShuffleService](https://github.com/apache/spark/blob/a0fe32a219253f0abe9d67cf178c73daf5f6fcc1/core/src/main/scala/org/apache/spark/deploy/ExternalShuffleService.scala#L55

[GitHub] spark pull request #18565: [SPARK-21342] Fix DownloadCallback to work well w...

2017-07-07 Thread zsxwing
Github user zsxwing commented on a diff in the pull request: https://github.com/apache/spark/pull/18565#discussion_r126242905 --- Diff: common/network-shuffle/src/main/java/org/apache/spark/network/shuffle/OneForOneBlockFetcher.java --- @@ -151,15 +152,27 @@ private void

[GitHub] spark pull request #18565: [SPARK-21342] Fix DownloadCallback to work well w...

2017-07-07 Thread zsxwing
Github user zsxwing commented on a diff in the pull request: https://github.com/apache/spark/pull/18565#discussion_r126242576 --- Diff: common/network-shuffle/src/main/java/org/apache/spark/network/shuffle/OneForOneBlockFetcher.java --- @@ -151,15 +152,27 @@ private void

[GitHub] spark pull request #18565: [SPARK-21342] Fix DownloadCallback to work well w...

2017-07-07 Thread zsxwing
Github user zsxwing commented on a diff in the pull request: https://github.com/apache/spark/pull/18565#discussion_r126259778 --- Diff: common/network-shuffle/src/main/java/org/apache/spark/network/shuffle/OneForOneBlockFetcher.java --- @@ -151,15 +152,27 @@ private void

[GitHub] spark issue #18566: Refine the document for spark.reducer.maxReqSizeShuffleT...

2017-07-07 Thread zsxwing
Github user zsxwing commented on the issue: https://github.com/apache/spark/pull/18566 @jinxing64 yeah, please also update configuration.md.

[GitHub] spark issue #18388: [SPARK-21175] Reject OpenBlocks when memory shortage on ...

2017-07-06 Thread zsxwing
Github user zsxwing commented on the issue: https://github.com/apache/spark/pull/18388 > there are 200K+ connections and 3.5M blocks(FileSegmentManagedBuffer) being fetched. Did you use a large `spark.shuffle.io.numConnectionsPerPeer`? If not, the number of connections se

[GitHub] spark issue #18509: [SPARK-21329][SS] Make EventTimeWatermarkExec explicitly...

2017-07-06 Thread zsxwing
Github user zsxwing commented on the issue: https://github.com/apache/spark/pull/18509 Thanks! Merging to master.

[GitHub] spark issue #18482: [SPARK-21262] Stop sending 'stream request' when shuffle...

2017-07-06 Thread zsxwing
Github user zsxwing commented on the issue: https://github.com/apache/spark/pull/18482 On second thought, I think we don't need this PR. We can disable `spark.reducer.maxReqSizeShuffleToMem` by default. Let's just document that this configuration will break old shuffle service

[GitHub] spark issue #18485: [SPARK-21267][SS][DOCS] Update Structured Streaming Docu...

2017-07-06 Thread zsxwing
Github user zsxwing commented on the issue: https://github.com/apache/spark/pull/18485 Thanks! Merging to master and 2.2.

[GitHub] spark pull request #18503: [SPARK-21271][SQL] Ensure Unsafe.sizeInBytes is a...

2017-07-06 Thread zsxwing
Github user zsxwing commented on a diff in the pull request: https://github.com/apache/spark/pull/18503#discussion_r126035228 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/streaming/state/HDFSBackedStateStoreProvider.scala --- @@ -350,20 +350,24 @@ private

[GitHub] spark issue #18485: [SPARK-21267][SS][DOCS] Update Structured Streaming Docu...

2017-07-06 Thread zsxwing
Github user zsxwing commented on the issue: https://github.com/apache/spark/pull/18485 LGTM

[GitHub] spark issue #18357: [SPARK-21146] [CORE] Master/Worker should handle and shu...

2017-07-06 Thread zsxwing
Github user zsxwing commented on the issue: https://github.com/apache/spark/pull/18357 LGTM. Pending tests. Thanks!

[GitHub] spark pull request #18509: [SS][MINOR] Make EventTimeWatermarkExec explicitl...

2017-07-06 Thread zsxwing
Github user zsxwing commented on a diff in the pull request: https://github.com/apache/spark/pull/18509#discussion_r125819776 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/streaming/EventTimeWatermarkExec.scala --- @@ -81,7 +81,7 @@ class EventTimeStatsAccum

[GitHub] spark pull request #18357: [SPARK-21146] [CORE] Master/Worker should handle ...

2017-07-06 Thread zsxwing
Github user zsxwing commented on a diff in the pull request: https://github.com/apache/spark/pull/18357#discussion_r125817628 --- Diff: core/src/main/scala/org/apache/spark/deploy/master/Master.scala --- @@ -1037,6 +1037,7 @@ private[deploy] object Master extends Logging

[GitHub] spark pull request #18357: [SPARK-21146] [CORE] Master/Worker should handle ...

2017-07-06 Thread zsxwing
Github user zsxwing commented on a diff in the pull request: https://github.com/apache/spark/pull/18357#discussion_r125817583 --- Diff: core/src/main/scala/org/apache/spark/util/SparkUncaughtExceptionHandler.scala --- @@ -20,29 +20,29 @@ package org.apache.spark.util import

[GitHub] spark pull request #18357: [SPARK-21146] [CORE] Master/Worker should handle ...

2017-07-06 Thread zsxwing
Github user zsxwing commented on a diff in the pull request: https://github.com/apache/spark/pull/18357#discussion_r125817503 --- Diff: core/src/main/scala/org/apache/spark/deploy/worker/Worker.scala --- @@ -737,6 +737,7 @@ private[deploy] object Worker extends Logging

[GitHub] spark pull request #18357: [SPARK-21146] [CORE] Master/Worker should handle ...

2017-07-06 Thread zsxwing
Github user zsxwing commented on a diff in the pull request: https://github.com/apache/spark/pull/18357#discussion_r125817451 --- Diff: core/src/main/scala/org/apache/spark/util/SparkUncaughtExceptionHandler.scala --- @@ -20,29 +20,29 @@ package org.apache.spark.util import

[GitHub] spark issue #18461: [SPARK-21248][SS]The clean up codes in StreamExecution s...

2017-07-05 Thread zsxwing
Github user zsxwing commented on the issue: https://github.com/apache/spark/pull/18461 Thanks! Merging to master.

[GitHub] spark pull request #18485: [SPARK-21267][SS][DOCS] Update Structured Streami...

2017-07-05 Thread zsxwing
Github user zsxwing commented on a diff in the pull request: https://github.com/apache/spark/pull/18485#discussion_r125781354 --- Diff: docs/structured-streaming-programming-guide.md --- @@ -1971,8 +2011,23 @@ write.stream(aggDF, "memory", outputMode =

[GitHub] spark pull request #18509: [SS][MINOR] Make EventTimeWatermarkExec explicitl...

2017-07-03 Thread zsxwing
Github user zsxwing commented on a diff in the pull request: https://github.com/apache/spark/pull/18509#discussion_r125346130 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/streaming/EventTimeWatermarkExec.scala --- @@ -81,7 +81,7 @@ class EventTimeStatsAccum

[GitHub] spark issue #18490: [SPARK-21269][Core][WIP] Fix FetchFailedException when e...

2017-06-30 Thread zsxwing
Github user zsxwing commented on the issue: https://github.com/apache/spark/pull/18490 I cannot reproduce this issue. Could you provide a unit test to reproduce this? Anyway, I suggest using `kryo.register(classOf[HighlyCompressedMapStatus], new KryoJavaSerializer

[GitHub] spark pull request #18485: [SPARK-21267][SS][DOCS] Update Structured Streami...

2017-06-30 Thread zsxwing
Github user zsxwing commented on a diff in the pull request: https://github.com/apache/spark/pull/18485#discussion_r125128901 --- Diff: docs/structured-streaming-programming-guide.md --- @@ -1971,8 +2011,23 @@ write.stream(aggDF, "memory", outputMode =

[GitHub] spark pull request #18485: [SPARK-21267][SS][DOCS] Update Structured Streami...

2017-06-30 Thread zsxwing
Github user zsxwing commented on a diff in the pull request: https://github.com/apache/spark/pull/18485#discussion_r125128668 --- Diff: docs/structured-streaming-programming-guide.md --- @@ -1922,6 +1953,15 @@ Not available in R. +### Reporting Metrics using

[GitHub] spark pull request #18485: [SPARK-21267][SS][DOCS] Update Structured Streami...

2017-06-30 Thread zsxwing
Github user zsxwing commented on a diff in the pull request: https://github.com/apache/spark/pull/18485#discussion_r125127821 --- Diff: docs/index.md --- @@ -88,13 +89,13 @@ options for deployment: **Programming Guides:** * [Quick Start](quick-start.html): a quick

[GitHub] spark pull request #18485: [SPARK-21267][SS][DOCS] Update Structured Streami...

2017-06-30 Thread zsxwing
Github user zsxwing commented on a diff in the pull request: https://github.com/apache/spark/pull/18485#discussion_r125127552 --- Diff: docs/_layouts/global.html --- @@ -69,14 +69,14 @@ Programming Guides

[GitHub] spark issue #18490: [SPARK-21269][Core][WIP] Fix FetchFailedException when e...

2017-06-30 Thread zsxwing
Github user zsxwing commented on the issue: https://github.com/apache/spark/pull/18490 I don't get it. Could you point out which place serializes MapStatus using Kryo?

[GitHub] spark issue #18478: [SPARK-21253][Core][HOTFIX]Fix Scala 2.10 build

2017-06-29 Thread zsxwing
Github user zsxwing commented on the issue: https://github.com/apache/spark/pull/18478 Verified that both the Scala 2.10 and 2.11 builds pass locally. Since the Jenkins PR build doesn't use Scala 2.10, I'm going to merge directly.

[GitHub] spark pull request #18478: [SPARK-21253][Core]Fix Scala 2.10 build

2017-06-29 Thread zsxwing
GitHub user zsxwing opened a pull request: https://github.com/apache/spark/pull/18478 [SPARK-21253][Core]Fix Scala 2.10 build ## What changes were proposed in this pull request? A follow up PR to fix Scala 2.10 build for #18472 ## How was this patch tested

[GitHub] spark pull request #16989: [SPARK-19659] Fetch big blocks to disk when shuff...

2017-06-29 Thread zsxwing
Github user zsxwing commented on a diff in the pull request: https://github.com/apache/spark/pull/16989#discussion_r124959257 --- Diff: common/network-shuffle/src/main/java/org/apache/spark/network/shuffle/OneForOneBlockFetcher.java --- @@ -126,4 +150,38 @@ private void

[GitHub] spark pull request #18472: [SPARK-21253][Core]Fix a bug that StreamCallback ...

2017-06-29 Thread zsxwing
Github user zsxwing commented on a diff in the pull request: https://github.com/apache/spark/pull/18472#discussion_r124955416 --- Diff: common/network-common/src/main/java/org/apache/spark/network/client/TransportResponseHandler.java --- @@ -104,15 +106,31 @@ public void

[GitHub] spark pull request #18357: [SPARK-21146] [CORE] Worker should handle and shu...

2017-06-29 Thread zsxwing
Github user zsxwing commented on a diff in the pull request: https://github.com/apache/spark/pull/18357#discussion_r124944873 --- Diff: core/src/main/scala/org/apache/spark/util/SparkUncaughtExceptionHandler.scala --- @@ -26,27 +26,34 @@ import org.apache.spark.internal.Logging
