[GitHub] spark issue #16374: [SPARK-18925][STREAMING] Reduce memory usage of mapWithS...

2018-06-21 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/16374 Can one of the admins verify this patch? --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional

[GitHub] spark issue #16374: [SPARK-18925][STREAMING] Reduce memory usage of mapWithS...

2018-06-12 Thread vpchelko
Github user vpchelko commented on the issue: https://github.com/apache/spark/pull/16374 I don't use anymore approach above. To unpersist unnecessary RDD, I hacked MapWithStateDStream a little bit by calling unpersist for previously generated RDDs in internalMapWithStateStream.

[GitHub] spark issue #16374: [SPARK-18925][STREAMING] Reduce memory usage of mapWithS...

2018-06-08 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/16374 Can one of the admins verify this patch? --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional

[GitHub] spark issue #16374: [SPARK-18925][STREAMING] Reduce memory usage of mapWithS...

2017-12-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/16374 Can one of the admins verify this patch? --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional

[GitHub] spark issue #16374: [SPARK-18925][STREAMING] Reduce memory usage of mapWithS...

2017-12-14 Thread wobuxiangtong
Github user wobuxiangtong commented on the issue: https://github.com/apache/spark/pull/16374 Excuse me: I am using mapwithstate to storage data in sparkstreaming。What confused me is that rememberDuration must more than checkpointDuration。I read the code of MapWithStateRD

[GitHub] spark issue #16374: [SPARK-18925][STREAMING] Reduce memory usage of mapWithS...

2017-01-10 Thread vpchelko
Github user vpchelko commented on the issue: https://github.com/apache/spark/pull/16374 DStream.clearMetadata allows to keep metadata for up to checkpoint_interval_multiplier RDDs. This PR clears metadata as soon as possible. As the result applicatin uses less memory and less GC o

[GitHub] spark issue #16374: [SPARK-18925][STREAMING] Reduce memory usage of mapWithS...

2017-01-09 Thread zsxwing
Github user zsxwing commented on the issue: https://github.com/apache/spark/pull/16374 > scala.NotImplementedError: put() should not be called on an EmptyStateMap This is another issue and not related to the memory issue. Could you create a new ticket, please? For thi

[GitHub] spark issue #16374: [SPARK-18925][STREAMING] Reduce memory usage of mapWithS...

2017-01-05 Thread vpchelko
Github user vpchelko commented on the issue: https://github.com/apache/spark/pull/16374 cc @zsxwing, @tdas --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so,

[GitHub] spark issue #16374: [SPARK-18925][STREAMING] Reduce memory usage of mapWithS...

2017-01-05 Thread vpchelko
Github user vpchelko commented on the issue: https://github.com/apache/spark/pull/16374 We use spark 2.0.0. Changing spark.memory.fraction (we tried 0.3 and 0.4) (also tried to increase memoryOverhead to 20% of executor memory) does not help. The application is not s

[GitHub] spark issue #16374: [SPARK-18925][STREAMING] Reduce memory usage of mapWithS...

2017-01-03 Thread vpchelko
Github user vpchelko commented on the issue: https://github.com/apache/spark/pull/16374 We dont use windows operations (Kafka streaming) - so we don't need 'old' RDDs :) I need to double check if spark.memory.fraction help to us. --- If your project is set up for it, you can

[GitHub] spark issue #16374: [SPARK-18925][STREAMING] Reduce memory usage of mapWithS...

2016-12-29 Thread zsxwing
Github user zsxwing commented on the issue: https://github.com/apache/spark/pull/16374 Some operators (e.g., window) may still need to use these RDD, dropping them will slow your application. That's why `DStream` uses `rememberDuration` to filter RDDs. You can [tune the memor

[GitHub] spark issue #16374: [SPARK-18925][STREAMING] Reduce memory usage of mapWithS...

2016-12-23 Thread vpchelko
Github user vpchelko commented on the issue: https://github.com/apache/spark/pull/16374 cc @tdas --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the

[GitHub] spark issue #16374: [SPARK-18925][STREAMING] Reduce memory usage of mapWithS...

2016-12-22 Thread vpchelko
Github user vpchelko commented on the issue: https://github.com/apache/spark/pull/16374 Jenkins, retest this please. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and w

[GitHub] spark issue #16374: [SPARK-18925][STREAMING] Reduce memory usage of mapWithS...

2016-12-21 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/16374 Can one of the admins verify this patch? --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feat