[GitHub] spark issue #17774: [SPARK-18371][Streaming] Spark Streaming backpressure ge...

JasonMWhite Thu, 27 Apr 2017 09:50:13 -0700

Github user JasonMWhite commented on the issue:

    https://github.com/apache/spark/pull/17774
  
    I think @koeninger's suggestion is valid. `effectiveRateLimitPerPartition` 
is the upper bound on the number of messages per partition per second, and 
`maxMessagesPerPartition` sets an upper bound on the number of messages to be 
retrieved per partition per batch window.
    
    Making `effectiveRateLimitPerPartition` a float will allow it to properly 
handle rates below 1 message per partition per second, so this is definitely 
a good idea. `maxMessagesPerPartition` must still be an integer, as you can't 
retrieve partial messages. All agreed there.
    
    Giving `maxMessagesPerPartition` a minimum of 1 message per window is a 
good, safe value that allows progress in all cases. If a partition doesn't 
have 1 message to retrieve, `clamp` will prevent it from attempting to 
retrieve an invalid message.
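
    The arithmetic above can be sketched roughly as follows. This is an 
illustration in Python, not the Spark source: the function names, parameter 
names, and example numbers are assumptions made for this sketch, and only the 
ideas (a fractional per-second rate, an integer per-batch cap with a floor of 
1, and a clamp to the latest available offset) come from the discussion.

```python
def max_messages_per_partition(rate_per_partition: float, batch_seconds: float) -> int:
    """Per-batch cap for one partition (hypothetical helper, not Spark's API).

    rate_per_partition is a float (messages per second), so rates below 1
    survive until they are multiplied by the batch window. The result is an
    integer because partial messages cannot be fetched, and it is floored
    at 1 so the partition can always make progress.
    """
    return max(int(rate_per_partition * batch_seconds), 1)


def clamp(current_offset: int, max_messages: int, latest_offset: int) -> int:
    """Until-offset for the batch, never past the latest available offset."""
    return min(current_offset + max_messages, latest_offset)


# A rate of 0.5 msg/s over a 10 s window gives a cap of 5 messages.
# Had the rate been truncated to an integer first, it would have been 0.
cap_slow = max_messages_per_partition(0.5, 10.0)

# A rate so low that rate * window < 1 still gets the minimum of 1.
cap_floor = max_messages_per_partition(0.2, 2.0)

# With nothing new in the partition (current == latest), the clamp keeps
# the until-offset at the latest offset, so no message is requested.
until_empty = clamp(current_offset=50, max_messages=cap_floor, latest_offset=50)

# With a backlog, the cap applies: 50 + 5 = 55, which is below latest = 120.
until_backlog = clamp(current_offset=50, max_messages=cap_slow, latest_offset=120)
```

    In this sketch the floor of 1 and the clamp work together: the floor 
guarantees a non-zero request, and the clamp trims that request back when the 
partition has nothing new.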


