Dylan Hercher created BEAM-12685:
------------------------------------

             Summary: Allow managed thread count in AvroIO
                 Key: BEAM-12685
                 URL: https://issues.apache.org/jira/browse/BEAM-12685
             Project: Beam
          Issue Type: Improvement
          Components: io-java-files
            Reporter: Dylan Hercher


During execution, the `ReadAllViaFileBasedSource` runs ReShuffle and creates an 
un-grouped set of file range readers.  This can easily cause OOM issues when 
the number of groups changes as there is no limit to the number of concurrent 
file reads.

 

Using Reshuffle.viaRandomKeys.withNumBuckets instead will allow the same 
default behavior, but lets the user configure the number of readers as and when 
needed.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to