[ https://issues.apache.org/jira/browse/TEZ-3334?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15378147#comment-15378147 ]
Jason Lowe commented on TEZ-3334: --------------------------------- Another would-be-nice feature is per-disk thread pools. Today threads can get bunched up on slow disks, leaving no threads to service other disks. At some point adding more threads to a disk just makes the I/O worse, so it'd be nice to be able to configure threads-per-disk rather than threads overall and hope they spread cleanly in practice. > Tez Custom Shuffle Handler > -------------------------- > > Key: TEZ-3334 > URL: https://issues.apache.org/jira/browse/TEZ-3334 > Project: Apache Tez > Issue Type: New Feature > Reporter: Jonathan Eagles > > For conditions where auto-parallelism is reduced (e.g. TEZ-3222), a custom > shuffle handler could help reduce the number of fetches and could more > efficiently fetch data. In particular if a reducer is fetching 100 pieces > serially from the same mapper it could do this in one fetch call. -- This message was sent by Atlassian JIRA (v6.3.4#6332)