[ https://issues.apache.org/jira/browse/FLINK-19121?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Jingsong Lee closed FLINK-19121. -------------------------------- Resolution: Fixed master: 41c3a19b235ad1351e9376d2d70101bd8090a4a8 release-1.11: f96c562a23dfb1f5a4b562a32099738a3f3db3e6 > Avoid accessing HDFS frequently in HiveBulkWriterFactory > -------------------------------------------------------- > > Key: FLINK-19121 > URL: https://issues.apache.org/jira/browse/FLINK-19121 > Project: Flink > Issue Type: Bug > Components: Connectors / Hive > Affects Versions: 1.12.0, 1.11.1 > Reporter: Jingsong Lee > Assignee: Jingsong Lee > Priority: Blocker > Labels: pull-request-available > Fix For: 1.11.2 > > > In HadoopPathBasedBulkWriter, getSize will invoke `FileSystem.exists` and > `FileSystem.getFileStatus`, but it is invoked per record. > There will be lots of visits to HDFS, may make HDFS pressure too high. -- This message was sent by Atlassian Jira (v8.3.4#803005)