[ https://issues.apache.org/jira/browse/FLINK-19121?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17190606#comment-17190606 ]
Jingsong Lee edited comment on FLINK-19121 at 9/18/20, 6:28 AM:
----------------------------------------------------------------
master: 41c3a19b235ad1351e9376d2d70101bd8090a4a8
release-1.11: 7d86498e088aac28e9d7c77499e2118370f5cc96
was (Author: lzljs3620320):
master: 41c3a19b235ad1351e9376d2d70101bd8090a4a8
release-1.11: f96c562a23dfb1f5a4b562a32099738a3f3db3e6
> Avoid accessing HDFS frequently in HiveBulkWriterFactory
> --------------------------------------------------------
>
> Key: FLINK-19121
> URL: https://issues.apache.org/jira/browse/FLINK-19121
> Project: Flink
> Issue Type: Bug
> Components: Connectors / Hive
> Affects Versions: 1.12.0, 1.11.1
> Reporter: Jingsong Lee
> Assignee: Jingsong Lee
> Priority: Blocker
> Labels: pull-request-available
> Fix For: 1.12.0, 1.11.3
>
>
> In HadoopPathBasedBulkWriter, getSize invokes `FileSystem.exists` and
> `FileSystem.getFileStatus`, and getSize itself is called once per record.
> This produces a large number of requests to HDFS and may put excessive
> pressure on it.
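
To illustrate the problem described above, here is a minimal sketch of one possible mitigation: cache the size and refresh it at most once per interval, so that per-record calls to getSize do not each hit the file system. This is not necessarily what the commits listed above do. `ThrottledSizeProbe`, `remoteSize`, `clock`, and `refreshIntervalMs` are hypothetical names, and the `LongSupplier` stands in for the `FileSystem.exists` plus `FileSystem.getFileStatus` round trip so that the example runs without Hadoop on the classpath.

```java
import java.util.function.LongSupplier;

/**
 * Hypothetical helper (not part of Flink): caches an expensive size lookup
 * and refreshes it at most once per refresh interval.
 */
final class ThrottledSizeProbe {
    // Stand-in for the remote lookup (exists + getFileStatus in the issue).
    private final LongSupplier remoteSize;
    // Millisecond clock, injectable so the behavior can be tested deterministically.
    private final LongSupplier clock;
    private final long refreshIntervalMs;

    private long cachedSize;
    private long lastRefreshMs;
    private boolean initialized;

    ThrottledSizeProbe(LongSupplier remoteSize, LongSupplier clock, long refreshIntervalMs) {
        this.remoteSize = remoteSize;
        this.clock = clock;
        this.refreshIntervalMs = refreshIntervalMs;
    }

    /** Cheap to call per record: only goes remote when the cached value is stale. */
    long getSize() {
        long now = clock.getAsLong();
        if (!initialized || now - lastRefreshMs >= refreshIntervalMs) {
            cachedSize = remoteSize.getAsLong();
            lastRefreshMs = now;
            initialized = true;
        }
        return cachedSize;
    }
}
```

The trade-off in this sketch is that the reported size can lag behind the real file size by up to one interval, so a size-based rolling decision may fire slightly late.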