Github user hequn8128 commented on the issue:
https://github.com/apache/flink/pull/6075
Hi @zhangminglei, thanks very much for your PR.
As for dependencies, I think it would be better to move the ORC-related
classes into the `flink-orc` module, so that we don't need to add an ORC
dependency to the `flink-connector-filesystem` module.
Moreover, `OrcFileWriter` should implement the `CheckpointedFunction`
interface so that data still buffered in `rowBatch` is not lost during job
failover. During checkpointing, you can either store the contents of
`rowBatch` in state or call `flush()`. Similar logic can be found in
`BucketingSink`.
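To illustrate the `flush()` option, here is a rough sketch in plain Java so that it runs without Flink on the classpath. `SnapshotHook` and `BufferingWriter` are hypothetical stand-ins, not Flink or PR classes: `SnapshotHook` only mimics the snapshot half of `CheckpointedFunction` (the real interface takes context arguments and also has `initializeState`), and the `durable` list stands in for the ORC file. It is meant as an assumption-laden outline of the idea, not as the actual `OrcFileWriter` change.

```java
import java.util.ArrayList;
import java.util.List;

public class FlushOnCheckpointSketch {

    /** Hypothetical, simplified stand-in for the snapshot callback of CheckpointedFunction. */
    interface SnapshotHook {
        void snapshotState() throws Exception;
    }

    /** Hypothetical writer that buffers rows in memory, similar in spirit to `rowBatch`. */
    static class BufferingWriter implements SnapshotHook {
        private final int batchSize;
        private final List<String> rowBatch = new ArrayList<>();
        // Stands in for rows that have already been written out to the file.
        private final List<String> durable = new ArrayList<>();

        BufferingWriter(int batchSize) {
            this.batchSize = batchSize;
        }

        /** Buffers a row and flushes once the batch is full. */
        void write(String row) {
            rowBatch.add(row);
            if (rowBatch.size() >= batchSize) {
                flush();
            }
        }

        /** Moves all buffered rows to the durable store and empties the buffer. */
        void flush() {
            durable.addAll(rowBatch);
            rowBatch.clear();
        }

        /** Called on checkpoint: flush so that no row lives only in memory. */
        @Override
        public void snapshotState() {
            flush();
        }

        int buffered() {
            return rowBatch.size();
        }

        List<String> durableRows() {
            return durable;
        }
    }

    public static void main(String[] args) {
        BufferingWriter writer = new BufferingWriter(3);
        writer.write("a");
        writer.write("b");
        System.out.println("buffered before checkpoint: " + writer.buffered());
        writer.snapshotState();
        System.out.println("buffered after checkpoint: " + writer.buffered());
        System.out.println("durable rows: " + writer.durableRows());
    }
}
```

Without the flush in `snapshotState()`, the two rows written before the checkpoint would exist only in memory and would be lost if the job failed before the batch filled up. Storing `rowBatch` in operator state instead would avoid producing small files on every checkpoint, at the cost of larger checkpoints.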
Best, Hequn.
---