Hi Fabian, Vino, I have one more question, which I initially planned to create a new thread, but now I think it is better to ask here: I need to process one big tar.gz file which contains multiple small gz files. What is the best way to do this? I am thinking of having one single thread process that read the TarArchiveStream (which has been decompressed from that tar.gz by Flink automatically), and then distribute the TarArchiveEntry entries to a multi-thread operator which would process the small files in parallel. If this is feasible, which elements from Flink I can reuse?
Thanks a lot. Regards, Averell -- Sent from: http://apache-flink-user-mailing-list-archive.2336050.n4.nabble.com/