lirui-apache commented on issue #11175: [FLINK-16197][hive] Failed to query 
partitioned table when partition …
URL: https://github.com/apache/flink/pull/11175#issuecomment-589686279
 
 
   > Is this a reasonable issue?
   
   The issue was reported by a user trying to access Hive tables from Flink. 
Since Hive can handle this situation, I think a migrating user might expect 
Flink to do the same.
   
   BTW, Hive handles the issue a little differently. Instead of ignoring the 
missing folder, it creates empty files/folders for such partitions under a 
scratch dir in HDFS. Since we don't have such a scratch dir on the Flink side 
(and therefore couldn't clean it up afterwards), I think we can't follow Hive's 
solution.
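
   To illustrate the "ignore the missing folder" approach, here is a minimal 
sketch. It is not the code from this PR: the class and method names are 
hypothetical, and it uses local `java.nio.file` paths purely for illustration, 
whereas a real connector would resolve partition locations from the metastore 
and check them through the file system the table actually lives on.

```java
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper, for illustration only.
final class PartitionPathFilter {
    private PartitionPathFilter() {}

    /**
     * Returns only those partition locations that currently exist as
     * directories. Partitions whose folder is missing are skipped, so they
     * contribute no rows instead of failing the whole query.
     */
    static List<Path> existingPartitions(List<Path> partitionLocations) {
        List<Path> result = new ArrayList<>();
        for (Path location : partitionLocations) {
            if (Files.isDirectory(location)) {
                result.add(location);
            }
        }
        return result;
    }
}
```

   Unlike creating placeholder files, this leaves nothing behind that would 
need to be cleaned up later.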
   
   Also note that this is not trying to deal with concurrency issues. 
Concurrent access to the table (e.g. deleting the files while another job reads 
them) can still fail.
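
   A minimal sketch of why an existence check alone cannot cover the 
concurrent case: the check and the read are two separate steps, so a file can 
disappear in between. The class and method names are hypothetical, and local 
`java.nio.file` paths stand in for the table's real file system.

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.List;

// Hypothetical helper, for illustration only.
final class CheckThenRead {
    private CheckThenRead() {}

    /** Step 1: the existence check, done while planning the read. */
    static boolean exists(Path file) {
        return Files.isRegularFile(file);
    }

    /**
     * Step 2: the actual read, done later. If another job deleted the file
     * after step 1, this throws an IOException (NoSuchFileException), and the
     * earlier check does nothing to prevent that.
     */
    static List<String> read(Path file) throws IOException {
        return Files.readAllLines(file);
    }
}
```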

