damccorm opened a new issue, #21570:
URL: https://github.com/apache/beam/issues/21570

   In TextIO and AvroIO, we have a configuration option called 
watchForNewFiles, and in FileIO.MatchConfiguration, we have an option called 
watchInterval. Right now, these match any files according to the filtering 
criteria, and then periodically check for new files. A file is determined to be 
new if it has a different filename than a file that has already been read.
   
   We want to add an option to choose to consider a file new if it has a 
different timestamp from an existing file, even if the file itself has the same 
name.
   
   See the following design doc for more detail:
   
   
[https://docs.google.com/document/d/1xnacyLGNh6rbPGgTAh5D1gZVR8rHUBsMMRV3YkvlL08/edit?usp=sharing&resourcekey=0-be0uF-DdmwAz6Vg4Li9FNw](https://docs.google.com/document/d/1xnacyLGNh6rbPGgTAh5D1gZVR8rHUBsMMRV3YkvlL08/edit?usp=sharing&resourcekey=0-be0uF-DdmwAz6Vg4Li9FNw)
   
    
   
   Imported from Jira 
[BEAM-14267](https://issues.apache.org/jira/browse/BEAM-14267). Original Jira 
may contain additional context.
   Reported by: yihu.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to