Andrew Olson created CRUNCH-678:
-----------------------------------

             Summary: Avoid unnecessary retrieval of last modified time
                 Key: CRUNCH-678
                 URL: https://issues.apache.org/jira/browse/CRUNCH-678
             Project: Crunch
          Issue Type: Improvement
          Components: Core
            Reporter: Andrew Olson
            Assignee: Josh Wills


There is no guarantee that the last modified time can be retrieved efficiently
on every file system. With object stores and large data sets in particular, the
lookup could be very slow. Since this information is not always needed, we
should retrieve it for sources and targets only when necessary (i.e. when the
write mode is checkpoint).
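
As a rough illustration of the idea, the sketch below guards an expensive
timestamp lookup behind a write mode check. It is not the actual Crunch change:
the class name, the lastModifiedIfNeeded helper, the UNKNOWN sentinel of -1, and
the locally defined WriteMode enum are all hypothetical stand-ins chosen for
this example, and the lookup is simulated rather than performed against a real
file system.

```java
import java.util.concurrent.atomic.AtomicInteger;
import java.util.function.LongSupplier;

public class LazyLastModified {
    // Hypothetical stand-in for the pipeline's write modes (assumption).
    public enum WriteMode { DEFAULT, OVERWRITE, APPEND, CHECKPOINT }

    // Sentinel meaning "timestamp was not retrieved" (assumption of this sketch).
    public static final long UNKNOWN = -1L;

    // Runs the potentially slow lookup only when the write mode is CHECKPOINT;
    // every other mode skips it and gets the sentinel back.
    public static long lastModifiedIfNeeded(WriteMode mode, LongSupplier lookup) {
        if (mode != WriteMode.CHECKPOINT) {
            return UNKNOWN;
        }
        return lookup.getAsLong();
    }

    public static void main(String[] args) {
        AtomicInteger calls = new AtomicInteger();
        // Simulated expensive lookup, e.g. listing many objects in an object store.
        LongSupplier expensiveLookup = () -> {
            calls.incrementAndGet();
            return 1_500_000_000_000L;
        };

        long skipped = lastModifiedIfNeeded(WriteMode.OVERWRITE, expensiveLookup);
        long fetched = lastModifiedIfNeeded(WriteMode.CHECKPOINT, expensiveLookup);

        System.out.println("OVERWRITE:  " + skipped);
        System.out.println("CHECKPOINT: " + fetched);
        System.out.println("lookups performed: " + calls.get());
    }
}
```

Passing the lookup as a supplier keeps the cost decision in one place: callers
that never use checkpoint mode never trigger the file system call.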

CRUNCH-658 expressed similar concerns for the getSize method. Avoiding the last
modified time lookup would be a simpler and safer optimization to make.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
