Hung Tran created GOBBLIN-787:
---------------------------------

             Summary: Add an option to include the task start time in the output file name
                 Key: GOBBLIN-787
                 URL: https://issues.apache.org/jira/browse/GOBBLIN-787
             Project: Apache Gobblin
          Issue Type: Task
            Reporter: Hung Tran
            Assignee: Hung Tran


In some cases a task may be scheduled to run on multiple workers. One case 
where this happens is when running with the Helix task execution framework. 
Helix may reschedule a task on a different worker if it loses contact with the 
original worker. The original worker may continue executing the task for some 
time before the task is terminated. If both workers produce output files with 
the same name during this period, there may be an error during data publish.

Add an option "writer.addTaskTimestamp" that, when enabled, appends the task 
start timestamp to the output file name to reduce the chance of name collisions.
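
As a rough illustration of the idea (not the actual Gobblin change), the 
sketch below reads the option from a Properties object and, when it is set, 
inserts the task start time into the file name. The class name, the 
buildFileName helper, the "false" default, and the name layout 
(base.timestamp.extension) are all assumptions made for this example; only the 
key "writer.addTaskTimestamp" comes from the description above.

```java
import java.util.Properties;

public class TimestampedFileName {
    // Option key from the issue description. Treating an unset key as
    // "false" is an assumption for this sketch.
    static final String ADD_TASK_TIMESTAMP_KEY = "writer.addTaskTimestamp";

    // Hypothetical helper: builds "<base>.<startMillis>.<ext>" when the
    // option is enabled, and "<base>.<ext>" otherwise.
    static String buildFileName(Properties props, String baseName,
                                String extension, long taskStartMillis) {
        boolean addTimestamp = Boolean.parseBoolean(
                props.getProperty(ADD_TASK_TIMESTAMP_KEY, "false"));
        String name = addTimestamp ? baseName + "." + taskStartMillis : baseName;
        return name + "." + extension;
    }

    public static void main(String[] args) {
        Properties props = new Properties();
        props.setProperty(ADD_TASK_TIMESTAMP_KEY, "true");

        // Two attempts of the same task started at different times on
        // different workers (timestamps are made-up values).
        long firstAttemptStart = 1559347200000L;
        long secondAttemptStart = 1559347260000L;

        System.out.println(buildFileName(props, "part.task_0", "avro", firstAttemptStart));
        System.out.println(buildFileName(props, "part.task_0", "avro", secondAttemptStart));
    }
}
```

With the option enabled, the two attempts write to different names as long as 
their start times differ. Two attempts that start in the same millisecond 
would still collide, which is why the option reduces the chance of a collision 
rather than eliminating it.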



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
