Will the snipped below over the same 23 logs it ran over the previous hour
(i.e. just bumping up
the log iterator by 1) each hour, or is oozie going to run the action once
24 logs are present, then not kick off the action again until 24 new logs
are present? I think it is the former, but just
making sure.

<coordinator-app name="app-coord-daily" frequency="${coord:days(1)}"
                    start="2009-01-01T24:00Z" end="2009-12-31T24:00Z"
timezone="UTC"
                    xmlns="uri:oozie:coordinator:0.1">
      <datasets>

<include>hdfs://foo:8020/app/dataset-definitions/datasets.xml</include>
      </datasets>
      <input-events>
        <data-in name="input" dataset="1HourLogs">
          <start-instance>${coord:current(-23)}</start-instance>
          <end-instance>${coord:current(0)}</end-instance>
        </data-in>
      </input-events>
      <output-events>
        <data-out name="output" dataset="1DayLogs">
          <instance>${coord:current(0)}</instance>
        </data-out>
      </output-events>
      <action>
        <workflow>
        ...
       </workflow>
      </action>
   </coordinator-app>

Reply via email to