Actually, my confusion was here (just answered my own question).
<coordinator-app name="app-coord-daily" frequency="${coord:days(1)}"
start="2009-01-01T24:00Z" end="2009-12-31T24:00Z"
timezone="UTC"
xmlns="uri:oozie:coordinator:0.1">
If the logs are continually present, having the coordinator run once a day
will make it so, 24 new logs are grabbed each time.
Thanks.
On Thu, Feb 20, 2014 at 11:21 AM, Purshotam Shah <[email protected]>wrote:
>
> Yes. If you dataset ³1HourLogs² is hourly, then every time it going to
> look for 23 previous hour logs + 1 .
>
>
>
> On 2/20/14, 9:03 AM, "Scott Preddy" <[email protected]> wrote:
>
> >Will the snipped below over the same 23 logs it ran over the previous hour
> >(i.e. just bumping up
> >the log iterator by 1) each hour, or is oozie going to run the action once
> >24 logs are present, then not kick off the action again until 24 new logs
> >are present? I think it is the former, but just
> >making sure.
> >
> ><coordinator-app name="app-coord-daily" frequency="${coord:days(1)}"
> > start="2009-01-01T24:00Z" end="2009-12-31T24:00Z"
> >timezone="UTC"
> > xmlns="uri:oozie:coordinator:0.1">
> > <datasets>
> >
> ><include>hdfs://foo:8020/app/dataset-definitions/datasets.xml</include>
> > </datasets>
> > <input-events>
> > <data-in name="input" dataset="1HourLogs">
> > <start-instance>${coord:current(-23)}</start-instance>
> > <end-instance>${coord:current(0)}</end-instance>
> > </data-in>
> > </input-events>
> > <output-events>
> > <data-out name="output" dataset="1DayLogs">
> > <instance>${coord:current(0)}</instance>
> > </data-out>
> > </output-events>
> > <action>
> > <workflow>
> > ...
> > </workflow>
> > </action>
> > </coordinator-app>
>
>