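If I understand correctly, by "process" you mean an SSH action (running a bash script) in Oozie. Yes, you can configure different types of actions in a single workflow:

1. SSH action
2. Fork/Join with 3 M/R jobs
3. SSH action

The join only transitions to the next action after all three forked M/R jobs have finished, so steps 1, 2 and 3 still run in order.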
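
A rough sketch of how the three steps above could look as a workflow.xml. This is untested and only meant to show the shape: the host, the script paths, the action names and the mapper class names are all placeholders I made up, and the real M/R actions would need their full job configuration (HBase input, MySQL output, and so on).

```xml
<workflow-app name="cp-stats-wf" xmlns="uri:oozie:workflow:0.4">
    <start to="init-mysql"/>

    <!-- Step 1: single process that prepares MySQL (script path is hypothetical) -->
    <action name="init-mysql">
        <ssh xmlns="uri:oozie:ssh-action:0.1">
            <host>user@batch-host</host>
            <command>/path/to/init_mysql.sh</command>
        </ssh>
        <ok to="fork-cp"/>
        <error to="fail"/>
    </action>

    <!-- Step 2: run the three per-CP M/R jobs in parallel -->
    <fork name="fork-cp">
        <path start="mr-cp1"/>
        <path start="mr-cp2"/>
        <path start="mr-cp3"/>
    </fork>

    <action name="mr-cp1">
        <map-reduce>
            <job-tracker>${jobTracker}</job-tracker>
            <name-node>${nameNode}</name-node>
            <configuration>
                <property>
                    <name>mapred.mapper.class</name>
                    <!-- hypothetical class name -->
                    <value>com.example.stats.Cp1Mapper</value>
                </property>
            </configuration>
        </map-reduce>
        <ok to="join-cp"/>
        <error to="fail"/>
    </action>

    <action name="mr-cp2">
        <map-reduce>
            <job-tracker>${jobTracker}</job-tracker>
            <name-node>${nameNode}</name-node>
            <configuration>
                <property>
                    <name>mapred.mapper.class</name>
                    <!-- hypothetical class name -->
                    <value>com.example.stats.Cp2Mapper</value>
                </property>
            </configuration>
        </map-reduce>
        <ok to="join-cp"/>
        <error to="fail"/>
    </action>

    <action name="mr-cp3">
        <map-reduce>
            <job-tracker>${jobTracker}</job-tracker>
            <name-node>${nameNode}</name-node>
            <configuration>
                <property>
                    <name>mapred.mapper.class</name>
                    <!-- hypothetical class name -->
                    <value>com.example.stats.Cp3Mapper</value>
                </property>
            </configuration>
        </map-reduce>
        <ok to="join-cp"/>
        <error to="fail"/>
    </action>

    <!-- The join waits for all three forked paths before continuing -->
    <join name="join-cp" to="finalize-mysql"/>

    <!-- Step 3: single process that finalizes the aggregation (script path is hypothetical) -->
    <action name="finalize-mysql">
        <ssh xmlns="uri:oozie:ssh-action:0.1">
            <host>user@batch-host</host>
            <command>/path/to/finalize_mysql.sh</command>
        </ssh>
        <ok to="end"/>
        <error to="fail"/>
    </action>

    <kill name="fail">
        <message>Workflow failed at [${wf:lastErrorNode()}]</message>
    </kill>
    <end name="end"/>
</workflow-app>
```

If any of the forked jobs fails, its error transition goes to the kill node and the whole workflow is killed, so the finalize step does not run against partial results.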

Thanks,
-Idris

On Thu, Feb 5, 2015 at 10:10 AM, 임정택 <[email protected]> wrote:

> Hello all.
>
> We're periodically scan HBase tables to aggregate statistic information,
> and store it to MySQL.
>
> We have 3 kinds of CP (kind of data source), each has one Channel and one
> Article table.
> (Channel : Article is 1:N relation.)
>
> All CPs table schema are different a bit, so in order to aggregate we
> should apply different logics, with joining Channel and Article.
>
> I've thought about workflow like this, but I wonder it can make sense.
>
> 1. run single process which initializes MySQL by creating table, deleting
> row, etc.
> 2. run 3 M/Rs simultaneously to aggregate statistic information for each
> CP, and insert rows per Channel to MySQL.
> 3. run single process which finalizes whole aggregation - runs aggregation
> query from MySQL to insert new row to MySQL, rolling table, etc.
>
> Definitely 1,2,3 should be run in a row.
>
> Any helps are really appreciated!
> Thanks.
>
> Regards.
> Jungtaek Lim (HeartSaVioR)
