I have an ETL process that needs to run in two modes:

  - Loading files
  - Streaming

My destination needs to be bulk loaded when reading from files, and
writing directly to the DB when streaming.

I've managed to subclass the SAX parser to read from STDIN for
streaming.

I tried splitting the source and destination into different .ctl files
and wrapping them in a batch file, but the pipeline was not propagated
between the control files.

What's the best way to factor out and reuse portions of the ETL pipeline?

Thanks
 -Bob
_______________________________________________
Activewarehouse-discuss mailing list
Activewarehouse-discuss@rubyforge.org
http://rubyforge.org/mailman/listinfo/activewarehouse-discuss

Reply via email to