Paul,

In Apache NiFi 0.6.0 if you're looking for a change capture type
mechanism to source from relational databases take a look at
QueryDatabaseTable [1].

That processor is new and any feedback and or contribs for it would be awesome.

ExecuteSQL does have some time driven use cases to capture snapshots
and such but you're right that it doesn't sound like a good fit for
your case.

[1] 
https://nifi.apache.org/docs/nifi-docs/components/org.apache.nifi.processors.standard.QueryDatabaseTable/index.html

On Wed, Mar 30, 2016 at 9:11 AM, Paul Bormans <[email protected]> wrote:
> I'm evaluating Apache Nifi as data ingestion tool to load data from an
> RDBMS into S3. A first test shows odd behavior where the same rows are
> written to the flowfile over and over again while i expected that only new
> rows are written.
>
> In fact i was missing configuration options to specify what column could be
> used to query only for new rows.
>
> Taking a look at the processor implementation makes me believe that the
> only option is to define a query including OFFSET n LIMIT m where "n" is
> dynamically set based upon previous onTriggers; would this even be possible?
>
> Some setup info:
> nifi: 0.6.0
> backend: postgresql
> driver: postgresql-9.4.1208.jre6.jar
> query: select * from addresses
>
> More in general i don't see a use-case where the current ExecuteSQL
> processor fits as a processor (without input flowfile). Someone can explain?
>
> Paul

Reply via email to