Matt Burgess created NIFI-5590:
----------------------------------
Summary: Allow database fetch processors to store state by
database/catalog/schema
Key: NIFI-5590
URL: https://issues.apache.org/jira/browse/NIFI-5590
Project: Apache NiFi
Issue Type: Improvement
Components: Extensions
Reporter: Matt Burgess
The database fetch processors (GenerateTableFetch, QueryDatabaseTable, e.g.)
use a scheme of "table.column" to store the state of the maximum values for a
column, as well as a similar entry for the column type.
Now that GenerateTableFetch can accept incoming flow files, and if it uses a
DBCPConnectionPoolLookup service, the database could differ yet contain the
same table/column name combinations (MySQL shards, e.g.). The state values as
currently stored will be shared in this situation, which could certainly lead
to errors.
If possible/prudent, the fully-qualified state name should include
database/catalog/schema information such that the state entries are unique at a
database level. This may prove difficult as different drivers may or may not
make such information available, for example sometimes the database name is the
schema, sometimes it is the catalog, etc. As long as the names are unique at
the database level, there should be no conflicts. This may lead to a more
complicated naming scheme, and backwards-compatibility should be maintained
(unless this is implemented for a major release).
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)