Matt Burgess created NIFI-5590:
----------------------------------

             Summary: Allow database fetch processors to store state by database/catalog/schema
                 Key: NIFI-5590
                 URL: https://issues.apache.org/jira/browse/NIFI-5590
             Project: Apache NiFi
          Issue Type: Improvement
          Components: Extensions
            Reporter: Matt Burgess


The database fetch processors (e.g., GenerateTableFetch and QueryDatabaseTable) 
use a "table.column" naming scheme to store the maximum value seen for each 
column in state, as well as a similar entry for the column type.

GenerateTableFetch can now accept incoming flow files, and when it uses a 
DBCPConnectionPoolLookup service, the target database can differ from one flow 
file to the next while still containing the same table/column name 
combinations (e.g., MySQL shards). The state values as currently stored will 
be shared across those databases in this situation, which could lead to errors.
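
To illustrate the collision, here is a minimal sketch. It assumes the state is 
a simple string map keyed by "table.column" as described above. The class and 
method names (SharedStateCollision, stateKey, simulate) are hypothetical and 
are not taken from the NiFi code base.

```java
import java.util.HashMap;
import java.util.Map;

public class SharedStateCollision {

    // Hypothetical helper mirroring the "table.column" scheme described above.
    static String stateKey(String table, String column) {
        return table + "." + column;
    }

    // Simulates two shards reporting a max value for the same table/column.
    static Map<String, String> simulate() {
        Map<String, String> state = new HashMap<>();

        // Shard A: users.id has been fetched up to 500.
        state.put(stateKey("users", "id"), "500");

        // Shard B: same table and column, different database, fetched up to 120.
        // The key is identical, so shard A's entry is overwritten.
        state.put(stateKey("users", "id"), "120");

        return state;
    }

    public static void main(String[] args) {
        System.out.println(simulate()); // prints {users.id=120}
    }
}
```

Only one entry survives, so the next fetch against shard A would start from 
120 rather than 500.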

If possible/prudent, the fully-qualified state name should include 
database/catalog/schema information such that the state entries are unique at a 
database level. This may prove difficult, as different drivers may or may not 
make such information available. For example, sometimes the database name is 
the schema, sometimes it is the catalog, etc. As long as the names are unique 
at the database level, there should be no conflicts. This may lead to a more 
complicated naming scheme, and backwards compatibility should be maintained 
(unless this is implemented for a major release).
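
One possible shape for such a naming scheme is sketched below. This is an 
assumption for discussion, not an agreed design: the class QualifiedStateKey 
and its methods are hypothetical, and the catalog/schema values are assumed to 
come from something like java.sql.Connection.getCatalog() and getSchema(), 
either of which may return null depending on the driver. The lookup falls back 
to the legacy "table.column" key so that existing state is still honored.

```java
import java.util.HashMap;
import java.util.Map;

public class QualifiedStateKey {

    // Builds "[catalog.][schema.]table.column", skipping any part the driver
    // does not report (null or empty).
    static String qualifiedKey(String catalog, String schema, String table, String column) {
        StringBuilder sb = new StringBuilder();
        if (catalog != null && !catalog.isEmpty()) {
            sb.append(catalog).append('.');
        }
        if (schema != null && !schema.isEmpty()) {
            sb.append(schema).append('.');
        }
        return sb.append(table).append('.').append(column).toString();
    }

    // Prefers the qualified entry; falls back to the legacy "table.column"
    // entry so that state written by earlier versions is not lost.
    static String lookup(Map<String, String> state, String catalog, String schema,
                         String table, String column) {
        String value = state.get(qualifiedKey(catalog, schema, table, column));
        return value != null ? value : state.get(table + "." + column);
    }

    public static void main(String[] args) {
        Map<String, String> state = new HashMap<>();
        state.put("users.id", "500");        // legacy entry
        state.put("shard1.users.id", "700"); // qualified entry for shard1

        System.out.println(lookup(state, "shard1", null, "users", "id")); // prints 700
        System.out.println(lookup(state, "shard2", null, "users", "id")); // prints 500
    }
}
```

Note that the legacy fallback means a database with no qualified entry yet 
picks up the old shared value, which is the trade-off of keeping backwards 
compatibility. A catalog-only name and a schema-only name with the same text 
also map to the same key, which is acceptable only if names are unique at the 
database level.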



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
