[jira] [Updated] (NIFI-10379) FechGoogleDrive record-based input handling improvement

Tamas Palfy (Jira) Tue, 23 Aug 2022 11:36:07 -0700


     [ 
https://issues.apache.org/jira/browse/NIFI-10379?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]


Tamas Palfy updated NIFI-10379:
-------------------------------
    Description: 
*FechGoogleDrive* can read input from flowfile contents as records.
However the following cases are not handled correctly:
 # There is no property on the processor to configure what field in the records 
are to be used as file identifiers (based on which fetch is executed). Instead 
it is a hardcoded string value.
*Edit:* Upon further consideration the record-based approach poses technical 
challenges if we wanted to be properly customizable.
Instead we will consider this to mainly support the output of the 
ListGoogleDrive processor. Still can be used from differenc sources but the 
expected schema is pre-defined. 
 # It is possible for the processor to fetch successfully some files but later 
experience a general error which sends the incoming flowfile to an error 
relationship. This is a mixed result (some records succeed but then _all 
records_ fail) and should not happen. Either all records should be processed 
(resulting one flowfile for each sent to a success or an error relationship) or 
none of them (resulting only the incoming flowfile being sent to another error 
relationship).
 # _session.xxx(flowFile)_ calls are used as if those were _viod_ (and operated 
on the incoming flowFile). Should use the returned flowfile instead.

  was:
*FechGoogleDrive* can read input from flowfile contents as records.
However the following cases are not handled correctly:
 # There is no property on the processor to configure what field in the records 
are to be used as file identifiers (based on which fetch is executed). Instead 
it is a hardcoded string value.
 # It is possible for the processor to fetch successfully some files but later 
experience a general error which sends the incoming flowfile to an error 
relationship. This is a mixed result (some records succeed but then _all 
records_ fail) and should not happen. Either all records should be processed 
(resulting one flowfile for each sent to a success or an error relationship) or 
none of them (resulting only the incoming flowfile being sent to another error 
relationship).
 # _session.xxx(flowFile)_ calls are used as if those were _viod_ (and operated 
on the incoming flowFile). Should use the returned flowfile instead.


> FechGoogleDrive record-based input handling improvement
> -------------------------------------------------------
>
>                 Key: NIFI-10379
>                 URL: https://issues.apache.org/jira/browse/NIFI-10379
>             Project: Apache NiFi
>          Issue Type: Bug
>            Reporter: Tamas Palfy
>            Assignee: Tamas Palfy
>            Priority: Major
>
> *FechGoogleDrive* can read input from flowfile contents as records.
> However the following cases are not handled correctly:
>  # There is no property on the processor to configure what field in the 
> records are to be used as file identifiers (based on which fetch is 
> executed). Instead it is a hardcoded string value.
> *Edit:* Upon further consideration the record-based approach poses technical 
> challenges if we wanted to be properly customizable.
> Instead we will consider this to mainly support the output of the 
> ListGoogleDrive processor. Still can be used from differenc sources but the 
> expected schema is pre-defined. 
>  # It is possible for the processor to fetch successfully some files but 
> later experience a general error which sends the incoming flowfile to an 
> error relationship. This is a mixed result (some records succeed but then 
> _all records_ fail) and should not happen. Either all records should be 
> processed (resulting one flowfile for each sent to a success or an error 
> relationship) or none of them (resulting only the incoming flowfile being 
> sent to another error relationship).
>  # _session.xxx(flowFile)_ calls are used as if those were _viod_ (and 
> operated on the incoming flowFile). Should use the returned flowfile instead.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

[jira] [Updated] (NIFI-10379) FechGoogleDrive record-based input handling improvement

Reply via email to