Balázs Gerner created NIFI-12923:
------------------------------------

             Summary: PutHDFS to support appending avro data
                 Key: NIFI-12923
                 URL: https://issues.apache.org/jira/browse/NIFI-12923
             Project: Apache NiFi
          Issue Type: Improvement
            Reporter: Balázs Gerner
            Assignee: Balázs Gerner


The goal of this ticket is to extend the PutHDFS processor with the ability to 
append avro records. The processor already provides an option to set 'append' 
as conflict resolution strategy, but that does not work correctly in case of 
avro files, because the serialized avro file cannot be deserialized again 
(because the binary content is invalid).

Some notes about the implementation:
 * The user needs to explicitly select avro as file format and append as 
conflict resolution mode to enable 'avro append' mode, otherwise regular append 
mode will work just as before. There is no auto detection of mimetype for the 
incoming flowfile.
 * The records of the incoming flowfile and the ones in the existing avro file 
need to conform to the same avro schema, otherwise the append operation fails 
with incompatible schema.
 * The 'avro append' mode should only work when compression type is set to 
'none', if any other compression type is selected in 'avro append' mode the 
user should get a validation error.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to