[
https://issues.apache.org/jira/browse/NIFI-14020?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Lehel Boér updated NIFI-14020:
------------------------------
Fix Version/s: 2.3.0
Resolution: Fixed
Status: Resolved (was: Patch Available)
> Add Record and Demarcator support to ConsumeGCPubSub
> ----------------------------------------------------
>
> Key: NIFI-14020
> URL: https://issues.apache.org/jira/browse/NIFI-14020
> Project: Apache NiFi
> Issue Type: Improvement
> Components: Extensions
> Reporter: Pierre Villard
> Assignee: Pierre Villard
> Priority: Major
> Fix For: 2.3.0
>
> Time Spent: 1h 40m
> Remaining Estimate: 0h
>
> At the moment, ConsumeGCPubSub will generate one FlowFile per consumed
> message (the Batch Size property is only specifying the maximum number of
> messages we may pull from the subscription with one API call). This can be
> extremely inefficient.
> Similarly to the Kafka processors, we should add the option to have multiple
> Processing Strategies:
> * Flow File - which is the current behavior - where one message is one
> FlowFile and FlowFile attributes will be used to store the attributes
> associated with the message as well as some information such as message ID,
> ack ID, etc.
> * Demarcator - where messages will be appended into a single FlowFile with a
> custom demarcator between each message. In this case specific attributes
> associated to messages will be lost. This however is the most efficient
> strategy if very high throughput is required and message format is allowing
> this approach.
> * Record - where a reader and writer can be specified to process the
> messages. This is useful if we want to change message format on the fly or if
> the message format is not allowing the demarcator strategy. In addition, an
> output strategy is available with two allowable values:
> ** Value - messages are all added in the same flowfile with the specified
> writer. In this case specific attributes associated to messages will be lost.
> ** Wrapper - in this case, we are overriding the schema of the writer to
> include the metadata of the message as well as a map of its attributes.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)