mqofori commented on PR #6053:
URL: https://github.com/apache/nifi/pull/6053#issuecomment-1559705445

   > Thanks for updating and preparing the new PR @davyam. Other reviewers are 
probably in a better position to evaluate some of the functionality, but I 
provided a few comments on style and functionality.
   > 
   > At a high-level, on particular question concerns the use of JSON as the 
output format, and one message per FlowFile. For high-volume processing, the 
ability to combine multiple records in a single FlowFile allows for much higher 
throughput. As mentioned in the detailed comments, adding support for a 
RecordWriter service would also make the output much more flexible, supporting 
JSON as well as other types.
   
   By the way, I agree with this. After testing this for some time, 
enhancements can definitely be made to improve the performance and scalability 
of this processor. The current one-record-per-FlowFile approach is very slow 
for bulk operations; say delete/update say 100K+ records. Some sort of batching 
strategy here could be particularly useful for high-volume database operations, 
helping to maximize throughput and address potential performance issues as 
@exceptionfactory has mentioned. 


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to