[ 
https://issues.apache.org/jira/browse/NIFI-13796?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17884101#comment-17884101
 ] 

ASF subversion and git services commented on NIFI-13796:
--------------------------------------------------------

Commit 457d83ef84e683a59a8502dd33daed34be1485e4 in nifi's branch 
refs/heads/main from Mark Payne
[ https://gitbox.apache.org/repos/asf?p=nifi.git;h=457d83ef84 ]

NIFI-13796: When writing to a content claim and its size + offset exceeds the 
max appendable size, do not write to that content claim again
This closes #9306.

Signed-off-by: Joseph Witt <[email protected]>


> Improve efficiency of FlowFile content slab allocation
> ------------------------------------------------------
>
>                 Key: NIFI-13796
>                 URL: https://issues.apache.org/jira/browse/NIFI-13796
>             Project: Apache NiFi
>          Issue Type: Improvement
>            Reporter: Mark Payne
>            Assignee: Mark Payne
>            Priority: Major
>             Fix For: 2.0.0-M5
>
>          Time Spent: 10m
>  Remaining Estimate: 0h
>
> When we write to the content repository, we write until the we hit the 
> configurable max appendable claim size. Then we move on to another resource 
> claim in the content repository. However, this only happens inter-session, as 
> the session holds open an OutputStream to a Content Claim for efficiency 
> purposes.
> This, however, has the downside of being less efficient in terms of splitting 
> FlowFile across content claims. As a result, we can write significantly far 
> beyond the 50 KB max when writing many FlowFiles in a session (think 
> UnpackContent, Split*, etc.)
> We can improve this by checking the size of the content claim + offset when 
> we finish writing to a content claim and only adding back to our active claim 
> queue if that size is less than the max appendable size. This should improve 
> NiFi's ability to cleanup content more aggressively.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to