[ 
https://issues.apache.org/jira/browse/NIFI-15091?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kevin Doran updated NIFI-15091:
-------------------------------
    Description: 
There was recently an [interesting 
discussion|https://github.com/apache/nifi/pull/10372#issuecomment-3362326306] 
on the use of {{-XX:+UseStringDeduplication}} benefiting NiFi clusters that 
process many FlowFiles containing many FlowFile attributes, potentially with 
duplicate values.

At a glance, it seems that many NiFi use cases would benefit from enabling the 
{{UseStringDeduplication}} JVM GC arg, and there is probably little to no 
downside or risk to enabling it by default, even for use cases might not 
benefit. 

We should profile one or more representative flows in NiFi with and without 
this flag, quantify the potential savings, and, if merited, consider enabling 
this GC option by default in the JVM args of {{bootstrap.conf}} sourced by 
{{nifi.sh}}.

  was:
There was recently an [interesting 
discussion|https://github.com/apache/nifi/pull/10372#issuecomment-3362326306] 
on the use of `-XX:+UseStringDeduplication` benefiting NiFi clusters that 
process many FlowFiles containing many FlowFile attributes, potentially with 
duplicate values.

At a glance, it seems that many NiFi use cases would benefit from enabling the 
`UseStringDeduplication` JVM GC arg, and there is probably little to no 
downside or risk to enabling it by default, even for use cases might not 
benefit. 

We should profile one or more representative flows in NiFi with and without 
this flag, quantify the potential savings, and, if merited, consider enabling 
this GC option by default in the JVM args of `bootstrap.conf` sourced by 
`nifi.sh`.


> Evaluate NiFi's default JVM GC args. Consider adding UseStringDeduplication
> ---------------------------------------------------------------------------
>
>                 Key: NIFI-15091
>                 URL: https://issues.apache.org/jira/browse/NIFI-15091
>             Project: Apache NiFi
>          Issue Type: Improvement
>          Components: Configuration
>            Reporter: Kevin Doran
>            Assignee: Kevin Doran
>            Priority: Minor
>
> There was recently an [interesting 
> discussion|https://github.com/apache/nifi/pull/10372#issuecomment-3362326306] 
> on the use of {{-XX:+UseStringDeduplication}} benefiting NiFi clusters that 
> process many FlowFiles containing many FlowFile attributes, potentially with 
> duplicate values.
> At a glance, it seems that many NiFi use cases would benefit from enabling 
> the {{UseStringDeduplication}} JVM GC arg, and there is probably little to no 
> downside or risk to enabling it by default, even for use cases might not 
> benefit. 
> We should profile one or more representative flows in NiFi with and without 
> this flag, quantify the potential savings, and, if merited, consider enabling 
> this GC option by default in the JVM args of {{bootstrap.conf}} sourced by 
> {{nifi.sh}}.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to