I really want to replace Flume with NiFi, so for the simplest use case I basically have Kafka -> UpdateAttribute -> MergeContent (32 -> 64 -> 128 MB) -> PutHDFS.
I need to run in cluster mode to get the throughput I need, but I run into the problem that flowfiles assigned to a node are only merged on that node, which effectively divides my merge efficiency by the number of NiFi nodes. Is there a workaround for this issue?
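
To illustrate what I mean by the merge efficiency being divided, here is a rough back-of-the-envelope sketch. The 32 MB/s total ingest rate and the node counts are made-up numbers, and it assumes the Kafka traffic is spread evenly across the nodes, which may not hold in practice. The function name is just for illustration.

```python
def seconds_to_fill_bin(bin_mb, total_mb_per_s, nodes):
    """Time for one node to accumulate a full merge bin.

    Assumption: the total ingest rate is split evenly across nodes,
    and each node can only merge the flowfiles it received itself.
    """
    # Each node sees total_mb_per_s / nodes, so the fill time grows
    # linearly with the number of nodes.
    return bin_mb * nodes / total_mb_per_s


if __name__ == "__main__":
    # Hypothetical numbers: 128 MB target bin, 32 MB/s total ingest.
    for nodes in (1, 2, 4, 8):
        t = seconds_to_fill_bin(128, 32.0, nodes)
        print(f"{nodes} node(s): {t:.1f} s to fill a 128 MB bin")
```

So with a fixed maximum bin age, adding nodes means each bin is more likely to time out before it reaches the target size, and the files landing in HDFS get smaller.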
