Mark Payne created NIFI-2925:
--------------------------------

             Summary: FlowFiles that are swapped out are never released from Content Repository
                 Key: NIFI-2925
                 URL: https://issues.apache.org/jira/browse/NIFI-2925
             Project: Apache NiFi
          Issue Type: Bug
          Components: Core Framework
    Affects Versions: 1.0.0
            Reporter: Mark Payne
            Assignee: Mark Payne
            Priority: Blocker
             Fix For: 1.1.0


To reproduce this, I created a simple flow: GenerateFlowFile (1 KB file size) 
with success going to 2 different UpdateAttribute processors (so that the same 
Content Claim is held by 2 different FlowFiles). I let about 150,000 FlowFiles 
queue up (with backpressure turned off). I then started one of the 
UpdateAttribute processors, which drained its queue. I then checked my 
content repo for any files not archived:

{code}
content_repository $ find . -type f | grep -v archive | wc -l
     192
{code}

After a few minutes, the FlowFile repo is checkpointed, which results in 
content being cleaned up where possible. The above command showed the same 
result (expected, since the FlowFiles were still held). I then emptied the 
queue. After the FlowFile repo checkpointed again, I should have seen nothing 
in the content repo outside of archive, but I saw:

{code}
content_repository $ find . -type f | grep -v archive | wc -l
     167
{code}

I see the same thing happening if I turn on expiration to remove the FlowFiles 
instead of clicking Empty Queue, or if a processor runs and completes the 
processing of the data.
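
For anyone repeating the check after each checkpoint, here is a minimal Python 
sketch of the same count as the find pipeline above. The function name 
count_unarchived is hypothetical and is not part of NiFi. Like grep -v archive, 
it skips any file whose path contains the string "archive".

```python
import os


def count_unarchived(repo_root):
    """Count regular files under repo_root whose relative path lacks 'archive'.

    Mirrors: find . -type f | grep -v archive | wc -l
    (count_unarchived is a hypothetical helper name, not a NiFi API.)
    """
    count = 0
    for dirpath, _dirnames, filenames in os.walk(repo_root):
        for name in filenames:
            full_path = os.path.join(dirpath, name)
            # find -type f matches regular files only, so skip symlinks.
            if os.path.islink(full_path) or not os.path.isfile(full_path):
                continue
            rel_path = os.path.relpath(full_path, repo_root)
            # grep -v archive drops any line containing the substring.
            if "archive" not in rel_path:
                count += 1
    return count
```

Calling count_unarchived(".") from inside content_repository should give the 
same number as the shell command.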



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
