Any ideas on how to debug this further?

I know very little about the internals of NiFi but there are obviously
still references to that content and it shouldn't have been deleted. Can
you think of a way I could have done this by accident?

On Tue, Feb 16, 2016 at 10:35 PM, Joe Witt <[email protected]> wrote:

> Interesting.  What that suggests is the content has been removed from
> the content repo itself.
>
> Thanks
> Joe
>
> On Tue, Feb 16, 2016 at 4:15 PM, Lars Francke <[email protected]>
> wrote:
> > I attached a debugger and checked a few of those FlowFiles that failed
> and
> > searched the logs for those. This is what I found:
> >
> > 2016-02-16 18:28:35,953 INFO [main]
> o.a.n.c.repository.FileSystemRepository
> > Found unknown file
> > /Users/lars/Downloads/nifi-0.5.0/content_repository/103/14556368398
> > 47-103 (1058303 bytes) in File System Repository; archiving file
> >
> > 2016-02-16 18:42:54,840 WARN [Timer-Driven Process Thread-9]
> > o.a.n.c.r.WriteAheadFlowFileRepository Repository Record
> >
> StandardRepositoryRecord[UpdateType=CONTENTMISSING,Record=StandardFlowFileRecord[uuid=af69ca83-fc03-41f0-91e1-e3d65da54840,claim=StandardContentClaim
> > [resourceClaim=StandardResourceClaim[id=1455636632024-102,
> > container=default, section=102], offset=661978,
> > length=10],offset=0,name=69321836993544,size=10]] is marked to be
> aborted;
> > it will be persisted in the FlowFileRepository as a DELETE record
> >
> > Now I can't remember having done this but it's entirely possible that I
> > restarted NiFi prior to my experiment described above.
> >
> >
> > On Tue, Feb 16, 2016 at 9:16 PM, Joe Witt <[email protected]> wrote:
> >
> >> Lars,
> >>
> >> Definitely look forward to understanding the mechanics here a bit
> >> better of what you're seeing and if you can provide something
> >> reproducible.  Even if you have a custom processor the API/Process
> >> Session construct should protect from many of the things that can go
> >> wrong there.  Now the content repo will likely be large empty as the
> >> data represents on 888KB of data and it is probably in a relative
> >> small number of files on disk.
> >>
> >> Thanks
> >> joe
> >>
> >> On Tue, Feb 16, 2016 at 2:57 PM, Lars Francke <[email protected]>
> >> wrote:
> >> > Hi Matt,
> >> >
> >> > thanks for the quick response. It's late here so I'll try reproducing
> >> > tomorrow.
> >> >
> >> > Source and destination processors are custom processors.
> >> > This is Nifi 0.5.0 RC3
> >> >
> >> > NiFi thinks all FlowFiles are still there: <http://imgur.com/isDlRk4>
> >> >
> >> > I'm looking at logs now no ERRORs or WARN that seem suspicious so far
> >> >
> >> > On Tue, Feb 16, 2016 at 8:46 PM, Matthew Clarke <
> >> [email protected]>
> >> > wrote:
> >> >
> >> >> Lars,
> >> >>       What version of NiFi are you running?
> >> >>       What type of processor was your source processor?
> >> >>       What type of processor was the destination processor?
> >> >>       I tried reproducing using a GenerateFlowFile to produce ~100k
> >> >> Flowfiles on a connection to an UpdateAttribute processor. I then
> >> stopped
> >> >> the GenerateFlowFile processor , added a funnel, and moved the
> >> connection.
> >> >> I also added another processor feeding that same funnel and routed
> the
> >> >> connection from the funnel back to the UpdateAttribute processor.
> The
> >> >> files moved as expected through the funnnel.
> >> >>
> >> >>       Can you reproduce?   Any other errors in your app log from
> prior
> >> to
> >> >> completing the connection?
> >> >>
> >> >> Thanks,
> >> >> Matt
> >> >>
> >> >> On Tue, Feb 16, 2016 at 1:15 PM, Lars Francke <
> [email protected]>
> >> >> wrote:
> >> >>
> >> >> > Hi,
> >> >> >
> >> >> > I'm trying to understand what happened and how I can prevent this
> in
> >> the
> >> >> > future.
> >> >> >
> >> >> > The outcome seems to be that all my FlowFiles which were sitting
> in a
> >> >> > connection have been deleted from disk.
> >> >> >
> >> >> > I had a flow with two processors connected via a single connection.
> >> >> >
> >> >> > What I did:
> >> >> > * Stop all Processors
> >> >> > * Add a Funnel
> >> >> > * Add a Processor
> >> >> > * Move destination end of existing connection to funnel (with the
> >> >> existing
> >> >> > FlowFiles)
> >> >> > * Connect new Processor to Funnel
> >> >> > * Connect Funnel to old destination Processor
> >> >> >
> >> >> > The connection between the Funnel and the Destination processor
> still
> >> >> shows
> >> >> > all 90k FlowFiles but the Processor fails on session.read with a
> >> >> > MissingFlowFileException.
> >> >> >
> >> >> > Sure enough my content_repository is mostly empty too.
> >> >> >
> >> >> > Now this isn't so bad because it's only a dev environment but I'd
> >> like to
> >> >> > understand how this could happen. Did I do something wrong?
> >> >> >
> >> >> > Any hints on what to search for in the logs or which place in the
> >> source
> >> >> > code to look?
> >> >> >
> >> >> > Cheers,
> >> >> > Lars
> >> >> >
> >> >>
> >>
>

Reply via email to