Just a reminder, I just today noticed the "archive.enabled" option was false and changed it to true.
$ find . -type f -ls | grep archive | wc -l 0 On Wed, Jun 15, 2016 at 4:53 PM, Mark Payne <marka...@hotmail.com> wrote: > OK, thanks. It doesn't appear that it believes there is anything to > reclaim. > > Can you try going to your content repository and running: > > find . -type f -ls | grep archive > > Curious as to how much data it has archived. > > > On Jun 15, 2016, at 4:48 PM, Ricky Saltzer <ri...@cloudera.com> wrote: > > > > Oh sorry! Trying again > > > > [1] > > > https://gist.githubusercontent.com/rickysaltzer/b00196a3881c052df9b38b418722cd02/raw/279a1bc8c60530426732eb7b653de1f3f74574e2/gistfile1.txt > > > > > > On Wed, Jun 15, 2016 at 4:38 PM, Ricky Saltzer <ri...@cloudera.com> > wrote: > > > >> I should also mention, I just realized that our worker nodes are on > 0.5.1, > >> and for some reason I missed updating the master from 0.4.0. I'm sure > that > >> is not helping. > >> > >> On Wed, Jun 15, 2016 at 4:36 PM, Ricky Saltzer <ri...@cloudera.com> > wrote: > >> > >>> Looks like the threads are parked and waiting [1] > >>> > >>> [1] > >>> > http://github.mtv.cloudera.com/gist/ricky/7a5d89f2eeba58e2206d/raw/0e2b446ca049a8b5f27298c700ac709772d2847c/gistfile1.txt > >>> > >>> On Wed, Jun 15, 2016 at 4:33 PM, Joe Witt <joe.w...@gmail.com> wrote: > >>> > >>>> thanks Ricky - then please take a look at mark's note as that is > >>>> probably more relevant to your case. > >>>> > >>>> On Wed, Jun 15, 2016 at 4:32 PM, Ricky Saltzer <ri...@cloudera.com> > >>>> wrote: > >>>>> Hey Joe - > >>>>> > >>>>> The NiFi web UI currently reads as: > >>>>> > >>>>> Active threads: 3 > >>>>> Queued: 10,173 / 0 bytes > >>>>> Connected nodes: 2 / 2 > >>>>> Stats last refreshed: 13:31:28 PDT > >>>>> > >>>>> > >>>>> On Wed, Jun 15, 2016 at 4:29 PM, Joe Witt <joe.w...@gmail.com> > wrote: > >>>>> > >>>>>> And the data remains? If so that is an interesting data point I > >>>>>> think. So to mark's point how much data do you have queued up > >>>>>> actively in the flow then on that nodes? Number of objects you > >>>>>> mention is 3273 files corresponding to 825GB in the content > >>>>>> repository. Does NiFi see those 825GB worth of data as being in the > >>>>>> flow/queued up? And then if that is the case are we talking about a > >>>>>> roughly 1TB repo and so the reported value seems correct and this is > >>>>>> simply a case of queueing near to the limit your system can hold? > >>>>>> > >>>>>> On Wed, Jun 15, 2016 at 4:24 PM, Ricky Saltzer <ri...@cloudera.com> > >>>> wrote: > >>>>>>> I have two nodes in clustered mode. I have the other node that > isn't > >>>>>>> filling up as my primary. I've actually already restarted nifi on > >>>> the > >>>>>> node > >>>>>>> which has the large repository a few times. > >>>>>>> > >>>>>>> On Wed, Jun 15, 2016 at 4:22 PM, Joe Witt <joe.w...@gmail.com> > >>>> wrote: > >>>>>>> > >>>>>>>> Ricky, > >>>>>>>> > >>>>>>>> If you restart nifi and then find that it cleans those things up I > >>>>>>>> believe then it is related to the defects corrected in the 0.5/0.6 > >>>>>>>> timeframe. > >>>>>>>> > >>>>>>>> Is restarting an option for you at this time. You agree mark? > >>>>>>>> > >>>>>>>> Thanks > >>>>>>>> Joe > >>>>>>>> > >>>>>>>> On Wed, Jun 15, 2016 at 4:21 PM, Ricky Saltzer < > ri...@cloudera.com > >>>>> > >>>>>> wrote: > >>>>>>>>> Hey Mark - > >>>>>>>>> > >>>>>>>>> Thanks for the quick reply! This is our production system so it's > >>>>>>>>> unfortunately running 0.4.0. There are currently 3273 files, > >>>> with some > >>>>>>>>> files dating back to May 18th. The content repository itself is > >>>> 825G. > >>>>>>>>> > >>>>>>>>> Ricky > >>>>>>>>> > >>>>>>>>> On Wed, Jun 15, 2016 at 4:17 PM, Mark Payne < > >>>> marka...@hotmail.com> > >>>>>>>> wrote: > >>>>>>>>> > >>>>>>>>>> Hey Ricky > >>>>>>>>>> > >>>>>>>>>> The reclaim process is pretty much continuous. What version of > >>>> NiFi > >>>>>> are > >>>>>>>>>> you running? > >>>>>>>>>> I know there was an issue with this a while back that caused it > >>>> not > >>>>>> to > >>>>>>>>>> cleanup properly. > >>>>>>>>>> > >>>>>>>>>> Also, how much data & how many FlowFiles do you have queued up > >>>> in > >>>>>> your > >>>>>>>>>> flow? > >>>>>>>>>> Data won't be archived or reclaimed if in the flow. > >>>>>>>>>> > >>>>>>>>>> Thanks > >>>>>>>>>> -Mark > >>>>>>>>>> > >>>>>>>>>> > >>>>>>>>>> > >>>>>>>>>>> On Jun 15, 2016, at 4:04 PM, Ricky Saltzer < > >>>> ri...@cloudera.com> > >>>>>>>> wrote: > >>>>>>>>>>> > >>>>>>>>>>> Hey guys - > >>>>>>>>>>> > >>>>>>>>>>> I recently discovered I didn't have my "archive.enabled" > >>>> option > >>>>>> set to > >>>>>>>>>> true > >>>>>>>>>>> after my disk filled up to 95%. I enabled it and then set the > >>>>>>>> retention > >>>>>>>>>>> period to 12 hours and 50% (default values). However, after > >>>>>> restarting > >>>>>>>>>>> NiFi, I am not seeing any disk space reclaimed. > >>>>>>>>>>> > >>>>>>>>>>> I'm curious, is the reclaiming process periodic or continuous? > >>>>>>>>>>> > >>>>>>>>>>> --- > >>>>>>>>>>> ricky > >>>>>>>>>> > >>>>>>>>>> > >>>>>>>>> > >>>>>>>>> > >>>>>>>>> -- > >>>>>>>>> Ricky Saltzer > >>>>>>>>> http://www.cloudera.com > >>>>>>>> > >>>>>>> > >>>>>>> > >>>>>>> > >>>>>>> -- > >>>>>>> Ricky Saltzer > >>>>>>> http://www.cloudera.com > >>>>>> > >>>>> > >>>>> > >>>>> > >>>>> -- > >>>>> Ricky Saltzer > >>>>> http://www.cloudera.com > >>>> > >>> > >>> > >>> > >>> -- > >>> Ricky Saltzer > >>> http://www.cloudera.com > >>> > >>> > >> > >> > >> -- > >> Ricky Saltzer > >> http://www.cloudera.com > >> > >> > > > > > > -- > > Ricky Saltzer > > http://www.cloudera.com > > -- Ricky Saltzer http://www.cloudera.com