Thanks Erick and Michael for the prompt responses. Cheers, Niran
>________________________________ > From: Erick Erickson <erickerick...@gmail.com> >To: solr-user@lucene.apache.org >Sent: Monday, March 25, 2013 10:21 AM >Subject: Re: Tlog File not removed after hard commit > >The tlogs will stay there to provide "peer synch" on the last 100 docs. Say >a node somehow gets out of synch. There are two options >1> replay from the log >2> replicate the entire index. > >To avoid <2> if possible, the tlog is kept around. In your case, all your >data is put in the tlog file, so the "keep the last 100 docs available" >rule means you'll keep the entire log for the run around until the _next_ >run completes, at which point I'd expect the oldest one to be deleted. > >Best >Erick > > >On Mon, Mar 25, 2013 at 8:40 AM, Michael Della Bitta < >michael.della.bi...@appinions.com> wrote: > >> My understanding is that logs stick around for a while just in case they >> can be used to catch up a shard that rejoins the cluster. >> On Mar 24, 2013 12:03 PM, "Niran Fajemisin" <afa...@yahoo.com> wrote: >> >> > Hi all, >> > >> > We import about 1.5 million documents on a nightly basis using DIH. >> During >> > this time, we need to ensure that all documents make it into index >> > otherwise rollback on any errors; which DIH takes care of for us. We also >> > disable autoCommit in DIH but instruct it to commit at the very end of >> the >> > import. This is all done through configuration of the DIH config XML file >> > and the command issued to the request handler. >> > >> > We have noticed that the tlog file appears to linger around even after >> DIH >> > has issued the hard commit. My expectation would be that after the hard >> > commit has occurred, the tlog file will be removed. I'm obviously >> > misunderstanding how this all works. >> > >> > Can someone please help me understand how this is meant to function? >> > Thanks! >> > >> > -Niran >> > > >