I usually keep the segments as long as I can keep and periodically delete them It basically depends on your own needs
On Sun, May 4, 2014 at 1:41 PM, chethan <[email protected]> wrote: > Well, the only case where you would want to retain them is if you ever need > to index the same data again to Solr without having to crawl them. So > automating the deletion part is risky unless you're sure the indexing has > gone right. > > Regards, > > -- > Chethan Prasad > > > On Sun, May 4, 2014 at 12:13 AM, John Lafitte <[email protected] > >wrote: > > > What would be the case where you would want to keep the segments? I'm > > considering automatically deleting them after sending the data to solr > > On May 3, 2014 2:29 AM, "chethan" <[email protected]> wrote: > > > > > Thanks for your reply! > > > > > > Regards, > > > > > > -- > > > Chethan Prasad > > > > > > > > > On Sat, May 3, 2014 at 12:22 PM, remi tassing <[email protected]> > > > wrote: > > > > > > > you are correct > > > > > > > > > > > > On Fri, May 2, 2014 at 7:46 PM, chethan <[email protected]> > > wrote: > > > > > > > > > Hi, > > > > > > > > > > I have a Nutch crawl with 4 segments which are fully indexed using > > the > > > > > bin/nutch > > > > > solrindexcommand. Now I'm all out of storage on the box, so can I > > > delete > > > > > the 4 segments and retain only the crawldb and continue crawling > from > > > > where > > > > > I left it? > > > > > > > > > > Since all the segments are merged and indexed to Solr I don't see a > > > > problem > > > > > in deleting the segments, or am I wrong there? > > > > > > > > > > Regards, > > > > > > > > > > -- > > > > > Chethan Prasad > > > > > > > > > > > > > > >

