me also. great book, just wanted a bit more on complex DIH :) On Oct 14, 2010, at 10:38 AM, Jason Brown wrote:
> Not related to the opening thread - but wante to thank Eric for his book. > Clarified a lot of stuff and very useful. > > > -----Original Message----- > From: Eric Pugh [mailto:ep...@opensourceconnections.com] > Sent: Thu 14/10/2010 15:34 > To: solr-user@lucene.apache.org > Subject: Re: What is the maximum number of documents that can be indexed ? > > I would recommend looking at the work the HathiTrust has done. They have > published some really great blog articles about the work they have done in > scaling Solr, and have put in huge amounts of data. > > The good news is that there isn't a exact number, because "It depends". The > bad news is that there isn't an exact number because "it depends"! > > Eric > > > > On Oct 13, 2010, at 8:58 PM, Otis Gospodnetic wrote: > >> Marco (use solr-u...@lucene list to follow up, please), >> >> There are no precise answers to such questions. Solr can keep indexing. >> The >> limit is, I think, the available disk space. I've never pushed Solr or >> Lucene >> to the point where Lucene index segments would become a serious pain, but >> even >> that can be controlled. Same thing with number of open files, large file >> support, etc. >> >> Otis >> ---- >> Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch >> Lucene ecosystem search :: http://search-lucene.com/ >> >> >>> >>> From: Marco Ciaramella <ciaramellama...@gmail.com> >>> To: d...@lucene.apache.org >>> Sent: Wed, October 13, 2010 6:19:15 PM >>> Subject: What is the maximum number of documents that can be indexed ? >>> >>> Hi all, >>> I am working on a performance specification document on a Solr/Lucene-based >>> application; this document is intended for the final customer. My question >>> is: >>> what is the maximum number of document I can index assuming 10 or 20kbytes >>> for >>> each document? >>> >>> >>> I could not find a precise answer to this question, and I tend to consider >>> that >>> Solr index can be virtually limited only by the JVM, the Operating System >>> (limits to large file support), or by hardware constraints (mainly RAM, >>> etc. ... >>> ). >>> >>> >>> Thanks >>> Marco >>> >>> >>> > > ----------------------------------------------------- > Eric Pugh | Principal | OpenSource Connections, LLC | 434.466.1467 | > http://www.opensourceconnections.com > Co-Author: Solr 1.4 Enterprise Search Server available from > http://www.packtpub.com/solr-1-4-enterprise-search-server > Free/Busy: http://tinyurl.com/eric-cal > > > > > > > > > > > If you wish to view the St. James's Place email disclaimer, please use the > link below > > http://www.sjp.co.uk/portal/internet/SJPemaildisclaimer