Hi, If you are not familiar with information retrieval, I would recommend getting started with learning about it. For example, this book is available for free and is really excellent: http://nlp.stanford.edu/IR-book/. Trust me, you're not wasting time if you read it.
More specifically to Lucene, the LucenePapers[1] wiki page has resources to papers that describe with a good level of details some internal features of Lucene, such as document writers per thread or the trie-structured index for efficient numeric range queries. Otherwise you could look for videos of talks at Lucene revolution, Berlin Buzzword or ApacheCon, they sometimes have interesting talks about Lucene internals. Here are a few ones: - Automata invasion http://vimeo.com/42163249 - Query suggestion with Lucene http://www.youtube.com/watch?v=XrssyUPQcG4 - Facets http://www.youtube.com/watch?v=-CNZxkAMcKk [1] http://wiki.apache.org/lucene-java/LucenePapers On Fri, Jan 10, 2014 at 7:06 AM, VIGNESH S <[email protected]> wrote: > Hi, > > I have basic knowledge on Lucene 4 code. > > I am interested in understanding the internals of Lucene and contribute. > > Is there any document or something which explains the internal architecture > of Lucene 4 like when checkpoints happen or something.. > > I am currently going through the code to understand but it takes lot of > time. > > It will be really helpful if someone helps me with some documents or slides > which explains me the internal architecture of lucene > > -- > Thanks and Regards > Vignesh Srinivasan > 9739135640 -- Adrien --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
