Hello, I have been working with Lucene and Solr for quite some time and have a good understanding of a lot of moving parts at the code level. However I wish to learn Lucene internals from the ground up and want to familiarize myself with all the dirty details. I would like to know what would be the best way to go about it.
To kick things off, I have been thinking about picking up “Lucene in Action”, but have been hesitant (and possibly wrongly) since it is based on Lucene 3.0 and we have come a long way since then. To give an example of the level of detail I wish to learn (among other things) would be what parts of a segment (.tim, .tip, etc) get loaded in memory at search time, which part uses finite state machines and why, etc I would really appreciate any thoughts/inputs on how I can go about this. Thanks in advance! Regards, Rahul