"At the base, Solr indexes are Lucene indexes, so one can always drop down to that level."
That's what I'm looking for. I understand, that at the end, there has to be an inverse index (or rather multiple of them), holding all "words" which occurre in my documents, each "word" having a list of documents the "word" was part of. I would like to do some statistics based on this information, would like to analyze how it changes if I change my text processing settings, ... If you would give me a starting point like "Data is stored in Lucene indexes, which are documented at XXX. In a request handler you can access the indexes via YYY.", I would be perfectly happy figuring out the rest on my own. Documentation about 4.0 is a bit limited, so it's hard to find an entry point. cheers, Achim Am 11.01.2013 um 20:54 schrieb Gora Mohanty: > On 12 January 2013 01:06, Achim Domma <do...@procoders.net> wrote: >> >> Hi, >> >> I have just setup my first Solr 4.0 instance and have added about one >> million documents. I would like to access the raw data stored in the index. >> Can somebody give me a starting point how to do that? >> >> As a first step, a simple dump would be absolutely ok. I just want to play >> around and do some static offline analysis. In the long term, I probably >> would like to implement custom search components to enrich my search >> results. So if there's no export for raw data, I would be happy to learn how >> to implement custom handlers and/or search components. Some guidance where >> to start would be very appreciated. > > It is not clear what you mean by "raw data", and what level of > customisation you are after. Here are two possibilities: > * At the base, Solr indexes are Lucene indexes, so one can always > drop down to that level. > * Also, Solr allows plugins for various components. This link might > be of help, depending on the extent of customisation you are after: > http://wiki.apache.org/solr/SolrPlugins > > Maybe you should approach this from the other end: If you could > describe what you are trying to achieve, people might be able to > offer possibilities. > > Regards, > Gora