Hi,

We currently are storing all of our data in sql database and use solr for indexing. We get a list of id's from solr and retrieve the data from the db.

We are considering storing all the data in solr to simplify administration and remove any synchronisation and are considering the following:

1. storing the data in individual fields in solr (indexed=true, store=true)
2. storing the data in a serialized form in a binary field in solr (using google proto buffers or similar) and keep the rest of the solr fields as indexed=true, stored=*false*. 3. keep as is. data stored in db and just keep solr fields as indexed=true, stored=false

Can anyone provide some advice in terms of performance of the different approaches. Are there any obvious pitfalls to option 1 and 2 that i need to be mindful of?

I am thinking option 2 would be the fastest as it would be reading the data in one contiguous block. Will be doing some preformance test to verify this soon.

FYI we are looking at 5-10M records, a serialised object is 500 to 1000 bytes and we index approx 20 fields.

Thanks for any advice.
andre

Reply via email to