Hi, I am relatively new to solr and evaluating it for my project. I would have lots of data coming in at a fast rate (say 10 MB per sec) and I would need the recent data (last 24 hours, or last 100GB) to be searchable faster than the old data. I did a bit of reading on the controls provided by solr and came across the concept of mergeFactor (defaults to 10) - this means solr merges every 10 segments into one.
However, I need something like this - 1. Keep each of last 24 hours segments separate. 2. Segments generated between last 48 to 24 hours to be merged into one. Similarly, for segments created between 72 to 48 hours and so on for last 1 week. 3. Similarly, merge previous 4 week's data into one segment each week. 4. Merge all previous months data into one segment each month. I am not sure if there is a configuration possible in solr application. If not, are there APIs which will allow me to do this? Also, I want to understand how solr stores data or does it have a dependency on the way data is stored. Since the volumes are high, it would be great if the data is compressed and stored (while still searchable). If it is possible, I would like to know what kind of compression does solr do? Thank you for the responses. Regards, Vinay