Re: Solr takes time to warm up core with huge data

2020-06-09 Thread Erick Erickson
I’d ignore the form of the query for the present, I think that’s a red herring. Start by taking all your sort clauses off. Then add them back one by one (you have to restart Solr between these experiments). My bet: your problem is “uninverting” and you’ll see your startup speed get worse the
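
The experiment described above can be scripted so each run differs only in its sort clauses. This is a minimal sketch, not the poster's setup: the host, port and core name (`mycore`) are assumptions, the single filter is taken from the query quoted elsewhere in the thread, and only the first four sort clauses are used. Solr still has to be restarted by hand between runs.

```python
from urllib.parse import urlencode

# Hypothetical endpoint: the thread does not name the host or the core.
BASE_URL = "http://localhost:8983/solr/mycore/select"

# First few sort clauses from the query quoted in the thread.
SORT_CLAUSES = [
    "MODIFY_TS desc",
    "LOGICAL_SECT_NAME asc",
    "TRACK_ID desc",
    "TRACK_INTER_ID asc",
]


def experiment_urls(base_url, sort_clauses):
    """Return one URL per experiment: no sort first, then one more clause each time."""
    urls = []
    for n in range(len(sort_clauses) + 1):
        params = [("q", "*:*"), ("fq", "PARENT_DOC_ID:100"), ("rows", "1000")]
        if n:
            # Add the sort parameter only once at least one clause is included.
            params.append(("sort", ",".join(sort_clauses[:n])))
        urls.append(base_url + "?" + urlencode(params))
    return urls


if __name__ == "__main__":
    for url in experiment_urls(BASE_URL, SORT_CLAUSES):
        print(url)
```

Timing the first request after each restart should show which sort field, if any, makes startup slower.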

Re: Solr takes time to warm up core with huge data

2020-06-08 Thread Srinivas Kashyap
Hi Shawn, It's a vague question and I haven't tried it out yet. Can I instead mention query as below: Basically instead of q=*:*&fq=PARENT_DOC_ID:100&fq=MODIFY_TS:[1970-01-01T00:00:00Z TO *]&fq=PHY_KEY2:"HQ012206"&fq=PHY_KEY1:"BAMBOOROSE"&rows=1000&sort=MODIFY_TS desc,LOGICAL_SECT_NAME asc,TRACK_ID

Re: Solr takes time to warm up core with huge data

2020-06-08 Thread Colvin Cowie
Great, thanks Erick On Mon, 8 Jun 2020 at 13:22, Erick Erickson wrote: > It’s _bounded_ by MaxDoc/8 + (some overhead). The overhead is > both the map overhead and the representation of the query. > > This is an upper bound, the full bitset is not stored if there > are few entries that match

Re: Solr takes time to warm up core with huge data

2020-06-08 Thread Erick Erickson
It’s _bounded_ by MaxDoc/8 + (some overhead). The overhead is both the map overhead and the representation of the query. This is an upper bound, the full bitset is not stored if there are few entries that match the filter, in that case the doc IDs are stored. Consider if maxDoc is 1M and only 2
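
To put numbers on that bound, here is a rough sketch of the arithmetic for the core discussed in this thread. It assumes maxDoc equals the 497,767,038 records reported (maxDoc can be higher when deleted documents are present), ignores the per-entry overhead, and uses a filter cache size of 512 entries purely as a hypothetical value; the thread does not give the real setting.

```python
def bitset_bytes(max_doc):
    """Upper bound for one filter cache entry: one bit per document, rounded up to bytes."""
    return (max_doc + 7) // 8


def filter_cache_upper_bound(max_doc, cache_entries):
    """Worst case for the whole cache: every entry stored as a full bitset."""
    return bitset_bytes(max_doc) * cache_entries


# Record count reported in the thread; treated here as maxDoc (an assumption).
MAX_DOC = 497_767_038
# Hypothetical filterCache size; the real value from solrconfig.xml is not given.
CACHE_ENTRIES = 512

per_entry = bitset_bytes(MAX_DOC)
total = filter_cache_upper_bound(MAX_DOC, CACHE_ENTRIES)

print(f"per entry: {per_entry:,} bytes ({per_entry / 2**20:.1f} MiB)")
print(f"full cache, worst case: {total:,} bytes ({total / 2**30:.1f} GiB)")
```

Because sparse filters are stored as doc ID lists rather than full bitsets, real usage is normally below this figure; the calculation only shows how quickly a large cache size multiplies on a core this big.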

Re: Solr takes time to warm up core with huge data

2020-06-08 Thread Colvin Cowie
Sorry to hijack this a little bit. Shawn, what's the calculation for the size of the filter cache? Is that 1 bit per document in the core / shard? Thanks On Fri, 5 Jun 2020 at 17:20, Shawn Heisey wrote: > On 6/5/2020 12:17 AM, Srinivas Kashyap wrote: > >

Re: Solr takes time to warm up core with huge data

2020-06-05 Thread Shawn Heisey
On 6/5/2020 12:17 AM, Srinivas Kashyap wrote: q=*:*&fq=PARENT_DOC_ID:100&fq=MODIFY_TS:[1970-01-01T00:00:00Z TO *]&fq=PHY_KEY2:"HQ012206"&fq=PHY_KEY1:"JACK"&rows=1000&sort=MODIFY_TS desc,LOGICAL_SECT_NAME asc,TRACK_ID desc,TRACK_INTER_ID asc,PHY_KEY1 asc,PHY_KEY2 asc,PHY_KEY3 asc,PHY_KEY4 asc,PHY_KEY5 asc,PHY_KEY6

Re: Solr takes time to warm up core with huge data

2020-06-05 Thread Erick Erickson
is consuming JVM memory. So, how much memory or what > configuration should I be doing on solrconfig.xml to make it work. > > Thanks, > Srinivas > > From: Jörn Franke > Sent: 05 June 2020 12:30 > To: solr-user@lucene.apache.org > Subject: Re: Solr takes time to warm up core

RE: Solr takes time to warm up core with huge data

2020-06-05 Thread Srinivas Kashyap
simple query with filter query mentioned as shown above, it is consuming JVM memory. So, how much memory or what configuration should I be doing on solrconfig.xml to make it work. Thanks, Srinivas From: Jörn Franke Sent: 05 June 2020 12:30 To: solr-user@lucene.apache.org Subject: Re: Solr takes time

Re: Solr takes time to warm up core with huge data

2020-06-05 Thread Jörn Franke
I think DIH is the wrong solution for this. If you do an external custom load, it will probably be much faster. You have too much JVM memory from my point of view. Reduce it to eight or similar. It seems you are just exporting data, so you are better off with the export handler. Add
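
For reference, a request to Solr's export handler could be built roughly as follows. This is only a sketch: the host and core name are assumptions, the field list is illustrative, and `/export` requires both `sort` and `fl`, with every field named in them having docValues enabled. Whether the schema in this thread meets that requirement is not stated.

```python
from urllib.parse import urlencode


def export_url(core_url, query, sort, fields):
    """Build a request URL for the /export handler, which needs q, sort and fl."""
    params = {"q": query, "sort": sort, "fl": ",".join(fields)}
    return f"{core_url}/export?{urlencode(params)}"


if __name__ == "__main__":
    # Hypothetical core URL; field names are taken from the query quoted in the thread.
    url = export_url(
        "http://localhost:8983/solr/mycore",
        "PARENT_DOC_ID:100",
        "MODIFY_TS desc",
        ["TRACK_ID", "MODIFY_TS"],
    )
    print(url)
```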

RE: Solr takes time to warm up core with huge data

2020-06-05 Thread Srinivas Kashyap
Thanks Shawn, The filter queries are not complex. Below are the filter queries I’m running for the corresponding schema entry: q=*:*&fq=PARENT_DOC_ID:100&fq=MODIFY_TS:[1970-01-01T00:00:00Z TO *]&fq=PHY_KEY2:"HQ012206"&fq=PHY_KEY1:"JACK"&rows=1000&sort=MODIFY_TS desc,LOGICAL_SECT_NAME asc,TRACK_ID

Re: Solr takes time to warm up core with huge data

2020-06-04 Thread Shawn Heisey
On 6/4/2020 9:51 PM, Srinivas Kashyap wrote: We are on Solr 8.4.1 in standalone server mode. We have a core with 497,767,038 records indexed. It took around 32 hours to load data through DIH. The disk occupancy is shown below: 82G /var/solr/data//data/index When I restarted solr