On 8/17/2020 12:22 PM, Abhijit Pawar wrote:
We are indexing some 200K plus documents in SOLR 5.4.1 with no shards /
replicas and just single core.
It takes almost 3.5 hours to index that data.
I am using a data import handler to import data from the mongo database.

Is there something we can do to reduce the time taken to index?
Will upgrade to newer version help?

There's not enough information here to provide a diagnosis.

Are you running Solr in cloud mode (with zookeeper)?

3.5 hours for 200000 documents sounds like slowness with the data source, not a problem with Solr, but it's too soon to rule anything out.

Would you be able to write a program that pulls data from your mongo database but doesn't send it to Solr?  Ideally it would be a Java program using the same JDBC driver you're using with DIH.

Thanks,
Shawn

Reply via email to