You might consider local disks. I once ran Solr with the indexes on an
NFS-mounted volume and the slowdown was severe.
wunder
On Oct 16, 2013, at 7:40 AM, primoz.sk...@policija.si wrote:
I think DIH uses only one core per instance. IMHO 300 docs/sec is quite
good. If you would like to use more cores you need to use SolrJ, or maybe
run more than one DIH, with more cores of course.
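
To make the SolrJ route a bit more concrete, here is a minimal sketch of
feeding in parallel: split the documents into batches and hand them to a
small thread pool. It uses only the JDK so it runs as-is. ParallelFeeder,
BatchSender, partition and feed are hypothetical names made up for this
example, not SolrJ API. In a real feeder the send() hook is where the
SolrJ client call would go, and the batch size and thread count are
guesses you would have to tune.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;

public class ParallelFeeder {

    /** Hypothetical hook: a real feeder would send the batch to Solr here. */
    public interface BatchSender<T> {
        void send(List<T> batch) throws Exception;
    }

    /** Split docs into consecutive batches of at most batchSize elements. */
    public static <T> List<List<T>> partition(List<T> docs, int batchSize) {
        if (batchSize <= 0) {
            throw new IllegalArgumentException("batchSize must be positive");
        }
        List<List<T>> batches = new ArrayList<>();
        for (int i = 0; i < docs.size(); i += batchSize) {
            int end = Math.min(i + batchSize, docs.size());
            batches.add(new ArrayList<>(docs.subList(i, end)));
        }
        return batches;
    }

    /**
     * Send every batch through a fixed pool of worker threads.
     * Returns the number of documents handed to the sender; any
     * exception thrown by a sender is rethrown from Future.get().
     */
    public static <T> int feed(List<T> docs, int batchSize, int threads,
                               BatchSender<T> sender) throws Exception {
        ExecutorService pool = Executors.newFixedThreadPool(threads);
        try {
            List<Future<Integer>> results = new ArrayList<>();
            for (List<T> batch : partition(docs, batchSize)) {
                results.add(pool.submit(() -> {
                    sender.send(batch);   // runs on a worker thread
                    return batch.size();
                }));
            }
            int sent = 0;
            for (Future<Integer> f : results) {
                sent += f.get();          // wait for each batch to finish
            }
            return sent;
        } finally {
            pool.shutdown();
        }
    }
}
```

The sender has to be thread-safe, since several batches are in flight at
once. Note this only helps if the box really has more than one CPU. With
the single virtual CPU already at 95%, extra threads alone will not buy
much. SolrJ 4.x also has ConcurrentUpdateSolrServer, which does its own
queueing and threading, so it may be worth trying before hand-rolling a
pool like this.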
Primoz
From: Giovanni Bricconi giovanni.bricc...@banzai.it
To: solr-user solr-user@lucene.apache.org
Date: 16.10.2013 16:25
Subject: howto increase indexing speed?
I have a small Solr setup, not even on a physical machine but on a VMware
virtual machine with a single CPU that reads data using DIH from a
database. The machine has no physical disks attached but stores data on a
NetApp NAS.
Currently this machine indexes 320 documents/sec, which is not bad, but we
plan to double the index and we would like to keep the indexing time nearly
the same.
Doing some basic checks during indexing, I found with iostat that disk
usage is only about 8% and the source database is running fine, whereas
the virtual CPU is at 95%, nearly all of it spent in Solr.
Now I can quite easily add another virtual CPU to the Solr box, but as far
as I know this won't help because DIH doesn't work in parallel. Am I
wrong?
What would you do? Rewrite the feeding process, dropping DIH and using
SolrJ to feed data in parallel? Or would you instead keep DIH and switch
to a sharded configuration?
Thank you for any hints
Giovanni
--
Walter Underwood
wun...@wunderwood.org