Dear list,

Our yearly Solr statistics sharding (stats-util -s) failed this year,
apparently because our core is very large (43GiB) and a request timed out.
It failed again when I tried to run it manually:

Moving: 51633080 into core statistics-2019
...
Exception: Read timed out
java.net.SocketTimeoutException: Read timed out

As a test I used this really great tool called solr-import-export-json¹ to
export some of my 2019 statistics to JSON on the production server, then
import them into a new core in my development instance:

$ ./run.sh -s http://localhost:8081/solr/statistics -a export -o /tmp/statistics-2019-01.json -f 'dateYearMonth:2019-01' -k uid
$ curl 'http://localhost:8080/solr/admin/cores?action=CREATE&name=statistics-2019&instanceDir=/home/aorth/dspace/solr/statistics&dataDir=/home/aorth/dspace/solr/statistics-2019/data'
$ ./run.sh -s http://localhost:8080/solr/statistics-2019 -a import -o /tmp/statistics-2019-01.json -k uid

This worked brilliantly... in fact I am very impressed with this tool and
recommend it to people!
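
To confirm that an import like this is complete, the document counts in the
source and target cores can be compared. This is only a rough sketch: the
function names are made up for illustration, and it assumes the standard Solr
select handler is reachable at the same hosts and ports as in the commands
above. With rows=0, Solr returns only the numFound count and no documents.

```shell
# Hypothetical helper: build the select URL for a given core URL.
solr_select_url() {
  printf '%s/select\n' "$1"
}

# Hypothetical helper: print the JSON response for a query, with rows=0 so
# that only the match count (numFound) comes back. curl -G sends the
# URL-encoded parameters as a GET query string.
solr_count() {
  curl -s -G "$(solr_select_url "$1")" \
    --data-urlencode "q=$2" \
    --data-urlencode 'rows=0' \
    --data-urlencode 'wt=json'
}

# Example usage (not run here):
#   solr_count http://localhost:8081/solr/statistics 'dateYearMonth:2019-01'
#   solr_count http://localhost:8080/solr/statistics-2019 '*:*'
```

If the two numFound values match, the export and import moved every document
for that month.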

The problem is, this core does not get enumerated automatically by Solr
after I restart the servlet container. I got it to load by hard-coding the
core into dspace/solr/solr.xml config² but it seems hacky. How are these
core shards enumerated by DSpace's Solr application? What would cause
shards to not be loaded automatically?
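
For anyone trying to reproduce this, one way to see whether Solr has loaded
the core after a restart is the CoreAdmin STATUS action. This is a minimal
sketch with hypothetical function names, assuming the same base URL as above.
As far as I know, a core that is not loaded shows up with an empty status
entry in the response.

```shell
# Hypothetical helper: build the CoreAdmin STATUS URL for one core.
#   $1 = Solr base URL, e.g. http://localhost:8080/solr
#   $2 = core name, e.g. statistics-2019
solr_core_status_url() {
  printf '%s/admin/cores?action=STATUS&core=%s&wt=json\n' "$1" "$2"
}

# Hypothetical helper: fetch the status JSON for one core.
solr_core_status() {
  curl -s "$(solr_core_status_url "$1" "$2")"
}

# Example usage (not run here):
#   solr_core_status http://localhost:8080/solr statistics-2019
```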

My environment is DSpace 5.8 with Tomcat 7.0.99 and OpenJDK 8.

Thanks,

¹ https://github.com/freedev/solr-import-export-json
² https://cwiki.apache.org/confluence/display/solr/Solr.xml%20(supported%20through%204.x)
-- 
Alan Orth
[email protected]
https://picturingjordan.com
https://englishbulgaria.net
https://mjanja.ch
