Hi all,

We are migrating our old DSpace installation to version 5.x, and we are 
having trouble with the statistics.

In DSpace 1.6 we developed our own system to log and store visit 
statistics (it processed the logs and saved the results into MySQL), 
because one of the client's requirements was a fully customized statistics 
page (downloads and visits by university, top 10 countries, statistics by 
year since 2001, top 25 theses), and that was easier to build on MySQL 
than on Solr. For this version we decided to use the Solr statistics tools 
that ship with DSpace instead of maintaining our own product, but now I 
think that may have been a bad idea.

We are having performance problems. We have tried several things to 
improve it (larger filter and queryResult caches, moving Solr to a 
separate VM with 4 CPUs and 8 GB of RAM), but it is not enough: some pages 
take between 10 and 15 seconds to load fully. The statistics index is 
currently 10 GB and 58,054,466 documents, and not all years have been 
loaded yet.

The last thing we are thinking of testing is combining sharding by year 
with SolrCloud, spreading the year cores across two or more VMs, but we 
are not sure how this can be integrated with DSpace. Is anyone working 
with an architecture like this? Is it possible to integrate SolrCloud with 
DSpace 5.2? Do you think it would improve performance?
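
To make the idea concrete, here is a rough sketch of the kind of query we 
have in mind. It is not SolrCloud itself, only a manual distributed search 
using Solr's "shards" request parameter over one core per year. The host 
names (solr1:8983, solr2:8983), the "statistics-YYYY" core names, the 
round-robin placement, and the use of the "type" and "countryCode" fields 
are all assumptions for illustration, not something we have running.

```python
from urllib.parse import urlencode


def year_shards(hosts, first_year, last_year, core_prefix="statistics"):
    """Build the value of Solr's `shards` parameter for one core per year.

    Year cores are spread over the given hosts round-robin. The host names
    and the `<prefix>-<year>` core naming are assumptions for this sketch.
    """
    shards = []
    for i, year in enumerate(range(first_year, last_year + 1)):
        host = hosts[i % len(hosts)]
        shards.append(f"{host}/solr/{core_prefix}-{year}")
    return ",".join(shards)


def distributed_query_url(entry_host, shards, query="type:0"):
    """Build a facet query URL that fans out to every year core.

    `type` and `countryCode` are assumed field names in the statistics
    schema; rows=0 because only the facet counts are needed.
    """
    params = {
        "q": query,
        "rows": 0,
        "facet": "true",
        "facet.field": "countryCode",
        "shards": shards,
        "wt": "json",
    }
    return f"http://{entry_host}/solr/statistics/select?" + urlencode(params)


if __name__ == "__main__":
    shards = year_shards(["solr1:8983", "solr2:8983"], 2001, 2015)
    print(distributed_query_url("solr1:8983", shards))
```

The entry host would receive the request, query each year core listed in 
"shards", and merge the facet counts. What we do not know is whether 
DSpace can be configured to send its statistics queries this way.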

Thanks for your help
