Hi all, We are migrating our old dspace to 5.x version, and we have a trouble with the statistics. In DSpace 1.6 we had developed our own system to log and save the visits statistics (process logs + save into MySQL) because one of the requirements of the client was to personalize completely the statistics page (Downloads and visits by university, top10 of countries, statistics by year since 2001, top25 of thesis) and it was easier to adapt with MySQL than Solr. In this version we decided to integrate with Solr tools available with DSpace to avoid to use this own product, but now I'm thinking that was a bad idea. We are having performance problem and we tried many things to improve it (bigger cache for filter and queryResults, separate Solr into another VM with 4CPU and 8GB RAM) but it's not enough because some pages require between 10 and 15 seconds to charge fully (the amount of statistics index now, without all of the years charged, is 10GB of indexes and 58.054.466 documents). The last test that we are thinking is mixing sharding by years with a SolrCloud solution, separating year cores into 2 or more VM, but we are not sure how this can be integrated with DSpace. Someone is working with an architecture like this? It's possible to integrate SolrCloud with DSpace 5.2? Do you think this will improve the performance?
Thanks for your help -- You received this message because you are subscribed to the Google Groups "DSpace Technical Support" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To post to this group, send email to [email protected]. Visit this group at https://groups.google.com/group/dspace-tech. For more options, visit https://groups.google.com/d/optout.
