Andrew,

@mire has been working on contributing an addon for DSpace that uses
Solr as a store for usage events on the repository (this addon forms
the backend of the @mire Statistics Suite).  We have placed a basic
implementation in the dspace repository and are currently completing
the last steps in wiring it to the UsageEvent.  This has actually
grown into bringing portions of the DSpace 2.0 Event API into DSpace
1.6, so that multiple EventListeners can be attached to receive the
usage events generated by the existing implementation.

The implementation is currently organized into two projects located here:
https://scm.dspace.org/svn/repo/modules/dspace-solr/
https://scm.dspace.org/svn/repo/modules/dspace-solr-stats/

It posts usage events to Solr via a multithreaded HTTP client that
queues and processes multiple requests asynchronously against the Solr
instance.  In my opinion, this solution is separate enough from DSpace
that it may eventually be usable by other systems.
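
To illustrate the asynchronous posting idea, here is a minimal sketch
in Python.  It is not the code in dspace-solr-stats: the names
post_to_solr and AsyncEventPoster are hypothetical, and the sketch
assumes a Solr update handler that accepts JSON documents.

```python
import json
import urllib.request
from concurrent.futures import ThreadPoolExecutor


def post_to_solr(update_url, event):
    """Send one event as a JSON document.

    Assumption: update_url points at a Solr update handler that accepts JSON.
    """
    body = json.dumps([event]).encode("utf-8")
    request = urllib.request.Request(
        update_url, data=body, headers={"Content-Type": "application/json"}
    )
    with urllib.request.urlopen(request, timeout=10) as response:
        return response.status


class AsyncEventPoster:
    """Queues usage events and hands them to a pool of worker threads."""

    def __init__(self, send, workers=4):
        self._send = send  # callable that delivers a single event
        self._pool = ThreadPoolExecutor(max_workers=workers)
        self._pending = []

    def post(self, event):
        # Returns immediately; the caller does not wait for the HTTP round trip.
        future = self._pool.submit(self._send, event)
        self._pending.append(future)
        return future

    def close(self):
        # Wait for all queued events to be delivered, then stop the workers.
        self._pool.shutdown(wait=True)
        # One entry per event: None on success, otherwise the raised exception.
        return [future.exception() for future in self._pending]
```

A caller would construct the poster with a function that wraps
post_to_solr and its own update URL, call post() for each usage event,
and call close() on shutdown.  Passing the delivery function in as a
parameter keeps the queueing logic independent of Solr, which is the
same separation described above.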

I think the ultimate issue is the scalability of storing usage events
from past years (or, under a high volume of activity, past months).
This still needs to be addressed no matter what the final solution in
this area is.  I suspect this will mean rolling up old events into
aggregated statistical "views/reports" rather than retaining the
individual events.  I seem to recall that Awstats was capable of
something along these lines, allowing it to maintain a relatively
fixed database size regardless of how many years of data were stored
within it.
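
As a rough sketch of what such a roll-up could look like (an
illustration only, not an existing DSpace or Awstats feature; the
roll_up function and the "day" and "item" field names are
hypothetical), events older than a cutoff date are reduced to monthly
counts per item while newer events are kept as they are:

```python
from collections import Counter
from datetime import date


def roll_up(events, cutoff):
    """Split events at `cutoff`: older ones become monthly counts, newer ones are kept."""
    monthly_counts = Counter()
    recent = []
    for event in events:
        day = event["day"]
        if day < cutoff:
            # Keep only (year, month, item) -> number of events.
            monthly_counts[(day.year, day.month, event["item"])] += 1
        else:
            recent.append(event)
    return monthly_counts, recent


events = [
    {"day": date(2008, 1, 5), "item": "a"},
    {"day": date(2008, 1, 20), "item": "a"},
    {"day": date(2008, 2, 1), "item": "b"},
    {"day": date(2009, 6, 1), "item": "a"},
]
counts, recent = roll_up(events, cutoff=date(2009, 1, 1))
```

The stored size for the old period then depends on the number of
(month, item) pairs rather than on the number of individual events.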

We would certainly be very interested in contributions from other
community members in this area.

Cheers,
Mark

-- 
Mark R. Diggory
@mire - http://www.atmire.com

------------------------------------------------------------------------------
_______________________________________________
DSpace-tech mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dspace-tech
