This is good. I have not had a chance to play around with Elastic Search,
so I will definitely look into that. We actually have Mark's Solr stats
running now. What it lacks‹really, what they all lack‹is a sure-fire way
making only counting real users.

If you are anything like us, then you know most of your traffic is robots.
You obviously don't want them counted in your stats. The blacklist
approach does not work; it doesn't come close, unless you have a full time
person working to keep the list up to date.

The Google Analytics approach is better, but it comes with its own set of
problems. Like, links from Google have to go to the item page, not
directly to the PDF, in order to be counted.

A third idea I've been kicking around is to augment the current system
with ping-back, so only the downloads and page visits that respond to the
ping back are counted. That would probably take the least work to
implement, but it could have issues as well.

This is going a bit long, but let me add that I am surprised that this is
not a bigger concern within the community. Accurate statistics are key to
measuring an article's impact, and the impact of the repository itself. It
definitely influences buy-in: both faculty participation and institutional
support. 

Anyway, thanks for the input. I wonder, does anyone else have any insight
into gathering accurate statistics?

Cheers, 
Bill




On 10/24/12 6:44 PM, "helix84" <[email protected]> wrote:

>Hi all,
>
>if you want to collaborate on this, it would be best to create an
>issue (or multiple issues) in Jira describing what needs to be done
>and who's doing it. It will prevent duplication of work and allow
>everyone to check the status of such work. It's also desirable to link
>to code on GitHub from the issue(s).
>
>You may also want to coordinate with Peter Dietz (should be on this
>list), who has developed another statistics implementation (Elastic
>search) in addition to Solr statistics (Mark Diggory will want to have
>a say in that, too). It may turn out that there's a need for a
>framework supporting different statistics implementations/backends
>(Solr, Elastic, Analytics, ...), while allowing to use the same
>presentation elements (like graphs) with any of them.
>
>Regards,
>~~helix84


------------------------------------------------------------------------------
Everyone hates slow websites. So do we.
Make your web apps faster with AppDynamics
Download AppDynamics Lite for free today:
http://p.sf.net/sfu/appdyn_sfd2d_oct
_______________________________________________
DSpace-tech mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dspace-tech

Reply via email to