Hi Nason,

On 05/06/13 02:18, Nason Bimbe wrote:
Thank you very much for your reply. Yes the stats data was for one month only (May) that's why I did not add the date filter.

I have run
[dspace]/bin/dspace stats-util -u
[dspace]/bin/dspace stats-util -i

Then the SOLR query and nothing has changed.

Did you add &fq=-isBot:true to your query? You need to add this, otherwise "bot" hits will still be included in your query results even after the two commands above.

If I may ask do you now the query used by the 'View Usage Statistics' link in XMLUI because the stats displayed look more reasonable, in other words can I replicate the query by querying SOLR directly?

I've customised the stats for "my" repository so much that I have no idea what's normally on that page. From the demo server, it looks like this lists the 10 items with the most downloads. To get those, you'd use:
http://localhost:8080/solr/statistics/select?indent=true&rows=0&q=type:0&facet=true&facet.limit=10&facet.field=owningItem&fq=-isBot:true

The English version of this query is something like
  • Show me the 10 most frequent (facet=true&facet.limit=10)
  • items (facet.field=owningItem)
  • where "frequent" means going by bitstream downloads (q=type:0)
  • but I don't want to see any individual events in additio(rows=0)
  • and exclude all hits by bots (fq=-isBot:true) -- this counts only events that don't have an isBot value at all or where the isBot value is false
The "indent=true" just makes the output easier to read.

Would you also be in a position to explain the possible values for type in the query for me please? For example I have seen posts describing number of downloads as being of type = 0 in SOLR.

The type refers to the constants in Constants.java (https://github.com/DSpace/DSpace/blob/master/dspace-api/src/main/java/org/dspace/core/Constants.java). Probably the most important ones are 0 (bitstream downloads) and 2 (item page views). The id is always the internal id of the object concerned; for bitstreams, owningItem will hold the id of the bitstream's item. All other fields for view/download events look pretty self-explanatory to me; please ask if you have specific questions.

There is documentation about all fields here: https://wiki.duraspace.org/display/DSDOC3x/DSpace+Statistics
In particular, this section explains what fields are available for querying / faceting: https://wiki.duraspace.org/display/DSDOC3x/DSpace+Statistics#DSpaceStatistics-Whatisexactlybeinglogged?

And this section explains a bit about custom queries: https://wiki.duraspace.org/display/DSDOC3x/DSpace+Statistics#DSpaceStatistics-CustomReporting-QueryingSOLRDirectly though much of figuring this out is just getting your head around Solr. Again, ask if you have specific questions.

cheers,
Andrea

-- 
Dr Andrea Schweer
IRR Technical Specialist, ITS Information Systems
The University of Waikato, Hamilton, New Zealand


------------------------------------------------------------------------------
How ServiceNow helps IT people transform IT departments:
1. A cloud service to automate IT design, transition and operations
2. Dashboards that offer high-level views of enterprise services
3. A single system of record for all IT processes
http://p.sf.net/sfu/servicenow-d2d-j
_______________________________________________
DSpace-tech mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dspace-tech
List Etiquette: https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette

Reply via email to