Hi George,
To do work on statistics conversions I looked at Stuart Lewis'
stats-log-converter code, and expanded upon it to make a
stats-apache-log-converter, which converted Apache access logs into SOLR
data. It had to make sense of urls with handles and bitstreams, to which it
does database lookups to find out if it is a community, collection, item,
bitstream, and got the internal id, which is needed by SOLR.
I started tracking this in:
http://jira.dspace.org/jira/browse/DS-522
And it needs to be polished a bit more before it would be ready to commit.
However, it was enough to get the data I needed. So my advice for you would
be to awk your data into something that would be input for what I've done,
or use the code as a start to build something that suits your needs
SOLR also expects IP addresses (for geolocation), and timestamps (which must
be unique).
If the IP is unknown, as is likely in this case, I think DSpace will count
them as "Unknown Country/City"
Peter Dietz
Systems Developer/Engineer
Ohio State University Libraries
On Mon, Jun 28, 2010 at 8:31 PM, George Stanley Kozak <[email protected]>wrote:
> Hi. .
>
> I am experimenting with dspace 1.6.2 in prepartion to moving it into
> production. I have a question about the solr statistics. I did the
> conversion of the dspace logs, and that worked fine. However, I have data
> from earlier this year which I have collected (it is in tab delemited
> files...hits and downloads by handle and bitstream), but it is no longer in
> the format of a dspace log. Is there some way of adding this data into the
> SOLR stats?
>
>
> George Kozak
>
> Digital Library Specialist
>
> Division of Library Information Technologies (DLIT)
>
> 501 Olin Library
>
> Cornell University
>
> Ithaca, NY 14853
>
> 607-255-8924
>
>
> ------------------------------------------------------------------------------
> This SF.net email is sponsored by Sprint
> What will you do first with EVO, the first 4G phone?
> Visit sprint.com/first -- http://p.sf.net/sfu/sprint-com-first
> _______________________________________________
> DSpace-tech mailing list
> [email protected]
> https://lists.sourceforge.net/lists/listinfo/dspace-tech
>
>
------------------------------------------------------------------------------
This SF.net email is sponsored by Sprint
What will you do first with EVO, the first 4G phone?
Visit sprint.com/first -- http://p.sf.net/sfu/sprint-com-first
_______________________________________________
DSpace-tech mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dspace-tech