Hi George,

To do work on statistics conversions I looked at Stuart Lewis'
stats-log-converter code, and expanded upon it to make a
stats-apache-log-converter, which converted Apache access logs into SOLR
data. It had to make sense of urls with handles and bitstreams, to which it
does database lookups to find out if it is a community, collection, item,
bitstream, and got the internal id, which is needed by SOLR.

I started tracking this in:
http://jira.dspace.org/jira/browse/DS-522

And it needs to be polished a bit more before it would be ready to commit.
However, it was enough to get the data I needed. So my advice for you would
be to awk your data into something that would be input for what I've done,
or use the code as a start to build something that suits your needs

SOLR also expects IP addresses (for geolocation), and timestamps (which must
be unique).

If the IP is unknown, as is likely in this case, I think DSpace will count
them as "Unknown Country/City"


Peter Dietz
Systems Developer/Engineer
Ohio State University Libraries



On Mon, Jun 28, 2010 at 8:31 PM, George Stanley Kozak <[email protected]>wrote:

>  Hi. .
>
> I am experimenting with dspace 1.6.2 in prepartion to moving it into
> production.  I have a question about the solr statistics.  I did the
> conversion of  the dspace logs, and that worked fine.  However, I have data
> from earlier this year which I have collected (it is in tab delemited
> files...hits and downloads by handle and bitstream), but it is no longer in
> the format of a dspace log.  Is there some way of adding this data into the
> SOLR stats?
>
>
> George Kozak
>
> Digital Library Specialist
>
> Division of Library Information Technologies (DLIT)
>
> 501 Olin Library
>
> Cornell University
>
> Ithaca, NY 14853
>
> 607-255-8924
>
>
> ------------------------------------------------------------------------------
> This SF.net email is sponsored by Sprint
> What will you do first with EVO, the first 4G phone?
> Visit sprint.com/first -- http://p.sf.net/sfu/sprint-com-first
> _______________________________________________
> DSpace-tech mailing list
> [email protected]
> https://lists.sourceforge.net/lists/listinfo/dspace-tech
>
>
------------------------------------------------------------------------------
This SF.net email is sponsored by Sprint
What will you do first with EVO, the first 4G phone?
Visit sprint.com/first -- http://p.sf.net/sfu/sprint-com-first
_______________________________________________
DSpace-tech mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dspace-tech

Reply via email to