I am running DSpace 3.1 and I am interested in getting the number of
downloads from Solr.
I run this query:
http://localhost:8080/solr/statistics/select?indent=on&version=2.2&start=0&rows=100&fl=*%2Cscore&qt=standard&wt=standard&explainOther=&hl.fl=&facet=true&facet.field=owningComm&q=type:0
and when I look at the results in the facets section, the numbers seem too
high for a month's data. Looking at some of the returned docs, I can see
that some are bots, yet my config says bots should be omitted.
Can someone please help me explain this? I have looked at
http://iplists.com/google.txt and the IP address of the Googlebot hit below
(66.249.75.222) is not on the list. What is going on here?
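For readability, the same query can be assembled from named parameters. This is only a minimal sketch in Python: the base URL and the main parameters are copied from the query above, while the function name `build_stats_query` and the two example `fq` filters (excluding records flagged `isBot:true` and restricting to May 2013) are my own assumptions, not something DSpace adds for you. Note that such a filter would not remove the records below, since they are stored with `isBot` set to false.

```python
from urllib.parse import urlencode

# Base URL taken from the query above.
BASE_URL = "http://localhost:8080/solr/statistics/select"


def build_stats_query(extra_filters=()):
    """Build the select URL for type:0 hits, faceted by owningComm.

    extra_filters is a hypothetical hook: each entry is sent as a Solr
    "fq" (filter query) parameter.
    """
    params = [
        ("q", "type:0"),
        ("start", 0),
        ("rows", 100),
        ("fl", "*,score"),
        ("indent", "on"),
        ("facet", "true"),
        ("facet.field", "owningComm"),
    ]
    params.extend(("fq", f) for f in extra_filters)
    return BASE_URL + "?" + urlencode(params)


# Plain query, equivalent in intent to the one above.
print(build_stats_query())

# Assumed filters: drop records flagged as bots, keep one month of data.
print(build_stats_query([
    "-isBot:true",
    "time:[2013-05-01T00:00:00Z TO 2013-06-01T00:00:00Z]",
]))
```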
[config snippet]
# control solr statistics querying to filter out spider IPs
# false by default
#query.filter.spiderIp = false
# control solr statistics querying to look at "isBot" field to determine
# if record is a bot. true by default.
#query.filter.isBot = true
[result snippet]
<doc>
<float name="score">3.504466</float>
<arr name="bundleName"><str>ORIGINAL</str></arr>
<str name="city">Mountain View</str>
<str name="continent">NA</str>
<str name="countryCode">US</str>
<str name="dns">crawl-66-249-75-222.googlebot.com.</str>
<int name="id">1673</int>
<str name="ip">66.249.75.222</str>
<bool name="isBot">false</bool>
<float name="latitude">37.419205</float>
<float name="longitude">-122.0574</float>
<arr name="owningColl"><int>12</int></arr>
<arr name="owningComm"><int>9</int><int>2</int><int>2</int></arr>
<arr name="owningItem"><int>829</int></arr>
<str name="statistics_type">view</str>
<date name="time">2013-05-08T17:20:01.295Z</date>
<int name="type">0</int>
<str name="uid">7f7a6135-6c0a-4e2d-b8a3-2c022c2ab20e</str>
<str name="userAgent">Mozilla/5.0 (iPhone; U; CPU iPhone OS 4_1 like Mac OS X; en-us) AppleWebKit/532.9 (KHTML, like Gecko) Version/4.0.5 Mobile/8B117 Safari/6531.22.7 (compatible; Googlebot-Mobile/2.1; +http://www.google.com/bot.html)</str>
</doc>
<doc>
<float name="score">3.504466</float>
<arr name="bundleName"><str>ORIGINAL</str></arr>
<str name="city">Moscow</str>
<str name="continent">EU</str>
<str name="countryCode">RU</str>
<str name="dns">spider-178-154-243-100.yandex.com.</str>
<int name="id">1056</int>
<str name="ip">178.154.243.100</str>
<bool name="isBot">false</bool>
<float name="latitude">55.752197</float>
<float name="longitude">37.6156</float>
<arr name="owningColl"><int>10</int></arr>
<arr name="owningComm"><int>1</int></arr>
<arr name="owningItem"><int>175</int></arr>
<str name="statistics_type">view</str>
<date name="time">2013-05-08T16:52:12.106Z</date>
<int name="type">0</int>
<str name="uid">1943f4fe-3ce8-4793-8c3e-3c5e34fa7975</str>
<str name="userAgent">Mozilla/5.0 (compatible; YandexBot/3.0; +http://yandex.com/bots)</str>
</doc>
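To find out how many records like these exist, the returned XML could be scanned for docs that are stored with isBot=false but whose userAgent or dns looks like a crawler. The following is a rough sketch in Python, not DSpace's own spider detection: the pattern `bot|spider|crawl` and the helper names are assumptions of mine, and the second doc in the sample (192.0.2.10) is made up for contrast.

```python
import re
import xml.etree.ElementTree as ET

# Assumed heuristic, not DSpace's spider list: substrings that often
# appear in crawler user agents and reverse-DNS names.
BOT_HINTS = re.compile(r"bot|spider|crawl", re.IGNORECASE)


def field(doc, name):
    """Return the text of the child element whose name attribute matches."""
    for child in doc:
        if child.get("name") == name:
            return (child.text or "").strip()
    return ""


def unflagged_bots(xml_text):
    """List (ip, dns) for docs stored with isBot=false that look like bots."""
    root = ET.fromstring(xml_text)
    hits = []
    for doc in root.iter("doc"):
        if field(doc, "isBot") != "false":
            continue
        if (BOT_HINTS.search(field(doc, "userAgent"))
                or BOT_HINTS.search(field(doc, "dns"))):
            hits.append((field(doc, "ip"), field(doc, "dns")))
    return hits


# First doc: trimmed from the result above. Second doc: hypothetical.
SAMPLE = """<result>
<doc>
<str name="dns">crawl-66-249-75-222.googlebot.com.</str>
<str name="ip">66.249.75.222</str>
<bool name="isBot">false</bool>
<str name="userAgent">Mozilla/5.0 (compatible; Googlebot-Mobile/2.1)</str>
</doc>
<doc>
<str name="dns">host.example.org.</str>
<str name="ip">192.0.2.10</str>
<bool name="isBot">false</bool>
<str name="userAgent">Mozilla/5.0 (Windows NT 6.1) Firefox/21.0</str>
</doc>
</result>"""

print(unflagged_bots(SAMPLE))
```

A substring heuristic like this will produce false positives and misses, so it is only useful for getting a rough count of suspicious records.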
Thanks
Nason
_______________________________________________
DSpace-tech mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dspace-tech
List Etiquette: https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette