analog-help  

Re: [analog-help] complete list of accessed files?

Christoph Kukulies
Wed, 12 Aug 2009 11:46:57 -0700

Aengus schrieb:
On 8/12/2009 10:47 AM, Christoph Kukulies wrote:

I introduced this setting (REQFLOOR 1r) now in my config file, I also included
FILEINCLUDE *.pdf

Still I don't see a single .pdf in the list of requested files (last listing in the report). I have about 3000 .pdf file requests in the original apache access_log file.

I assume the file extension (FILEINCLUDE syntax) isn't case sensitive.

Actually, if Analog is running on a case sensitive OS, then the FILEINCLUDE probably is case sensitive.

http://analog.cx/docs/alias.html#CASE

Can you post 3 or 4 lines from your log file, including some PDF requests?
87.79.34.253 - - [11/Aug/2009:17:58:38 +0200] "GET /export/download/de/AB-lang/AB-3-5-7.pdf HTTP/1.1" 200 158955 "http://www.mysite.de/de/produkte/AB-lang//index.htm"; "Mozilla/5.0 (Windows; U; Windows NT 5.1; de; rv:1.9.1.2) Gecko/20090729 Firefox/3.5.2 (.NET CLR 3.5.30729)"

217.91.80.223 - - [12/Aug/2009:09:31:33 +0200] "GET /export/download/de/AB-lang/AB-Plan.pdf HTTP/1.1" 200 212734 "http://www.mysite.de/de/produkte/AB-lang/AB-Plan.htm"; "Mozilla/5.0 (Windows; U; Windows NT 5.1; de; rv:1.9.0.13) Gecko/2009073022 Firefox/3.0.13"

159.51.236.51 - - [12/Aug/2009:14:21:36 +0200] "GET /export/download/de/AB-lang/XYZPP.pdf HTTP/1.1" 200 108759 "http://www.mysite.de/de/produkte/AB-lang/ABACC-XYZYPP.htm"; "Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1; SV1; .NET CLR 1.0.3705; .NET CLR 1.1.4322; .NET CLR 2.0.50727)"

85.3.36.7 - - [12/Aug/2009:15:36:27 +0200] "GET /export/download/de/AB-lang/AB-Plan.pdf HTTP/1.1" 206 208857 "-" "Mozilla/5.0 (Windows; U; Windows NT 6.0; de; rv:1.9.0.13) Gecko/2009073022 Firefox/3.0.13 (.NET CLR 3.5.30729)"

The first three are referred from our own pages (mysite.de).
I found that most of the requests with return code 200 are bots, crawlers, Yandex, msnbot, google search results. I will try the suggested page-include and see what happens when I include SEARCHENGINES again.

I hope the syntax is correct since I must admit that I had to reconstruct (paste together) the main log file from different files (referrer.log, agents.log) because there was a break in the logformat during time.

--
Christoph Kukulies


Aengus
+------------------------------------------------------------------------
|  TO UNSUBSCRIBE from this list:
|    http://lists.meer.net/mailman/listinfo/analog-help
|
|  Analog Documentation: http://analog.cx/docs/Readme.html
|  List archives:  http://www.analog.cx/docs/mailing.html#listarchives
|  Usenet version: news://news.gmane.org/gmane.comp.web.analog.general
+------------------------------------------------------------------------

+------------------------------------------------------------------------
|  TO UNSUBSCRIBE from this list:
|    http://lists.meer.net/mailman/listinfo/analog-help
|
|  Analog Documentation: http://analog.cx/docs/Readme.html
|  List archives:  http://www.analog.cx/docs/mailing.html#listarchives
|  Usenet version: news://news.gmane.org/gmane.comp.web.analog.general
+------------------------------------------------------------------------