I ran the command below on yesterday's log file:

  grep 'GET /pages/index.html' dumpty.2004.10.25.access.log | wc -l
  18048

Without a doubt, the symbolic link is being written into my log file. When
I grep for the actual path in the logs I find only one occurrence.
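
A per-status tally can make a raw grep count easier to interpret, since `grep | wc -l` counts every matching line regardless of the response code. The following is a minimal Python sketch, assuming Apache combined log format as in the entries quoted in this message. The function name `status_counts` and the regular expression are illustrative assumptions, not anything Analog or Apache provides; the sample lines are copied from this thread.

```python
import re
from collections import Counter

# Pulls the method, path and status out of a combined-format access log line.
# Illustrative pattern only; it assumes the request line is the first quoted field.
LINE_RE = re.compile(r'"(?P<method>[A-Z]+) (?P<path>\S+) [^"]*" (?P<status>\d{3}) ')

def status_counts(lines, path):
    """Count responses per HTTP status for GET requests to one exact path."""
    counts = Counter()
    for line in lines:
        m = LINE_RE.search(line)
        if m and m.group("method") == "GET" and m.group("path") == path:
            counts[m.group("status")] += 1
    return counts

# Three entries taken from the excerpts in this thread.
sample = [
    '67.118.164.102 - - [25/Oct/2004:00:27:58 -0400] "GET /pages/index.html HTTP/1.1" 200 77485 "-" "Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.2)"',
    '221.115.4.156 - - [25/Oct/2004:00:44:25 -0400] "GET /homepage.html HTTP/1.1" 301 319 "-" "Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1; .NET CLR 1.1.4322)"',
    '68.61.224.142 - - [25/Oct/2004:00:44:27 -0400] "GET /homepage.html HTTP/1.1" 301 319 "-" "Opera/7.51 (Windows NT 5.1; U) [en]"',
]

pages_counts = status_counts(sample, "/pages/index.html")
homepage_counts = status_counts(sample, "/homepage.html")
```

On a real log, passing an open file object as `lines` would give the split of 200, 304 and other statuses behind a single grep total.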
Here are the log entries for a successful request of the symlink:
67.118.164.102 - - [25/Oct/2004:00:27:58 -0400] "GET /pages/index.html HTTP/1.1" 200 77485 "-" "Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.2)"
221.217.222.116 - - [25/Oct/2004:00:28:01 -0400] "GET /pages/index.html HTTP/1.1" 200 22805 "-" "Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1)"
65.54.98.152 - - [25/Oct/2004:00:28:06 -0400] "GET /pages/index.html HTTP/1.0" 200 77323 "-" "Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1; SV1; MSN 9.0; MSNbMSNI; MSNmen-us; MSNcIA; MPLUS)"
221.217.222.116 - - [25/Oct/2004:00:28:07 -0400] "GET /pages/index.html HTTP/1.1" 200 77485 "-" "Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1)"
Here are a few log entries of the old redirected homepage:
221.115.4.156 - - [25/Oct/2004:00:44:25 -0400] "GET /homepage.html HTTP/1.1" 301 319 "-" "Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1; .NET CLR 1.1.4322)"
68.61.224.142 - - [25/Oct/2004:00:44:27 -0400] "GET /homepage.html HTTP/1.1" 301 319 "-" "Opera/7.51 (Windows NT 5.1; U) [en]"
221.115.4.156 - - [25/Oct/2004:00:44:35 -0400] "GET /homepage.html HTTP/1.1" 301 319 "-" "Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1; .NET CLR 1.1.4322)"
68.147.219.153 - - [25/Oct/2004:00:44:36 -0400] "GET /homepage.html HTTP/1.1" 301 319 "-" "Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1; SV1)"
62.231.244.22 - - [25/Oct/2004:00:44:43 -0400] "GET /homepage.html HTTP/1.0" 301 307 "-" "Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.0)"
I suppose I could try to look at one host and follow its path through my
log files. I'm at a total loss. But I do appreciate all the help.
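
Following hosts through the log could be automated along these lines. This is a rough Python sketch under stated assumptions: the log is in combined format and in chronological order, a redirect counts as "followed" only when the same host's very next logged request is a 200 for the new page, and the function name `redirects_followed` is made up for illustration. The sample lines are hypothetical (documentation-range addresses and a placeholder user agent), not taken from the real log.

```python
import re

# Host, path and status from a combined-format line (illustrative pattern).
LINE_RE = re.compile(
    r'^(?P<host>\S+) \S+ \S+ \[[^\]]+\] "[A-Z]+ (?P<path>\S+) [^"]*" (?P<status>\d{3}) '
)

def redirects_followed(lines, old_path, new_path):
    """Return (followed, unfollowed) counts for redirects issued on old_path."""
    pending = set()  # hosts whose latest logged request was a redirect from old_path
    followed = 0
    unfollowed = 0
    for line in lines:
        m = LINE_RE.match(line)
        if not m:
            continue
        host, path, status = m.group("host", "path", "status")
        if host in pending:
            pending.discard(host)
            if path == new_path and status == "200":
                followed += 1
                continue
            unfollowed += 1
        if path == old_path and status in ("301", "302"):
            pending.add(host)
    # Redirects with no later request from the same host were never followed.
    unfollowed += len(pending)
    return followed, unfollowed

# Hypothetical entries for illustration only.
sample = [
    '192.0.2.10 - - [25/Oct/2004:00:44:25 -0400] "GET /homepage.html HTTP/1.1" 301 319 "-" "ExampleAgent/1.0"',
    '192.0.2.10 - - [25/Oct/2004:00:44:26 -0400] "GET /pages/index.html HTTP/1.1" 200 77485 "-" "ExampleAgent/1.0"',
    '192.0.2.20 - - [25/Oct/2004:00:44:27 -0400] "GET /homepage.html HTTP/1.1" 301 319 "-" "ExampleAgent/1.0"',
]

result = redirects_followed(sample, "/homepage.html", "/pages/index.html")
```

The strict "next request is a 200" rule undercounts: a browser that already has the new page cached may get a 304 instead, and several users behind one proxy share a host address, so the numbers are only a rough guide.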
Steven
Duke Hillard <[EMAIL PROTECTED]>
To: Support for analog web log analyzer <[EMAIL PROTECTED]>
cc:
Sent by: [EMAIL PROTECTED]ts.meer.net
Subject: Re: [analog-help] problems with symlinks
10/26/04 05:11 PM
Please respond to duke; Please respond to Support for analog web log analyzer

I think I see the problem. Let me offer an experiment.
I'm using Solaris 8. Redhat 9 should perform similarly.
I go to my Apache document root "/web/docs" and I
create a file "file.html" in my doc root. Then I create
a symbolic link "pointer.html" to "file.html".
Next I visit "http://www.louisiana.edu/file.html". As I
might expect, "/file.html" is listed in my logfile. Next I
visit "http://www.louisiana.edu/pointer.html". Which name
should appear in my logfile, "file.html" or "pointer.html"?
Since Apache served "file.html", it adds "file.html" to
the log file and not "pointer.html".
Likewise, since "/pages/" is a symlink, it won't appear
in your Apache logfile. What will appear is the actual
path that "/pages/" points to (or symbolizes). There is
no way that "/pages/index.html" will be added to your
logfile because Apache isn't serving that page. Rather,
look for the actual path in your logfile and in the reports
that Analog generates.
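
The filesystem half of this experiment can be reproduced in a scratch directory. The following Python sketch is an illustration only: a temporary directory stands in for the document root, and the file names mirror the ones used in this message. It shows only how the link resolves on disk; it says nothing about which name a given Apache configuration writes to its access log, which is best checked by grepping the log for both names.

```python
import os
import tempfile

# A temporary directory stands in for the document root (assumption).
with tempfile.TemporaryDirectory() as doc_root:
    target = os.path.join(doc_root, "file.html")
    link = os.path.join(doc_root, "pointer.html")

    with open(target, "w") as f:
        f.write("<html>hello</html>")

    # Relative symlink, equivalent to `ln -s file.html pointer.html`.
    os.symlink("file.html", link)

    is_link = os.path.islink(link)
    # Name of the file the link resolves to on disk.
    resolved_name = os.path.basename(os.path.realpath(link))
    # Reading through the link returns the target's content.
    with open(link) as f:
        same_content = f.read() == "<html>hello</html>"
```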
HTH,
-- Duke
[EMAIL PROTECTED] wrote:
>Thank you for answering my question. The answer confirms that there is a
>problem with the numbers that analog is calculating. I will be (hopefully)
>more clear this time as I describe my problem because I am sure that there
>is an answer.
>
>I have an old homepage called /homepage.html. There is a redirect in the
>apache config that sends people to the new homepage which is called
>/pages/index.html. The "pages" part of that link is a symlink on the
>filesystem to another location. For whatever reason, "/pages/index.html"
>is not appearing in the list of most requested pages but I know that it is
>because it is my homepage.
>
>Every 301/302 should result in a 200 if the redirect is correct and at that
>point analog should count it as a successful request. The redirect is
>correct because it works on the site and therefore if I have 14,000
>redirects from /homepage.html shouldn't I also see /pages/index.html with
>14,000 requests? I see /homepage.html in the list of redirects at the top
>but I do not see /pages/index.html in the list of requests at all.
>
>Any advice is always appreciated.
>
>Steven
>
> Stephen Turner <[EMAIL PROTECTED]er.net>
> To: Support for analog web log analyzer <[EMAIL PROTECTED]>
> cc:
> Sent by: [EMAIL PROTECTED]ts.meer.net
> Subject: Re: [analog-help] problems with symlinks
> 10/26/04 11:26 AM
> Please respond to Support for analog web log analyzer
>
>On Tue, 26 Oct 2004 [EMAIL PROTECTED] wrote:
>
>>I guess the question that I really need a clear answer to is, if the
>>webserver successfully redirects a user to the same server does that mean
>>that analog will count 1 status 301 and also 1 status 200? When I want to
>>know the total number of page views do I look at the page views or page
>>views plus redirects?
>
>I'm still unclear about your setup. Your logfile sample had no redirects in.
>Are you confusing HTTP redirects from the server with file system symlinks?
>
>But anyway, it doesn't really matter, because the answer is simple. Analog
>will count whatever's in the logfile. If there is an HTTP redirect, it will
>count one redirect for the 301/302 and one successful request for the 200.
>
>--
>Stephen Turner, Cambridge, UK
>http://homepage.ntlworld.com/adelie/stephen/
> "Low Priced Cambridge Clare College. Big selection at eBay UK!"
> (Ad after Google search for Clare College Cambridge)
>
>
>
(See attached file: dxh0844.vcf)
+------------------------------------------------------------------------
| TO UNSUBSCRIBE from this list:
| http://lists.meer.net/mailman/listinfo/analog-help
|
| Usenet version: news://news.gmane.org/gmane.comp.web.analog.general
| List archives: http://www.analog.cx/docs/mailing.html#listarchives
+------------------------------------------------------------------------

