I checked out the squid log analyzer programs, But haven't found one that can provide a sample output like what I need to see on the report.
Say for example I go to microsoft.com, click on "products", then click on "visual studio .NET" I'd like to see this in the logfile: http://www.microsoft.com http://www.microsoft.com/products http://www.microsoft.com/products/visual_studio This is a theoretical example as if those are the actual URL locations typed into the address bar, or clicked via hyperlink. I don't see how the access.log can be used to provide this kind of report. For example, if I simply type microsoft.com in my address bar and click on "office" in the left pane, then check my access.log, I see 35 entries have been added just by clicking the "office" link once. I understand that there is a separate entry for each HTTP GET that the webpage calls for, but the access.log doesn't seem to differentiate between what the user clicked, and what the webpage requested to display the whole page correctly. More specifically, the first 3 entries say: 127.0.0.1 - - [22/Jan/2005:15:56:31 -0500] "GET http://g.microsoft.com/mh_mshp/2 HTTP/1.1" 301 538 TCP_MISS:DIRECT 127.0.0.1 - - [22/Jan/2005:15:56:32 -0500] "GET http://office.microsoft.com/home/default.aspx HTTP/1.1" 301 467 TCP_MISS:DIRECT 127.0.0.1 - - [22/Jan/2005:15:56:32 -0500] "GET http://office.microsoft.com/en-us/default.aspx HTTP/1.1" 200 52134 TCP_MISS:DIRECT How is ANY logfile analyzer going to tell the difference between the first entry (which the user clicked on) and the second/third entries (which were requested by the html from the first entry)? Is there is a squid configuration parameter that will allow the logs to be filtered appropriately? __________________________________ Do you Yahoo!? Meet the all-new My Yahoo! - Try it today! http://my.yahoo.com
