These test files aren't always in the same location (I never said 
consistency was my strong point!) and anyway, users can always override 
entries in the central analog.cfg by specifying their own .cfg file. 
(I haven't tried to put that FILEEXCLUDE in the mandatory.cfg file, 
though).

The reason I gave for not spidering log reports may not be a very good 
one, though. Old Analog reports are often left online, for a variety of 
reasons, and so they provide links to documents that no longer exist. I 
end up with 404 errors that aren't due to dead links in actual content 
pages. Our search engine can also show us which pages link to any given 
page, but that information is a bit less meaningful if it includes the 
12 archived monthly log file reports.

I'd prefer not to index Analog reports. I can deal with the problem 
fairly well by specifying my Analog report directories in the ROBOTS.TXT 
file. But it would be handy to have Analog generate the exclusion 
commands for me.
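
In the meantime, a report could be post-processed after Analog writes it. 
What follows is only a rough sketch of that idea, not anything Analog 
itself provides: the function name add_robots_meta and the constant 
ROBOTS_META are made up for illustration, and it assumes the report is 
ordinary HTML with a <head> tag. It inserts the META tag quoted below 
right after <head> and leaves the page alone if a robots tag is already 
there.

```python
import re

# The tag from the original message; the name ROBOTS_META is hypothetical.
ROBOTS_META = '<META NAME="ROBOTS" CONTENT="NOINDEX, NOFOLLOW">'

# Matches an opening <head> tag (with or without attributes), any case.
HEAD_OPEN = re.compile(r"<head(?:\s[^>]*)?>", re.IGNORECASE)

# Matches an existing robots META tag so the page is not tagged twice.
HAS_ROBOTS = re.compile(r'<meta\s+name="?robots"?', re.IGNORECASE)


def add_robots_meta(html: str) -> str:
    """Return html with the robots META tag inserted after <head>.

    Hypothetical helper: the input is returned unchanged if it already
    carries a robots META tag or has no <head> tag to attach it to.
    """
    if HAS_ROBOTS.search(html):
        return html
    # Insert the tag on its own line after the first <head> only.
    return HEAD_OPEN.sub(
        lambda m: m.group(0) + "\n" + ROBOTS_META, html, count=1
    )


if __name__ == "__main__":
    page = "<html><head><title>Report</title></head><body></body></html>"
    print(add_robots_meta(page))
```

Something like this could be run over each report file once Analog has 
finished, wherever the report ends up being posted.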

Aengus



______________________________ Reply Separator _________________________________
Subject: Re: [analog-help] ROBOTS Meta tag
Author:  [EMAIL PROTECTED] at Internet
Date:    3/26/99 3:57 PM


Why can't you just add a

FILEEXCLUDE /test/*

to your analog.cfg file, as explained in docs/include.html?

Regards,

Roger Brown


On Fri, 26 Mar 1999, Aengus Lawlor wrote:

> I'm forever creating "test" web pages that aren't linked to, so that our 
> Intranet spider doesn't find them and index them. (Security through 
> obscurity). But recently I found that some of them were in our search 
> engine, and it turned out that an Analog report had been posted and 
> indexed, and the spider followed the links in the request report to find 
> my "test" pages. 
> 
> While I immediately added the directory that the Analog report was in to 
> the server's ROBOTS.TXT file, it would be handy if Analog could 
> automatically add the Robot Exclusion meta tag to its output. (Now that 
> I've made anlgform.exe available on the server, I can't always keep 
> track of where an Analog report will be posted, and I don't have the 
> tools to modify and recompile the code with the addition.)
> 
> The Robots exclusion tag looks like this: 
> 
> <META NAME="ROBOTS" CONTENT="NOINDEX, NOFOLLOW"> 
> 
> It's explained in greater detail at: 
> http://info.webcrawler.com/mak/projects/robots/exclusion.html#meta 
> 
> Aengus
> 
> -------------------------------------------------------------------- 
> This is the analog-help mailing list. To unsubscribe from this
> mailing list, send mail to [EMAIL PROTECTED] 
> with "unsubscribe analog-help" in the main BODY OF THE MESSAGE.
> -------------------------------------------------------------------- 
> 

Date: Fri, 26 Mar 1999 15:57:53 -0700 (MST)
From: Roger L Brown <[EMAIL PROTECTED]>
To: [EMAIL PROTECTED]
Subject: Re: [analog-help] ROBOTS Meta tag
