Author: Alexander Barkov
Email: [EMAIL PROTECTED]
Message:
As far as I know, wildcards like * are not supported by the robots.txt
standard, and they are not implemented in msearch.
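
To illustrate the robots.txt side of this: under the original robots.txt convention, a Disallow value is compared against the start of the URL path as a literal string, so a * in it is just another character. Below is a minimal sketch of that prefix rule in Python. The function name is_disallowed and the sample paths are made up for illustration, and this is not msearch's actual code.

```python
def is_disallowed(path, disallow_rules):
    """Return True if `path` starts with any non-empty Disallow value.

    Assumption: plain literal prefix matching, as in the original
    robots.txt convention. No wildcard expansion is performed.
    """
    return any(rule and path.startswith(rule) for rule in disallow_rules)


# Hypothetical rules: one ordinary prefix, one written with a wildcard.
rules = ["/cgi-bin/", "*out.cgi"]

print(is_disallowed("/cgi-bin/search.cgi", rules))  # True: literal prefix matches
print(is_disallowed("/forum/out.cgi", rules))       # False: "*" is not expanded
```

With this rule, a line such as "Disallow: *out.cgi" would block nothing unless a path literally began with an asterisk.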


> I had a lot of problems with my indexer trying to index mailto CGIs, search CGIs,
> and forward/backward links in documents.
> 
> This is related to discussion forums like UBB and others.
> 
> I got some advice on how to filter certain CGIs with options, but I think I must
> have done something seriously wrong!
> Before I modified indexer.conf, I had 300,000 URLs in the database. Now I've got
> 2500!
> 
> If someone could check out the following section of my indexer.conf, I'd be most
> thankful:
> 
> Disallow *out.cgi
> Disallow *privatesend.cgi *action*
> Disallow *ubbmisc.cgi *findthread*
> Disallow *search.cgi *simplesearch*
> Disallow *Ultimate.cgi *email*
> Disallow *ultimatebb.cgi *get_ip*
> Disallow *ultimatebb.cgi *reply*
> Disallow *ultimatebb.cgi *send_topic*
> Disallow *ultimatebb.cgi *next_topic*
> Disallow *ultimatebb.cgir *edit_post*
> Disallow *ultimatebb.cgi *close_topic*
> Disallow *ultimatebb.cgi *email*
> Disallow *ultimatebb.cgi *delete_topic*
> #
> Disallow NoMatch String *www.rc-racing.com/*backtalk/pistachio*
> Disallow NoMatch String *www.rc-racing.com/*backtalk/abalone*
> 
> 
> A lot of URL processing is now down the drain. I need to fix the problem and index
> the whole thing over again :-(.
> 
> I can add that I'm not using robots.txt (it seemed to stop a lot of relevant URLs
> from getting indexed), but other than that my config is a copy of the sample.
> Running multi-crc mode.
> 
> Thanks in advance.
> 
> Best regards,
> Ørjan Sandland
> Senior Consultant
> Net Technology AS
> 

Reply: <http://search.mnogo.ru/board/message.php?id=2103>
