Author: Alexander Barkov
Email: [EMAIL PROTECTED]
Message:
Did you disallow /forums/moderator.php in robots.txt BEFORE
the first indexing run, or during it?


> I am running vBulletin, a nice PHP- and MySQL-based BBS system. It uses a
> bunch of .php files to access the features of the board.
> 
> I am trying to keep mnoGoSearch from indexing pages like:
> 
>   /forums/moderator.php?s=232093423908
>   /forums/postings.php?s=&action=editthread&threadid=24
> 
> I still want it to index the actual forum messages, which are at addresses like:
> 
>   /forums/forumdisplay.php?s=&forumid=20
>   /forums/announcement.php?s=&forumid=20
>   /forums/showthread.php?s=&threadid=24
> 
> I have a robots.txt file set up with names like:
> 
>   /forums/moderator.php
> 
> However, these apparently don't match when the URL is compared against the
> robots table.
> 
> It appears to me from reading the robots.txt specification that I should be
> able to exclude:
> 
>   /forums/moderator.php?s=340293840
> 
> with the above entry, based on the statement from the spec that:
> 
> "An affirmative comparison is one in which each and every character of a
> path root exactly matches the corresponding character in the complete path
> of the absolute url"
> 
> The full URL of /forums/moderator.php?s=12345678 would match the path root
> of /forums/moderator.php and would be excluded. This does not happen today.
> 
> Is this a bug? A misinterpretation of the robots.txt spec? Any suggestions
> on getting around this?
> 
> Thanks,
> 
> -Tim 
> 
> 
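
The prefix rule Tim quotes from the specification can be illustrated with a
minimal Python sketch. This is not mnoGoSearch's code, and it says nothing
about how the indexer stores or refreshes its robots table. The function
names path_with_query and is_disallowed are hypothetical, and the rule list
holds only the single entry mentioned in the message. It assumes that the
query string counts as part of the "complete path" being compared, which is
how Tim reads the spec.

```python
from urllib.parse import urlsplit


def path_with_query(url):
    """Return the path plus query string of a URL, e.g. '/a.php?s=1'."""
    parts = urlsplit(url)
    path = parts.path or "/"
    return path + ("?" + parts.query if parts.query else "")


def is_disallowed(url, disallow_rules):
    """True if any non-empty rule is a character-for-character prefix
    of the URL's path (query string included)."""
    target = path_with_query(url)
    return any(rule and target.startswith(rule) for rule in disallow_rules)


# Assumed rule list: the one Disallow entry quoted in the message.
rules = ["/forums/moderator.php"]

for url in (
    "/forums/moderator.php?s=232093423908",
    "/forums/showthread.php?s=&threadid=24",
):
    print(url, "->", "skip" if is_disallowed(url, rules) else "index")
```

Under this reading, the moderator.php URL is skipped regardless of its
session parameter, while showthread.php stays indexable because no rule is
a prefix of its path.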

Reply: <http://search.mnogo.ru/board/message.php?id=2006>

