Author: Ivan Mikhnevich
Email: [EMAIL PROTECTED]
Message:
Hello, everybody and especially developers of mnoGoSearch.

I've found a bug in robots.txt standard implementation. The only path, without 
filenames, is compared with Disallow-lines from robots.txt. While the standard at 
http://www.robotstxt.org/wc/norobots.html declares that Disallow line should be 
compared with path with filename.

The following code fixes it:
mnoGoSeach 3.1.12, robots.c (line 29)

UDM_ROBOT * UdmFindRobots(UDM_ENV * Conf,UDM_URL * URL){
        size_t i;
        char path[UDM_URLSIZE];

        strcpy(path, URL->path);
        strcat(path, URL->filename);

        for(i=0;i<Conf->Robots.nrobots;i++){
                
if(!strcmp(URL->hostinfo,Conf->Robots.robots[i].hostinfo)&&!strncmp(path,Conf->Robots.robots[i].path,strlen(Conf->Robots.robots[i].path)))
                        return(&Conf->Robots.robots[i]);
        }
        return(NULL);
}

Regards, Ivan

Reply: <http://search.mnogo.ru/board/message.php?id=2120>

___________________________________________
If you want to unsubscribe send "unsubscribe general"
to [EMAIL PROTECTED]

Reply via email to