Hi, Try run indexer with -v 6 option to see, why allowed or disallowed every link.
Jody McIntyre wrote: > I am trying to index our mailing list archives, starting at > https://mail.oeone.com/ > > I compiled mnogosearch with the --with-openssl option. Indexer doesn't follow > any links from https://mail.oeone.com/mailman/listinfo/ . An indexer run > follows: > > [root@rome doc]# /usr/local/mnogosearch/sbin/indexer -a -v 4 > Indexer[23861]: indexer from mnogosearch-3.1.19/MySQL started with >'/usr/local/mnogosearch/etc/indexer.conf' > Indexer[23861]: [1] https://mail.oeone.com/ > Indexer[23861]: [1] HTTP/1.1 200 OK text/html 351 > Indexer[23861]: [1] https://mail.oeone.com/images/logo.jpg > Indexer[23861]: [1] HTTP/1.1 200 OK image/jpeg 9328 > Indexer[23861]: [1] https://mail.oeone.com/mailman/listinfo/ > Indexer[23861]: [1] HTTP/1.1 200 OK text/html 0 > Indexer[23861]: [1] https://mail.oeone.com/webmail/ > Indexer[23861]: [1] HTTP/1.1 302 Found text/html 0 > Indexer[23861]: [1] https://mail.oeone.com/webmail/src/login.php > Indexer[23861]: [1] HTTP/1.1 200 OK text/html 1454 > Indexer[23861]: [1] https://mail.oeone.com/webmail/images/sm_logo.png > Indexer[23861]: [1] HTTP/1.1 200 OK image/png 7396 > Indexer[23861]: [1] Done (1 seconds) > > I have attached my indexer.conf file with comments stripped and password > removed. Can anyone see anything obviously wrong? If anyone wants to > try, the site is publically accessible but /pipermail/ requires authentication. > You should still be able to index the links from /mailman/listinfo though. > > Any suggestions would be appreciated. > > Thanks, > Jody > > > > ------------------------------------------------------------------------ > > DBAddr mysql://mnogo:removed@localhost/mnogosearch/ > DBMode crc-multi > Phrase yes > CrossWords yes > Allow * > Disallow *.b *.sh *.md5 *.rpm > Disallow *.arj *.tar *.zip *.tgz *.gz *.z *.bz2 > Disallow *.lha *.lzh *.rar *.zoo *.ha *.tar.Z > Disallow *.gif *.jpg *.jpeg *.bmp *.tiff *.tif *.xpm *.xbm *.pcx > Disallow *.vdo *.mpeg *.mpe *.mpg *.avi *.movie *.mov *.dat > Disallow *.mid *.mp3 *.rm *.ram *.wav *.aiff *.ra > Disallow *.vrml *.wrl *.png > Disallow *.exe *.com *.cab *.dll *.bin *.class *.ex_ > Disallow *.tex *.texi *.xls *.doc *.texinfo > Disallow *.rtf *.pdf *.cdf *.ps > Disallow *.ai *.eps *.ppt *.hqx > Disallow *.cpt *.bms *.oda *.tcl > Disallow *.o *.a *.la *.so > Disallow *.pat *.pm *.m4 *.am *.css > Disallow *.map *.aif *.sit *.sea > Disallow *.m3u *.qt *.mov > Disallow *D=A *D=D *M=A *M=D *N=A *N=D *S=A *S=D > Disallow Regex \.r[0-9][0-9]$ \.a[0-9][0-9]$ \.so\.[0-9]$ > AddType text/plain *.txt *.pl *.js *.h *.c *.pm *.e > AddType text/html *.html *.htm > AddType image/x-xpixmap *.xpm > AddType image/x-xbitmap *.xbm > AddType image/gif *.gif > AddType application/unknown *.* > Period 1d > MaxHops 256 > Follow site > Server https://mail.oeone.com/ > -- Maxim Zakharov http://sochi.net.ru/~maxime/ Sochi, Russia http://www.sochi.com/ ___________________________________________ If you want to unsubscribe send "unsubscribe general" to [EMAIL PROTECTED]
