I am trying to index our mailing list archives, starting at https://mail.oeone.com/
I compiled mnogosearch with the --with-openssl option. Indexer doesn't follow any links from https://mail.oeone.com/mailman/listinfo/ . An indexer run follows: [root@rome doc]# /usr/local/mnogosearch/sbin/indexer -a -v 4 Indexer[23861]: indexer from mnogosearch-3.1.19/MySQL started with '/usr/local/mnogosearch/etc/indexer.conf' Indexer[23861]: [1] https://mail.oeone.com/ Indexer[23861]: [1] HTTP/1.1 200 OK text/html 351 Indexer[23861]: [1] https://mail.oeone.com/images/logo.jpg Indexer[23861]: [1] HTTP/1.1 200 OK image/jpeg 9328 Indexer[23861]: [1] https://mail.oeone.com/mailman/listinfo/ Indexer[23861]: [1] HTTP/1.1 200 OK text/html 0 Indexer[23861]: [1] https://mail.oeone.com/webmail/ Indexer[23861]: [1] HTTP/1.1 302 Found text/html 0 Indexer[23861]: [1] https://mail.oeone.com/webmail/src/login.php Indexer[23861]: [1] HTTP/1.1 200 OK text/html 1454 Indexer[23861]: [1] https://mail.oeone.com/webmail/images/sm_logo.png Indexer[23861]: [1] HTTP/1.1 200 OK image/png 7396 Indexer[23861]: [1] Done (1 seconds) I have attached my indexer.conf file with comments stripped and password removed. Can anyone see anything obviously wrong? If anyone wants to try, the site is publically accessible but /pipermail/ requires authentication. You should still be able to index the links from /mailman/listinfo though. Any suggestions would be appreciated. Thanks, Jody -- Jody McIntyre, [EMAIL PROTECTED] - OEone Corporation, Hull, Quebec, Canada NMIC NORAD Lexis-Nexis SUSLO SALDV Clandestine CALCM Reno 5ESS Whitehouse CBNRC ASO SETA BITNET terrorist BLU-97 A/B USCG CDC USCODE MD4 S/Key DREC
DBAddr mysql://mnogo:removed@localhost/mnogosearch/ DBMode crc-multi Phrase yes CrossWords yes Allow * Disallow *.b *.sh *.md5 *.rpm Disallow *.arj *.tar *.zip *.tgz *.gz *.z *.bz2 Disallow *.lha *.lzh *.rar *.zoo *.ha *.tar.Z Disallow *.gif *.jpg *.jpeg *.bmp *.tiff *.tif *.xpm *.xbm *.pcx Disallow *.vdo *.mpeg *.mpe *.mpg *.avi *.movie *.mov *.dat Disallow *.mid *.mp3 *.rm *.ram *.wav *.aiff *.ra Disallow *.vrml *.wrl *.png Disallow *.exe *.com *.cab *.dll *.bin *.class *.ex_ Disallow *.tex *.texi *.xls *.doc *.texinfo Disallow *.rtf *.pdf *.cdf *.ps Disallow *.ai *.eps *.ppt *.hqx Disallow *.cpt *.bms *.oda *.tcl Disallow *.o *.a *.la *.so Disallow *.pat *.pm *.m4 *.am *.css Disallow *.map *.aif *.sit *.sea Disallow *.m3u *.qt *.mov Disallow *D=A *D=D *M=A *M=D *N=A *N=D *S=A *S=D Disallow Regex \.r[0-9][0-9]$ \.a[0-9][0-9]$ \.so\.[0-9]$ AddType text/plain *.txt *.pl *.js *.h *.c *.pm *.e AddType text/html *.html *.htm AddType image/x-xpixmap *.xpm AddType image/x-xbitmap *.xbm AddType image/gif *.gif AddType application/unknown *.* Period 1d MaxHops 256 Follow site Server https://mail.oeone.com/
