I am trying to index our mailing list archives, starting at
https://mail.oeone.com/

I compiled mnogosearch with the --with-openssl option.  Indexer doesn't follow
any links from https://mail.oeone.com/mailman/listinfo/ .  An indexer run
follows:

[root@rome doc]# /usr/local/mnogosearch/sbin/indexer -a -v 4
Indexer[23861]: indexer from mnogosearch-3.1.19/MySQL started with 
'/usr/local/mnogosearch/etc/indexer.conf'
Indexer[23861]: [1] https://mail.oeone.com/
Indexer[23861]: [1] HTTP/1.1 200 OK text/html 351
Indexer[23861]: [1] https://mail.oeone.com/images/logo.jpg
Indexer[23861]: [1] HTTP/1.1 200 OK image/jpeg 9328
Indexer[23861]: [1] https://mail.oeone.com/mailman/listinfo/
Indexer[23861]: [1] HTTP/1.1 200 OK text/html 0
Indexer[23861]: [1] https://mail.oeone.com/webmail/
Indexer[23861]: [1] HTTP/1.1 302 Found text/html 0
Indexer[23861]: [1] https://mail.oeone.com/webmail/src/login.php
Indexer[23861]: [1] HTTP/1.1 200 OK text/html 1454
Indexer[23861]: [1] https://mail.oeone.com/webmail/images/sm_logo.png
Indexer[23861]: [1] HTTP/1.1 200 OK image/png 7396
Indexer[23861]: [1] Done (1 seconds)

I have attached my indexer.conf file with comments stripped and password
removed.  Can anyone see anything obviously wrong?  If anyone wants to
try, the site is publically accessible but /pipermail/ requires authentication.
You should still be able to index the links from /mailman/listinfo though.

Any suggestions would be appreciated.

Thanks,
Jody

-- 
Jody McIntyre, [EMAIL PROTECTED] - OEone Corporation, Hull, Quebec, Canada

NMIC NORAD Lexis-Nexis SUSLO SALDV Clandestine CALCM Reno 5ESS Whitehouse 
CBNRC ASO SETA BITNET terrorist BLU-97 A/B USCG CDC USCODE MD4 S/Key DREC 
DBAddr          mysql://mnogo:removed@localhost/mnogosearch/
DBMode crc-multi
Phrase yes
CrossWords yes
Allow *
Disallow *.b    *.sh   *.md5  *.rpm
Disallow *.arj  *.tar  *.zip  *.tgz  *.gz   *.z     *.bz2 
Disallow *.lha  *.lzh  *.rar  *.zoo  *.ha   *.tar.Z
Disallow *.gif  *.jpg  *.jpeg *.bmp  *.tiff *.tif   *.xpm  *.xbm *.pcx
Disallow *.vdo  *.mpeg *.mpe  *.mpg  *.avi  *.movie *.mov  *.dat
Disallow *.mid  *.mp3  *.rm   *.ram  *.wav  *.aiff  *.ra
Disallow *.vrml *.wrl  *.png
Disallow *.exe  *.com  *.cab  *.dll  *.bin  *.class *.ex_
Disallow *.tex  *.texi *.xls  *.doc  *.texinfo
Disallow *.rtf  *.pdf  *.cdf  *.ps
Disallow *.ai   *.eps  *.ppt  *.hqx
Disallow *.cpt  *.bms  *.oda  *.tcl
Disallow *.o    *.a    *.la   *.so 
Disallow *.pat  *.pm   *.m4   *.am   *.css
Disallow *.map  *.aif  *.sit  *.sea
Disallow *.m3u  *.qt   *.mov
Disallow *D=A *D=D *M=A *M=D *N=A *N=D *S=A *S=D
Disallow Regex \.r[0-9][0-9]$ \.a[0-9][0-9]$ \.so\.[0-9]$
AddType text/plain      *.txt  *.pl *.js *.h *.c *.pm *.e
AddType text/html       *.html *.htm
AddType image/x-xpixmap *.xpm
AddType image/x-xbitmap *.xbm
AddType image/gif       *.gif
AddType application/unknown *.*
Period 1d
MaxHops 256
Follow site
Server https://mail.oeone.com/

Reply via email to