Author: Michael Huber
Email: [EMAIL PROTECTED]
Message:
Whenever I run the indexer on our intranet site, it never stops; after several minutes 
we kill the process.  The details of our indexer.conf are below.  I ran indexer -v 5 
./indexer.conf and I see the same URLs being repeated over and over again.  I would 
expect the images used in the menu to appear repeatedly, but the indexer seems 
to recurse over our site constantly.
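
One rough way to confirm which URLs are actually being revisited is to capture the verbose output to a file and tally every URL that appears in it. The sketch below is only an illustration: it assumes nothing about the indexer's log format beyond URLs appearing as plain http:// or https:// strings, and the helper names (count_urls, repeated) and the log file name in the usage note are made up for this example.

```python
import re
from collections import Counter

# Matches anything that looks like an http(s) URL up to the next whitespace.
URL_RE = re.compile(r"https?://\S+")


def count_urls(lines):
    """Count how often each URL appears in an iterable of log lines."""
    counts = Counter()
    for line in lines:
        for url in URL_RE.findall(line):
            # Drop trailing punctuation that is probably not part of the URL.
            counts[url.rstrip(".,;)'\"")] += 1
    return counts


def repeated(counts, threshold=2):
    """Return (url, count) pairs seen at least `threshold` times, most frequent first."""
    return [(url, n) for url, n in counts.most_common() if n >= threshold]
```

For example, after saving the output of the run to a file (called indexer.log here purely for illustration), `repeated(count_urls(open("indexer.log")))` lists the URLs that show up more than once, which should make it easier to see whether the repeats are only the menu images or whole pages.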

# indexer.conf for intranet.solipsys.com

DBAddr          pgsql://mjhuber@localhost/udm-intranet.solipsys.com/
DBMode          single

Server http://intranet.solipsys.com/

Allow .*

# Exclude some known extensions using fast "String" match:
Disallow *.b    *.sh   *.md5  *.rpm
Disallow *.arj  *.tar  *.zip  *.tgz  *.gz   *.z     *.bz2 
Disallow *.lha  *.lzh  *.rar  *.zoo  *.ha   *.tar.Z
Disallow *.gif  *.jpg  *.jpeg *.bmp  *.tiff *.tif   *.xpm  *.xbm *.pcx
Disallow *.vdo  *.mpeg *.mpe  *.mpg  *.avi  *.movie *.mov  *.dat
Disallow *.mid  *.mp3  *.rm   *.ram  *.wav  *.aiff  *.ra
Disallow *.vrml *.wrl  *.png
Disallow *.exe  *.com  *.cab  *.dll  *.bin  *.class *.ex_
Disallow *.tex  *.texi *.xls  *.doc  *.texinfo
Disallow *.rtf  *.pdf  *.cdf  *.ps
Disallow *.ai   *.eps  *.ppt  *.hqx
Disallow *.cpt  *.bms  *.oda  *.tcl
Disallow *.o    *.a    *.la   *.so 
Disallow *.pat  *.pm   *.m4   *.am   *.css
Disallow *.map  *.aif  *.sit  *.sea
Disallow *.m3u  *.qt   *.mov

Disallow Regex ~

# Exclude Apache directory list in different sort order using "string" match:
Disallow *D=A *D=D *M=A *M=D *N=A *N=D *S=A *S=D

# More complicated case. RAR .r00-.r99, ARJ a00-a99 files 
# and unix shared libraries. We use "Regex" match type here:
Disallow Regex \.r[0-9][0-9]$ \.a[0-9][0-9]$ \.so\.[0-9]$

AddType text/plain      *.txt  *.pl *.js *.h *.c *.pm *.e
AddType text/html       *.html *.htm
AddType image/x-xpixmap *.xpm
AddType image/x-xbitmap *.xbm
AddType image/gif       *.gif

AddType application/unknown *.*

# end indexer.conf

Reply: <http://www.mnogosearch.org/board/message.php?id=3840>
