Author: Michael Huber
Email: [EMAIL PROTECTED]
Message:
Whenever I run the indexer on our intranet site, it never stops; after several minutes we kill the process. The details of our indexer.conf are below. I ran indexer -v 5 ./indexer.conf and I see the same URLs being repeated over and over again. I would expect the images used in the menu to appear repeatedly, but the indexer seems to constantly recurse over our site.

# indexer.conf for intranet.solipsys.com
DBAddr pgsql://mjhuber@localhost/udm-intranet.solipsys.com/
DBMode single
Server http://intranet.solipsys.com/
Allow .*

# Exclude some known extensions using fast "String" match:
Disallow *.b *.sh *.md5 *.rpm
Disallow *.arj *.tar *.zip *.tgz *.gz *.z *.bz2
Disallow *.lha *.lzh *.rar *.zoo *.ha *.tar.Z
Disallow *.gif *.jpg *.jpeg *.bmp *.tiff *.tif *.xpm *.xbm *.pcx
Disallow *.vdo *.mpeg *.mpe *.mpg *.avi *.movie *.mov *.dat
Disallow *.mid *.mp3 *.rm *.ram *.wav *.aiff *.ra
Disallow *.vrml *.wrl *.png
Disallow *.exe *.com *.cab *.dll *.bin *.class *.ex_
Disallow *.tex *.texi *.xls *.doc *.texinfo
Disallow *.rtf *.pdf *.cdf *.ps
Disallow *.ai *.eps *.ppt *.hqx
Disallow *.cpt *.bms *.oda *.tcl
Disallow *.o *.a *.la *.so
Disallow *.pat *.pm *.m4 *.am *.css
Disallow *.map *.aif *.sit *.sea
Disallow *.m3u *.qt *.mov
Disallow Regex ~

# Exclude Apache directory list in different sort order using "string" match:
Disallow *D=A *D=D *M=A *M=D *N=A *N=D *S=A *S=D

# More complicated case. RAR .r00-.r99, ARJ a00-a99 files
# and unix shared libraries. We use "Regex" match type here:
Disallow Regex \.r[0-9][0-9]$ \.a[0-9][0-9]$ \.so\.[0-9]$

AddType text/plain *.txt *.pl *.js *.h *.c *.pm *.e
AddType text/html *.html *.htm
AddType image/x-xpixmap *.xpm
AddType image/x-xbitmap *.xbm
AddType image/gif *.gif
AddType application/unknown *.*
# end indexer.conf

Reply: <http://www.mnogosearch.org/board/message.php?id=3840>
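
To pin down which URLs the verbose run keeps revisiting, it may help to count them in a saved copy of the output. The following is only a rough sketch: it assumes that each visited URL shows up in the log as a plain http:// or https:// token, which I have not checked against the exact format that indexer -v 5 prints. The script name count_urls.py and the log file name indexer.log are made up for illustration.

```python
import re
import sys
from collections import Counter

# Assumption: every URL in the log is a whitespace-delimited http(s) token.
URL_RE = re.compile(r"https?://\S+")


def count_urls(lines):
    """Count how many times each URL appears across the given log lines."""
    counts = Counter()
    for line in lines:
        counts.update(URL_RE.findall(line))
    return counts


def repeated(counts, threshold=2):
    """Return (url, count) pairs seen at least `threshold` times, most frequent first."""
    return [(url, n) for url, n in counts.most_common() if n >= threshold]


if __name__ == "__main__":
    # Read the saved log from standard input and print the repeated URLs.
    for url, n in repeated(count_urls(sys.stdin)):
        print(f"{n:6d}  {url}")
```

One possible way to use it: save the output of the run to a file (for example indexer -v 5 ./indexer.conf > indexer.log 2>&1), kill the indexer as before, then run python3 count_urls.py < indexer.log. If the URLs at the top of the list differ only in a query string or a session-like path segment, that would suggest the crawl is following an endless chain of distinct links rather than refetching one page.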
