I'm testing an indexer.conf file with 3.1.19.

In my indexer.conf I have:

# Do not index the index pages
HrefOnly \/$ \/index\.html$ \.index\.html$ \_index\.html$ \/*_index\.html$
HrefOnly \/.*index\.html$

I run indexer for about 5 seconds and then query to check that the
index pages are indeed not being indexed. However, I get results like:

        * http://www.austinchronicle.com/issues/dispatch/2002-02-01/

        * http://www.austinchronicle.com/issues/dispatch/2002-01-18/

These are exactly the pages I don't want indexed.
Any ideas?
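
As a quick sanity check on the patterns themselves, here is a minimal
sketch in Python that tests the HrefOnly patterns above against the two
URLs from the results. Python's re module is only a stand-in: it is an
assumption that it treats these simple patterns the same way the
indexer's own matching does. The helper name matching_patterns is
made up for this illustration.

```python
import re

# Patterns copied from the HrefOnly lines above, including the one that
# was wrapped onto its own line by the mailer.
HREF_ONLY_PATTERNS = [
    r"\/$",
    r"\/index\.html$",
    r"\.index\.html$",
    r"\_index\.html$",
    r"\/*_index\.html$",
    r"\/.*index\.html$",
]

# The two URLs that still show up in the search results.
URLS = [
    "http://www.austinchronicle.com/issues/dispatch/2002-02-01/",
    "http://www.austinchronicle.com/issues/dispatch/2002-01-18/",
]


def matching_patterns(url, patterns=HREF_ONLY_PATTERNS):
    """Return the patterns that match anywhere in url (unanchored search).

    Assumption: Python's re is close enough to the indexer's regex
    handling for patterns this simple. This is not the indexer's code.
    """
    return [p for p in patterns if re.search(p, url)]


for url in URLS:
    print(url, "->", matching_patterns(url))
```

Under that assumption, only the \/$ pattern matches either URL, so as
regular expressions the patterns do look like they should catch these
pages.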

My entire indexer.conf is:

# Yeah, this rulez
DBMode crc-multi

Robots no

LocalCharset iso-8859-1
CharSet iso-8859-1

# Ispell   
#Affix en en.aff
#Spell en en.dict

#FollowOutside No

ReadTimeOut 99999999 
MaxNetErrors 9999999999999

# MinWordLength 3 

#DeleteNoServer yes

# WEIGHTS
BodyWeight 1 
# Weight of the words in the <META NAME="Description" Content="...">
DescWeight 3
# Weight of the words in the <META NAME="Keywords" Content="...">
KeywordWeight 5 

# Our site
AuthBasic guest:guest
Server  http://www.austinchronicle.com/issues/
Alias http://www.austinchronicle.com/issues/ http://daisy/issues/ 

#UseRemoteContentType no
AddType text/plain *

# Do not index the index pages
HrefOnly \/$ \/index\.html$ \.index\.html$ \_index\.html$ \/*_index\.html$
HrefOnly \/1999\.html$ \/2000\.html$ \/1998\.html$ \/1997\.html$ \/1996\.html$
HrefOnly \/1995\.html$ \/2001\.html$ \/2002\.html$ \/film.html$ \/adverts\.html$
HrefOnly \/arts\.listings\/ \/music\.clubs\.html$ \/screens\.film\.html$
HrefOnly \/Film\d\d\.html$ \/screens\.filmtimes\.html$ \/showtimes\.html$
HrefOnly \/clubs\.html \/music\.clubs\/ \/music\.roadshows\.html$
HrefOnly /$

# Get rid of junk
Disallow \.bak$ /nav/ /temp/ /current/ /deep_focus/ /not_current/
Disallow /authors/ /calendar/ /filmvault/ /tmp/
Disallow /weeklywire\.com/ /bin/ /etc/ /lib/ /musicreg/\d
Disallow /cgi-bin/ /cgi/ /images/ /temp/

# Allow some known extensions and directory index
Allow \.html$ \.htm$

# Disallow everything else
Disallow .*



----- End forwarded message -----