Author: Lindsey Simon
Email: [EMAIL PROTECTED]
Message:

I cannot get the HrefOnly directive in my configuration file to
work for index pages. I have this line:

HrefOnly \/$

Shouldn't that prevent URL's ending in / from being indexed?

But I'm still getting search results like:
        * http://www.austinchronicle.com/issues/dispatch/columns/


Additionally, I noticed that when I added the line:
FollowOutside no (which I know is the default)

I get:
/usr/local/encap/mnogosearch-3.1.19/etc/indexer.conf' line 18:
Unknown follow type: Outside



****************************
My indexer.conf in entirety:

# Yeah, this rulez
DBMode crc-multi

Robots no

LocalCharset iso-8859-1
CharSet iso-8859-1

# Ispell   
#Affix en en.aff
#Spell en en.dict

#DeleteNoServer yes

# WEIGHTS
BodyWeight 1
# Weight of the words in the <META NAME="Description"
Content="...">
DescWeight 3
# Weight of the words in the <META NAME="Keywords" Content="...">
KeywordWeight 5

# Our site
AuthBasic guest:guest
Server  http://www.austinchronicle.com/issues/
Alias http://www.austinchronicle.com/issues/ http://daisy/issues/

#UseRemoteContentType no
#AddType text/plain *

# Do not index the index pages
HrefOnly \/$
HrefOnly \/index\.html$ \.index\.html$ \_index\.html$
\/*_index\.html$
HrefOnly \/.*index\.html$
HrefOnly \/1999\.html$ \/2000\.html$ \/1998\.html$ \/1997\.html$
\/1996\.html$
HrefOnly \/1995\.html$ \/2001\.html$ \/2002\.html$ \/film.html$
\/adverts\.html$
HrefOnly \/arts\.listings\/ \/music\.clubs\.html$
\/screens\.film\.html$
HrefOnly \/Film\d\d\.html$ \/screens\.filmtimes\.html$
\/showtimes\.html$
HrefOnly \/clubs\.html \/music\.clubs\/ \/music\.roadshows\.html$


# Get rid of junk 
Disallow \.bak$ /nav/ /temp/ /current/ /deep_focus/ /not_current/
Disallow /authors/ /calendar/ /filmvault/ /tmp/
Disallow /weeklywire\.com/ /bin/ /etc/ /lib/ /musicreg/\d
Disallow /cgi-bin/ /cgi/ /images/ /temp/

# Allow some known extensions and directory index
Allow \.html$ \.htm$ \.php$

# Disallow everything else
Disallow .*


Reply: <http://www.mnogosearch.org/board/message.php?id=4371>

___________________________________________
If you want to unsubscribe send "unsubscribe general"
to [EMAIL PROTECTED]

Reply via email to