> Im having trouble with the server_aliases however... ive tried a number of
> different combinations from searching the mailling list archive and reading the
>online help, but cant seem to get it right....
>
> Our site has 16 aliases (8 really, but both WWW and no WWW refer to the same
>site)... all of which point to the same site
>
> The problem is when i search for say the world 'help' it'll retrieve 5 or 6
>dupilicates the only thing different being the URL pointing to this page.
> This leads me to believe alot of duplication might be going on, and the database is
>larger then it needs to be.. (not to mention the duplicate results returned to the
>user)
Yup. htdig can't tell the difference between a server alias and a
virtual host. I had problems with this myself.
> allow_virtual_hosts: false
If you have virtual hosts, remove the above line.
> server_aliases: www.internettrash.com=internettrash.com \
I wrote a small script to generate a list of server aliases, that are
_not_ virtual hosts. My config includes:
server_aliases: `${config_dir}/tum.server_aliases`
and tum.server_aliases is generated by the shell script in
http://tum-index.www.ze.tu-muenchen.de/mapnames.pl
Format of tum.server_aliases:
Kinglui.fsmb.mw.tu-muenchen.de:80=www.fsmb.mw.tu-muenchen.de:80
Nathan.prakt.physik.tu-muenchen.de:80=www.prakt.physik.tu-muenchen.de:80
a12.cip.bauwesen.tu-muenchen.de:80=www.geodi.verm.tu-muenchen.de:80
[...]
-Walter
--
Walter Hafner__________________________________ [EMAIL PROTECTED]
<A href=http://www.tum.de/~hafner/>*CLICK*</A>
"Multiple exclamation marks," he went on, shaking his head,
"are a sure sign of a diseased mind." (Terry Pratchett, "Eric")
------------------------------------
To unsubscribe from the htdig mailing list, send a message to
[EMAIL PROTECTED] containing the single word "unsubscribe" in
the SUBJECT of the message.