> Im having trouble with the server_aliases however... ive tried a number of
 > different combinations from searching the mailling list archive and reading the  
 >online help, but cant seem to get it right....
 > 
 > Our site has 16 aliases (8 really, but both WWW and no WWW refer to the same 
 >site)... all of which point to the same site
 > 
 > The problem is when i search for say the world 'help'  it'll retrieve 5 or 6 
 >dupilicates the only thing different being the URL pointing to this page.
 > This leads me to believe alot of duplication might be going on, and the database is 
 >larger then it needs to be.. (not to mention the duplicate results returned to the 
 >user)

Yup. htdig can't tell the difference between a server alias and a
virtual host. I had problems with this myself.

 > allow_virtual_hosts: false    

If you have virtual hosts, remove the above line.

 > server_aliases: www.internettrash.com=internettrash.com \            

I wrote a small script to generate a list of server aliases, that are
_not_ virtual hosts. My config includes:

server_aliases: `${config_dir}/tum.server_aliases`

and tum.server_aliases is generated by the shell script in

http://tum-index.www.ze.tu-muenchen.de/mapnames.pl

Format of tum.server_aliases:

Kinglui.fsmb.mw.tu-muenchen.de:80=www.fsmb.mw.tu-muenchen.de:80
Nathan.prakt.physik.tu-muenchen.de:80=www.prakt.physik.tu-muenchen.de:80
a12.cip.bauwesen.tu-muenchen.de:80=www.geodi.verm.tu-muenchen.de:80
[...]

-Walter

-- 
Walter Hafner__________________________________ [EMAIL PROTECTED]
         <A href=http://www.tum.de/~hafner/>*CLICK*</A>
  "Multiple exclamation marks," he went on, shaking his head,
"are a sure sign of a diseased mind."  (Terry Pratchett, "Eric")
------------------------------------
To unsubscribe from the htdig mailing list, send a message to
[EMAIL PROTECTED] containing the single word "unsubscribe" in
the SUBJECT of the message.

Reply via email to