On Wed, 14 Apr 1999 [EMAIL PROTECTED] wrote:

> Looking for some help with server_aliases
> 
> I'm having trouble with server_aliases, however. I've tried a
> number of different combinations from searching the mailing list
> archive and reading the online help, but can't seem to get it right.
> 
> Our site has 16 aliases (8 really, but both the www and non-www forms
> refer to the same site), all of which point to the same site.
> 
> The problem is that when I search for, say, the word 'help', it retrieves 5
> or 6 duplicates, the only difference being the URL pointing to
> the page. This leads me to believe a lot of duplication might be going
> on and the database is larger than it needs to be (not to mention
> the duplicate results returned to the user).
> 
> Here is how the relevant configuration attributes are set:
> 
> limit_urls_to:  nettrash.com netjunk.com netgarbage.com nettoilet.com
> #limit_urls_to:  internettrash.com    (also tried just internettrash.com with no luck)
> 
> limit_normalized: http://internettrash.com
>                      
> start_url:              http://internettrash.com/ http://internettrash.com/userlist.html
> allow_virtual_hosts: false
> 
> server_aliases: www.internettrash.com=internettrash.com \            
>                 www.internetgarbage.com=internettrash.com \          
>                     internetgarbage.com=internettrash.com \        
> [...]

You should add the port number to both sides of each alias:
server_aliases: www.internettrash.com:80=internettrash.com:80 \
                www.internetgarbage.com:80=internettrash.com:80 \
                    internetgarbage.com:80=internettrash.com:80 \

Hope that helps!
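
With 16 aliases, typing the port onto every entry by hand is error-prone, so
here is a rough sketch of a helper that prints the attribute for you. The
function name build_server_aliases is made up for illustration, and it
assumes every host answers on port 80, as in the example above. Only three
of the aliases are listed; add the rest yourself.

```python
def build_server_aliases(canonical, aliases, port=80):
    """Return a server_aliases attribute with an explicit port on both sides.

    Hypothetical helper: assumes every alias and the canonical host
    share the same port.
    """
    target = f"{canonical}:{port}"
    entries = [f"{alias}:{port}={target}" for alias in aliases]
    # Join the entries with a backslash line continuation, as in the config.
    return "server_aliases: " + " \\\n                ".join(entries)


if __name__ == "__main__":
    aliases = [
        "www.internettrash.com",
        "www.internetgarbage.com",
        "internetgarbage.com",
    ]
    print(build_server_aliases("internettrash.com", aliases))
```

Paste the printed lines into your config file in place of the existing
server_aliases attribute.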

Cheers,
         Leo

-----------------------------------------------------------------------
Alexander (Leo) Bergolth                          [EMAIL PROTECTED]
WU-Wien - Zentrum fuer Informatikdienste       http://leo.wu-wien.ac.at
Info Center
Linux - because reboots are for hardware changes
