According to ROLLE WALRAVEN:
> What follows is the debug (level 1) information i'm getting and man conf file,
> the local_url is over an nfs mount that is virtually a direct LAN connection on
> a 100Mb NIC .. any other ideas? The reason that there are two "local_urls" is
> b/c the site is bits and pieces of www.xxx.com and xxx.com
...
> New server: xxx, 80
> 0:0:0:http://xxx/new/: ---------+++---+++*++*++++++++++-+++++-+------+ size =
> 25757
> 1:1:1:http://xxx/new/lcurve/default.htm: -++*++++-+- size = 3731
> ...
> ...
> 7386:7386:3:http://xxx/new/stories/s58024.htm: ******-- size = 4394
> 7387:7387:3:http://xxx/new/stories/s57399.htm:  not found
> 7388:7388:3:http://xxx/new/stories/s57238.htm:  not found
> 7389:7389:3:http://xxx/new/stories/s56919.htm: Unable to build connection with
> xxx:80
>  no server running
> 7390:7390:3:http://xxx/new/stories/s56400.htm:  no server running
> 7391:7391:3:http://xxx/new/s56936.htm:  no server running
...
> database_dir:           /var/spool/htdig/db.new
> limit_urls_to:          ${start_url}
> exclude_urls:           /cgi-bin/ .cgi
> bad_extensions:         .wav .gz .z .sit .au .zip .tar .hqx .exe .com .gif \
>                 .jpg .jpeg .aiff .class .map .ram .tgz .bin .rpm .mpg .mov .avi
> .dat .txt .bak .old .sav .log .LOG
> maintainer:             webmaster@xxx
> max_head_length:        1000000
> max_doc_size:           64000000
> no_excerpt_show_top:    true
> search_algorithm:       exact:1 synonyms:0.5 endings:0.1
> template_map: Long long ${common_dir}/long.html \
>                 Short short ${common_dir}/short.html \
>                 XXX xxx /opt/www/htdocs/xxx-template.html
> search_results_header: /opt/www/htdocs/xxx-header.html
> search_results_footer: /opt/www/htdocs/xxx-footer.html
> next_page_text:         Next<IMG SRC="http://xxx/img/arrow2right.gif"; WIDTH="20"
> HEIGHT="19" BORDER="0" ALIGN="middle" AL
> T="next">
> no_next_page_text:
> prev_page_text:         <img src="http://xxx/img/arrow2left.gif"; border="0"
> align="middle" width="20" height="19" alt="pr
> ev">Prev
> no_prev_page_text:
> page_number_text:       '1' '2' '3' '4' '5' '6' '7' '8' '9' '10' 
> no_page_number_text:    '<b>1</b>' \
>                         '<b>2</b>' \
>                         '<b>3</b>' \
>                         '<b>4</b>' \
>                         '<b>5</b>' \
>                         '<b>6</b>' \
>                         '<b>7</b>' \
>                         '<b>8</b>' \
>                         '<b>9</b>' \
>                         '<b>10</b>' 
> 
> local_urls_only: true
> local_urls:     http://xxx/new/=/web/xxx/new/
> local_urls:     http://xxx2/new/=/web/xxx/new/
> start_url:      http://xxx/new/
> 
> 
> use_meta_description: true

Well, you can't just define an attribute twice and expect htdig to remember
and use both settings.  The second setting of local_urls will override
the first one, so the first one isn't used.  In 3.1.5, the local_urls_only
attribute only affects URLs that are actually covered by the local_urls
and local_user_urls settings, so http://xxx2/new/ URLs will only be
fetched locally, but all other URLs, including http://xxx/new/, will be
fetched only via HTTP.  What you want is...

local_urls:     http://xxx/new/=/web/xxx/new/ \
                http://xxx2/new/=/web/xxx/new/

By the way, if you're going to xxx-out URLs in your debugging output,
it would be useful to do it consistently.  In your opening paragraph, you
talk about www.xxx.com and xxx.com, then in your config file you use xxx
and xxx2, and in the -v output you use only xxx, implying that htdig is
fetching some URLs locally and some URLs with the same base path by HTTP.
It makes the debugging output pretty useless when it's altered this way.

You may also want to use the server_aliases attribute to define two
or more different host names as equivalent, so they all map to a canonical
host name.

-- 
Gilles R. Detillieux              E-mail: <[EMAIL PROTECTED]>
Spinal Cord Research Centre       WWW:    http://www.scrc.umanitoba.ca/~grdetil
Dept. Physiology, U. of Manitoba  Phone:  (204)789-3766
Winnipeg, MB  R3E 3J7  (Canada)   Fax:    (204)789-3930

_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
To unsubscribe, send a message to <[EMAIL PROTECTED]> with a 
subject of unsubscribe
FAQ: http://htdig.sourceforge.net/FAQ.html

Reply via email to