According to Jim Cole:
> Willy Calderon's bits of Fri, 12 Apr 2002 translated to:
...
> >At the moment my htdig.conf file looks something like this
> ...
> >start_url:              ${common_dir}/index.html
> 
> What does the index.html file in ${common_dir} look like? It
> shouldn't be HTML. It should just be a regular text file that
> lists all of the starting URL's for your indexing run.

That's not quite right.  If you want to feed a list of URLs from a
plain text file into htdig's start_url attribute, you have to do it
something like:

start_url:      `${common_dir}/urllist.txt`

Of course, you can put any file pathname within the left quotes, but the
left quote marks are necessary for feeding a file into an attribute like
this.  (It can be used for any attribute, by the way.)

See http://www.htdig.org/cf_variables.html
and http://www.htdig.org/FAQ.html#q5.25

> If on
> the other hand you actually want to start with a single HTML
> file and dig from there, then specify a valid URL in start_url.
> For example
> 
> start_url: http://www.somedomain.com/index.html

Yes, that's the key point here.  Regardless of how you set start_url, or
how many entries you put in start_url, each entry must be a valid URL which
specifies the protocol and server.  In 3.1.x, the protocol must be "http:".

-- 
Gilles R. Detillieux              E-mail: <[EMAIL PROTECTED]>
Spinal Cord Research Centre       WWW:    http://www.scrc.umanitoba.ca/~grdetil
Dept. Physiology, U. of Manitoba  Phone:  (204)789-3766
Winnipeg, MB  R3E 3J7  (Canada)   Fax:    (204)789-3930

_______________________________________________
htdig-dev mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/htdig-dev

Reply via email to