According to Chad M. Stewart: > I took a mail archive and converted to html using mhonarc. I put all of the > html files in http://server/tmp/. > > Here are what I think are the relevant parts of the config file I used. > > start_url: http://server/tmp/ > limit_urls_to: ${start_url} > > Then when I run '../bin/rundig -vv -c ml.conf' I get > > Rejected: URL not in the limits! > url rejected: (level 1)http://server/tmp/msg00002.html > href: http://server/tmp/msg00001.html ([Ipchains] Re: Ipchains Ipmasqadm, > portfw funktionallity) > > That goes on for every msg*.html file in the directory. I give up, can some > one apply a clue stick, please be nice. :) > > I'm trying to tell htdig here is a directory of files, index them. The doc > I've read shows limit_urls_to says "...specifies a set of patterns that all > URLs have to match against..." Logic tells me that the ${start_url} string is > found in the rejected URL. > > Am I going about this the wrong way?
I certainly can't see anything obviously wrong in what you've reported from your config file. Are you sure there aren't any other limit_urls_to definitions anywhere else in ml.conf, or in any other config files that ml.conf may be including, which would override the definition quoted above? The last definition htdig encounters for a given attribute is the one it uses. What version of htdig are you using? I think the earlier 3.2 betas had some problems in limit_urls_to handling. Do you get the same error with 3.1.6 (just released today), or with the latest snapshot of 3.2.0b4 in http://www.htdig.org/files/snapshots/? -- Gilles R. Detillieux E-mail: <[EMAIL PROTECTED]> Spinal Cord Research Centre WWW: http://www.scrc.umanitoba.ca/~grdetil Dept. Physiology, U. of Manitoba Phone: (204)789-3766 Winnipeg, MB R3E 3J7 (Canada) Fax: (204)789-3930 _______________________________________________ htdig-general mailing list <[EMAIL PROTECTED]> To unsubscribe, send a message to <[EMAIL PROTECTED]> with a subject of unsubscribe FAQ: http://htdig.sourceforge.net/FAQ.html

