On Thu, 12 Oct 2000, Iosif Fettich wrote:

> hope someone has a clue. Something seems to be wrong with the way htdig
> indexes. In the config file, I have
> 
> start_url:http://www.pcconcrete.ro/phtml/
> limit_urls_to:${start_url}p0 ${start_url}r0
> 
> What I'd like to have indexed are the files that have names starting with
> 'p0' or 'r0' - and _nothing_ else.
> 
> However, when I search something, it happens that I get results that are
> in the main index of the directory /phtml/ - what shouldn't be indexed (if
> it would do it like I'd wish).
> 
> Am I doing something wrong, missing something, or is it time for looking
> into the code for a patch...?

Of course my usual suggestion is to turn on some debugging with the -vvv
flag. ;-)

My concern here is that the start_url *itself* doesn't match the pattern.
So it's going to index the index.html file first (since that's the
start_ulr) and then take the links that match from there.

--
-Geoff Hutchison
Williams Students Online
http://wso.williams.edu/


------------------------------------
To unsubscribe from the htdig mailing list, send a message to
[EMAIL PROTECTED]
You will receive a message to confirm this.
List archives:  <http://www.htdig.org/mail/menu.html>
FAQ:            <http://www.htdig.org/FAQ.html>

Reply via email to