Many directories in my web tree have several versions of a document,  
or have ancillary files that I don't want to index.

For example, a directory /vol3/iss2/art1 may contain the following files:
        figure1.html
        figure2.html
        table1.html
        frameset.html
        inline.html
        main.html

I would like to set htdig up to index ONLY the inline.html file in  
this directory. Elsewhere in the system I may want to include files  
that contain "main.html" or "figure1.html".

Basically, if the URL contains the string "art" I'd like to exclude  
everything but inline.html. If the URL doesn't include "art" then I  
want to include everything.

The documentation makes frequent mention of "patterns" but doesn't  
define them in the context of the configuration file.

Is it legal to use a pattern like:   art*main.html    in the  
exclude_urls entry?
Could I make a pattern like: art?/main.html
or can I use a regex like:  art\d/main\.html$


If this isn't the way patterns work, is there anything that I can  
do? perhaps build 2 indices, then merge?

Thanks in advance
Colin

------------------------------------
To unsubscribe from the htdig mailing list, send a message to
[EMAIL PROTECTED] containing the single word unsubscribe in
the SUBJECT of the message.

Reply via email to