Many directories in my web tree have several versions of a document,
or have ancillary files that I don't want to index.
For example, a directory /vol3/iss2/art1 may contain the following files:
figure1.html
figure2.html
table1.html
frameset.html
inline.html
main.html
I would like to set htdig up to index ONLY the inline.html file in
this directory. Elsewhere in the system I may want to include files
that contain "main.html" or "figure1.html".
Basically, if the URL contains the string "art" I'd like to exclude
everything but inline.html. If the URL doesn't include "art" then I
want to include everything.
The documentation makes frequent mention of "patterns" but doesn't
define them in the context of the configuration file.
Is it legal to use a pattern like: art*main.html in the
exclude_urls entry?
Could I make a pattern like: art?/main.html
or can I use a regex like: art\d/main\.html$
If this isn't the way patterns work, is there anything that I can
do? perhaps build 2 indices, then merge?
Thanks in advance
Colin
------------------------------------
To unsubscribe from the htdig mailing list, send a message to
[EMAIL PROTECTED] containing the single word unsubscribe in
the SUBJECT of the message.