On Wed, 13 Mar 2002, Soriana Villanueva wrote:
> Date: Wed, 13 Mar 2002 17:43:30 -0500
> From: Soriana Villanueva <[EMAIL PROTECTED]>
> To: [EMAIL PROTECTED]
> Subject: [htdig] Capturing the Current URL Being Indexed
>
>
> Greetings from a newbie...
>
> Is there such an attribute as current_url? I am indexing a site that has
> both English and French pages but only want to index the English pages. The
> pages are organized as follows: The French pages have an "f" at the end of
> their file names.
>
> http://www.domain.org/directory/index.html
> http://www.domain.org/directory/indexf.html
>
> http://www.domain.org/directory/sample.html
> http://www.domain.org/directory/samplef.html
>
> http://www.domain.org/directory/leaf.html
> http://www.domain.org/directory/leaff.html
>
> I thought of using "exclude_urls: f.html" but this would exclude the
> English page named leaf.html. This is why I was thinking of something like
> "exclude_urls: $(current_url)f.html" Is this possible or perhaps there's an
> even better solution to this?
>
> Thanks, all suggestions are welcome,
If I were you, or whoever the webmaster of that site is, I would use a
more distinctive naming convention, e.g. index.html versus index_fr.html.
An English file name is unlikely to end in _fr.html by accident, so a
pattern such as "exclude_urls: _fr.html" would then be safe.
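
To illustrate the difference, here is a rough Python sketch. It is not
ht://Dig code. It only assumes that exclude_urls rejects any URL that
contains one of the listed substrings. The helper name is_excluded and the
*_fr.html file names are made up for the example.

```python
# Hypothetical model of substring-based URL exclusion (an assumption about
# how exclude_urls behaves; this is not ht://Dig's actual implementation).
def is_excluded(url, patterns):
    """Return True if the URL contains any of the given substrings."""
    return any(p in url for p in patterns)

base = "http://www.domain.org/directory/"

# Current convention: French pages end in "f.html".
current = [base + n for n in
           ("index.html", "indexf.html", "leaf.html", "leaff.html")]

# Suggested convention: French pages end in "_fr.html".
suggested = [base + n for n in
             ("index.html", "index_fr.html", "leaf.html", "leaf_fr.html")]

# "f.html" also matches the English page leaf.html, so it is dropped too.
kept_current = [u for u in current if not is_excluded(u, ["f.html"])]

# "_fr.html" only matches the French pages.
kept_suggested = [u for u in suggested if not is_excluded(u, ["_fr.html"])]

print(kept_current)
print(kept_suggested)
```

With the current names the English leaf.html is lost along with the French
pages. With the _fr suffix both English pages survive.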
Regards,
Joe
_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
FAQ: http://htdig.sourceforge.net/FAQ.html