On Wed, 19 Dec 2007, Riccardo Cohen wrote:

> In my pages I have a menu : "product1 product2 product3"
> These menu items lead to pages describing each product.
> 
> Of course, if htdig indexes the pages normally, then searching for 
> "product1" returns every product page, because they all contain this 
> menu, while the only relevant page is product1.
> 
> So I tried to use <!--htdig_noindex-->, but then the menu is not scanned 
> at all, so the product pages are never reached and not indexed at all.
> 
> Is there any way to follow links in a section without indexing the words 
> in this section ?

If the pages in question don't need to pass HTML validation, I believe you
can enclose the menu in <noindex follow> ... </noindex> tags, so that htdig
follows the links inside but does not index the words. That is ugly in
terms of standards compliance, but it is a quick fix if you need one. A
cleaner solution, though one requiring a lot more work, would be to write a
parser that removes the text you don't want indexed, and then apply it as a
preprocessor via ht://Dig's external_parsers attribute:

  http://www.htdig.org/dev/htdig-3.2/attrs.html#external_parsers
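
A rough sketch of what such a preprocessing filter might look like, in
Python. Everything specific here is an assumption for illustration: the
<!--menu_start--> and <!--menu_end--> markers are hypothetical comments you
would add around the menu yourself, and the script simply takes the input
file as its first argument and writes the filtered HTML to stdout. How it
is actually invoked and registered under external_parsers should be checked
against the documentation linked above.

```python
import re
import sys

# Hypothetical markers placed around the menu in each page (an assumption,
# not something ht://Dig defines).
START = "<!--menu_start-->"
END = "<!--menu_end-->"

# Everything between the two markers, across line breaks.
_REGION = re.compile(
    re.escape(START) + r"(.*?)" + re.escape(END),
    re.DOTALL | re.IGNORECASE,
)
# Opening <a ...> tags only; \b keeps <abbr>, <area>, etc. from matching.
_ANCHOR = re.compile(r"<a\b[^>]*>", re.IGNORECASE)


def strip_menu_text(html: str) -> str:
    """Replace each marked region with its bare links, dropping all text."""

    def keep_links(match: re.Match) -> str:
        # Keep each opening anchor tag (with its href) and close it at once,
        # so the link survives but the link text does not.
        return "".join(tag + "</a>" for tag in _ANCHOR.findall(match.group(1)))

    return _REGION.sub(keep_links, html)


if __name__ == "__main__":
    # Assumed calling convention: input file path as the first argument,
    # filtered HTML on stdout.
    with open(sys.argv[1], encoding="utf-8", errors="replace") as f:
        sys.stdout.write(strip_menu_text(f.read()))
```

With a filter like this the links in the menu are still present for the
indexer to follow, but the words "product1 product2 product3" no longer
appear on every page. Regular expressions are a crude way to process HTML,
so a real parser would be a safer choice if the menu markup is complicated.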


Jim

_______________________________________________
ht://Dig general mailing list: <htdig-general@lists.sourceforge.net>
ht://Dig FAQ: http://htdig.sourceforge.net/FAQ.html
List information (subscribe/unsubscribe, etc.)
https://lists.sourceforge.net/lists/listinfo/htdig-general
