On Wed, 19 Dec 2007, Riccardo Cohen wrote:
> In my pages I have a menu: "product1 product2 product3"
> These menu items lead to pages describing each product.
>
> Of course, if htdig indexes the pages normally, then searching for
> "product1" gives all product pages because they all have this menu in
> it..., while the only interesting page is product1
>
> So I tried to use <!--htdig_noindex-->, but then the menu is not scanned
> at all, and the products pages are not pushed and then not indexed at all.
>
> Is there any way to follow links in a section without indexing the words
> in this section?

If you don't need to validate the pages in question, I believe you can
enclose the relevant section in <noindex follow></noindex> tags. Ugly in
terms of standards compliance, but a quick fix if you need it.

A cleaner solution, but one requiring a lot more work, would be to write
a parser that removes the text that you don't want indexed and then apply
the parser as a preprocessor using ht://Dig's external_parsers attribute.

http://www.htdig.org/dev/htdig-3.2/attrs.html#external_parsers

Jim

_______________________________________________
ht://Dig general mailing list: <htdig-general@lists.sourceforge.net>
ht://Dig FAQ: http://htdig.sourceforge.net/FAQ.html
List information (subscribe/unsubscribe, etc.)
https://lists.sourceforge.net/lists/listinfo/htdig-general
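
For the preprocessor route mentioned in the reply, here is a minimal sketch
of what such a filter might look like. It is only an illustration under
several assumptions: the menu is wrapped in the comment markers
<!--menu-start--> and <!--menu-end--> (hypothetical names, not ht://Dig
directives), the script receives the path of the fetched file as its first
argument, and the filtered HTML is written to standard output. How the
script is wired into external_parsers, and whether links with empty anchor
text are still followed, should be checked against the attribute
documentation linked above.

```python
#!/usr/bin/env python3
"""Sketch: blank out the visible text of a marked menu section, keep its tags."""
import re
import sys

# Hypothetical section markers placed in the page templates (an assumption).
MENU_SECTION = re.compile(
    r"(<!--\s*menu-start\s*-->)(.*?)(<!--\s*menu-end\s*-->)",
    re.DOTALL | re.IGNORECASE,
)
# Naive tag matcher; it does not handle '>' inside attribute values.
TAG = re.compile(r"(<[^>]*>)")


def keep_tags_only(fragment: str) -> str:
    """Drop the text between tags so only the markup (and its hrefs) remains."""
    parts = TAG.split(fragment)
    # With a capturing group, re.split puts the tags at the odd indices.
    return "".join(parts[1::2])


def strip_menu_text(html: str) -> str:
    """Remove words inside each marked menu section, leaving the links intact."""
    def blank(match: re.Match) -> str:
        start, body, end = match.groups()
        return start + keep_tags_only(body) + end

    return MENU_SECTION.sub(blank, html)


# Run as a filter only when a file path is supplied on the command line.
if __name__ == "__main__" and len(sys.argv) > 1:
    # The encoding is an assumption; adjust it to match the site.
    with open(sys.argv[1], encoding="utf-8", errors="replace") as handle:
        sys.stdout.write(strip_menu_text(handle.read()))
```

With this filter, a menu such as <a href="p1.html">product1</a> is reduced
to <a href="p1.html"></a>, so the word "product1" no longer appears in the
menu of every page while the href is still present in the markup.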