At 7:02 PM -0700 10/10/01, Jerry Asher wrote: >For reasons I don't understand just yet, the table of contents >ALWAYS comes up first, and often comes up with five stars, while the >article itself will only show up with one or two stars. > >The only path that htDig can use to find the article is by starting >with the URL of the table of contents.
Sounds like an issue with the backlink_factor code. This is code to weight the "importance" of a page judging from the link structure. If a page has more links pointing to it than off of it, it's weighted higher. This is supposed to balance the TOC issue since the TOC will have plenty of links off of it... If you can think of a way to modify this to point more towards "nodes," I'd be glad to change it. What would you guess the ratio is for these TOC pages on your sites? >That said, what strategies are available for eliminating the table >of contents from the search results? You can mark these pages with the META robots tag: <meta name="robots" content="noindex,follow"> This will instruct indexing robots to follow links but not to include the page in results. -- -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ _______________________________________________ htdig-general mailing list <[EMAIL PROTECTED]> To unsubscribe, send a message to <[EMAIL PROTECTED]> with a subject of unsubscribe FAQ: http://htdig.sourceforge.net/FAQ.html

