At 7:02 PM -0700 10/10/01, Jerry Asher wrote:
>For reasons I don't understand just yet, the table of contents 
>ALWAYS comes up first, and often comes up with five stars, while the 
>article itself will only show up with one or two stars.
>
>The only path that htDig can use to find the article is by starting 
>with the URL of the table of contents.

Sounds like an issue with the backlink_factor code. This is code to 
weight the "importance" of a page judging from the link structure. If 
a page has more links pointing to it than off of it, it's weighted 
higher. This is supposed to balance the TOC issue since the TOC will 
have plenty of links off of it...

If you can think of a way to modify this to point more towards 
"nodes," I'd be glad to change it. What would you guess the ratio is 
for these TOC pages on your sites?

>That said, what strategies are available for eliminating the table 
>of contents from the search results?

You can mark these pages with the META robots tag:

<meta name="robots" content="noindex,follow">

This will instruct indexing robots to follow links but not to include 
the page in results.

-- 
--
-Geoff Hutchison
Williams Students Online
http://wso.williams.edu/

_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
To unsubscribe, send a message to <[EMAIL PROTECTED]> with a 
subject of unsubscribe
FAQ: http://htdig.sourceforge.net/FAQ.html

Reply via email to