According to Terry Luedtke:
> Thanks, that is how our link looks. I didn't realize htDig remembers the referring
>link and uses it as part of the search database.
>
> --
> Terry Luedtke
> National Library of Medicine
>
> On Thu, Apr 19, 2001 at 09:20:07AM +0100, David Adams wrote:
> > Doc2html (latest versions) extracts the Subject, Title, and Keywords from
> > PDF files if they have them,
> > but your document doesn't.
> >
> > Most likely you give a relatively high score to the "Description", and you
> > have indexed a page with a link like:
> >
> > <A href="http://profiles.nlm.nih.gov/BB/A/A/A/A/_/bbaaaa.pdf">What I would
> > Like</a>
Yes, the weight assigned to link description text is controlled by
the description_factor attribute, and can be changed quite easily.
Interestingly, this document ranked quite a bit higher than the
HTML document that had "What I would like to Be" right in the title.
The default description_factor is larger than the default title_factor,
but I wonder if that alone would account for the different rankings.
I also noticed that the word "would" didn't get highlighted in the
excerpts, so I assume it was added to the bad_words file. That may have
affected the scoring too.
--
Gilles R. Detillieux E-mail: <[EMAIL PROTECTED]>
Spinal Cord Research Centre WWW: http://www.scrc.umanitoba.ca/~grdetil
Dept. Physiology, U. of Manitoba Phone: (204)789-3766
Winnipeg, MB R3E 3J7 (Canada) Fax: (204)789-3930
_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
To unsubscribe, send a message to <[EMAIL PROTECTED]> with a
subject of unsubscribe
FAQ: http://htdig.sourceforge.net/FAQ.html