Sorry, didn't mean to make it sound like a complaint ;-), just wondering if it was me or if it was a known behavior...
I'll try ramping up the relative values of title, meta, and h tags to see if that fixes it. Using 3.2.0b4 9/29/02 snapshot Ted On Thursday, October 3, 2002, at 02:38 PM, Gilles Detillieux wrote: > According to Ted Stresen-Reuter: >> Any idea why pdf and Word files are consistently ranked higher than >> html >> files (which have keyword meta tags, TITLE tags, and H1 tags with >> closer >> matches)? > > Not really, but you're not the first person to complain about it. > I think in the past it's usually boiled down to the fact that the word > appears many more times in the text of the PDF or Word document than > in the HTML files. > > Is this still with a recent 3.2.0b4 snapshot, or have you gone back to > 3.1.6 now? Another scoring quirk in 3.1.x is that words near the start > of a document are ranked higher than words near the end. Mind you, meta > tags, titles and h1 tags tend to be near the start, so they should be > ranked high in 3.1.x. > > -- > Gilles R. Detillieux E-mail: <[EMAIL PROTECTED]> > Spinal Cord Research Centre WWW: http://www.scrc.umanitoba.ca/ > Dept. Physiology, U. of Manitoba Winnipeg, MB R3E 3J7 (Canada) > > ------------------------------------------------------------------------------------ Homepage: http://www.tedmasterweb.com/ THE JavaScript Window Management Tool: http://www.tedmasterweb.com/wmo/ ------------------------------------------------------- This sf.net email is sponsored by:ThinkGeek Welcome to geek heaven. http://thinkgeek.com/sf _______________________________________________ htdig-general mailing list <[EMAIL PROTECTED]> To unsubscribe, send a message to <[EMAIL PROTECTED]> with a subject of unsubscribe FAQ: http://htdig.sourceforge.net/FAQ.html

