--- Gilles Detillieux <[EMAIL PROTECTED]> wrote: > According to Ted Stresen-Reuter: > > Any idea why pdf and Word files are consistently ranked higher than > html > > files (which have keyword meta tags, TITLE tags, and H1 tags with > closer > > matches)? > > Not really, but you're not the first person to complain about it. > I think in the past it's usually boiled down to the fact that the > word > appears many more times in the text of the PDF or Word document than > in the HTML files.
In trying to track down a similar issue we have on our site, I manually ran pdf2html.pl against the particular PDF files, putting the output to a text file and looked that over. Compare it with the HTML document. BTW: the problem on our site ended up being improperly coded HTML documents (some web coders were using FONT tags instead of H1, H2, etc...) greg_fenton. ===== Greg Fenton [EMAIL PROTECTED] __________________________________________________ Do you Yahoo!? New DSL Internet Access from SBC & Yahoo! http://sbc.yahoo.com ------------------------------------------------------- This sf.net email is sponsored by:ThinkGeek Welcome to geek heaven. http://thinkgeek.com/sf _______________________________________________ htdig-general mailing list <[EMAIL PROTECTED]> To unsubscribe, send a message to <[EMAIL PROTECTED]> with a subject of unsubscribe FAQ: http://htdig.sourceforge.net/FAQ.html

