--- Gilles Detillieux <[EMAIL PROTECTED]> wrote:
> According to Ted Stresen-Reuter:
> > Any idea why pdf and Word files are consistently ranked higher than
> html 
> > files (which have keyword meta tags, TITLE tags, and H1 tags with
> closer 
> > matches)?
> 
> Not really, but you're not the first person to complain about it.
> I think in the past it's usually boiled down to the fact that the
> word
> appears many more times in the text of the PDF or Word document than
> in the HTML files.


In trying to track down a similar issue we have on our site, I manually
ran pdf2html.pl against the particular PDF files, putting the output to
a text file and looked that over.  Compare it with the HTML document.

BTW: the problem on our site ended up being improperly coded HTML
documents (some web coders were using FONT tags instead of H1, H2,
etc...)

greg_fenton.

=====
Greg Fenton
[EMAIL PROTECTED]

__________________________________________________
Do you Yahoo!?
New DSL Internet Access from SBC & Yahoo!
http://sbc.yahoo.com


-------------------------------------------------------
This sf.net email is sponsored by:ThinkGeek
Welcome to geek heaven.
http://thinkgeek.com/sf
_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
To unsubscribe, send a message to <[EMAIL PROTECTED]> with a 
subject of unsubscribe
FAQ: http://htdig.sourceforge.net/FAQ.html

Reply via email to