Hi,

I've actually got two questions.

1.
On our intranet we have some PDF files that were made in Adobe Acrobat. The
files contain hyperlinks to other files. My guess is that the pdf2html (or
is it pdf2text?) converter doesn't know how to follow links. Does anyone know
of a product that does, or am I relegated to listing each PDF individually
if I want it to be indexed?

2.
Our intranet is sprinkled with links back to the firm directory. For
example, each department's home page lists the staff who work in that
department, with a link back to each person's profile in the firm
directory. Likewise, an individual's profile in the firm directory lists
the other members of the same department, with links to their profiles as
well. When I search on 'technology', I expect to see the Information
Technology Home Page listed first: 'Information Technology' is the title of
the page, appears in the description and keywords, and is in an h1 tag at
what is essentially the start of the page. Instead, it appears at the end
of the list with only one star, while each individual is listed at the
start of the list with 5 stars. Is this because far more pages point to
each individual's profile than point to the Information Technology Home
Page? If so, what do the developers of htdig recommend changing so that the
home page comes up first?

I look forward to your replies!

Ted Stresen-Reuter
http://dev.susansexton.com/htdig for a PHP wrapper for htdig


