On Tue, 8 Jan 2002, Daniel Escobar wrote:

+ >index the contents just fine and anyone following the hit link with IE
+ >will get it displayed just fine (but probably not in NN and in Opera it
+ >depends on how you have it configured) ... but ... because htdig didn't
+ >analyse the html content, it will just show the filename in the search
+ >results.
+
+ I don't understand how htdig would be browser dependent, when it runs
+ on the server side.

No Daniel, you've missed a point. HTSEARCH runs server side but HTDIG runs
client side when indexing the pages and the user who is looking at the
search results is also client side.

+ Most likely, I didn't explain my problem well enough, so here it goes
+ again:  When an htsearch query is performed, the results page contains
+ some results with the appropiate tittle, and others with the filename
+ btw brackets. [filename.html] Why is that happenning, when all the
+ files have the <title>...</title> tags?

You've convinced me still further with that extra comment. The filename is
.html and yet you have exactly the behaviour that I see when my htdig
indexes a .txt file. I surmise the following:

You have a page coded in HTML but when htdig (the robot) fetches it the
server hands it over with a MIME content type of text/plain. That being so
htdig does not attempt to decode the HTML structure and just indexes the
words. (The page should show up on searches for 'html', 'title' etc.)

When the user does a search the page shows up as [filename.html] but,
using IE, when the user clicks on the link the browser decides to ignore
the content type and display it as HTML because Bill thinks he knows
better than to believe the content type.

This is (one of many situations) where the Opera web browser comes in very
handy. It has configuration options either to believe the MIME content
type or to ignore it and believe the file extention instead.

If you could actually quote us a URL then we could check this theory but
until then it remains a theory.

regards,
        Malcolm.

 [EMAIL PROTECTED]     http://users.ox.ac.uk/~malcolm/


_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
To unsubscribe, send a message to <[EMAIL PROTECTED]> with a 
subject of unsubscribe
FAQ: http://htdig.sourceforge.net/FAQ.html

Reply via email to