Re: [htdig] search results and tittle

Daniel Escobar Tue, 08 Jan 2002 12:35:31 -0800

> If you could actually quote us a URL then we could check this theory but
> until then it remains a theory.


Malcolm,  below is a link to a search box that will search one of my sites.
You can do a query on whatever you would like, but I suggest using "Randy"
as one of your searches, since I know that will show you my problem.  (some
searches show as [filename.html], and others show the actual tittle).

http://fox97.com/common/htdig/ssi/wfox-fm.html

Please let me know if you have any other ideas/comments :)

Thanks again,

-daniel

Malcolm Austen wrote:

> On Tue, 8 Jan 2002, Daniel Escobar wrote:
>
> + >index the contents just fine and anyone following the hit link with IE
> + >will get it displayed just fine (but probably not in NN and in Opera it
> + >depends on how you have it configured) ... but ... because htdig didn't
> + >analyse the html content, it will just show the filename in the search
> + >results.
> +
> + I don't understand how htdig would be browser dependent, when it runs
> + on the server side.
>
> No Daniel, you've missed a point. HTSEARCH runs server side but HTDIG runs
> client side when indexing the pages and the user who is looking at the
> search results is also client side.
>
> + Most likely, I didn't explain my problem well enough, so here it goes
> + again:  When an htsearch query is performed, the results page contains
> + some results with the appropiate tittle, and others with the filename
> + btw brackets. [filename.html] Why is that happenning, when all the
> + files have the <title>...</title> tags?
>
> You've convinced me still further with that extra comment. The filename is
> .html and yet you have exactly the behaviour that I see when my htdig
> indexes a .txt file. I surmise the following:
>
> You have a page coded in HTML but when htdig (the robot) fetches it the
> server hands it over with a MIME content type of text/plain. That being so
> htdig does not attempt to decode the HTML structure and just indexes the
> words. (The page should show up on searches for 'html', 'title' etc.)
>
> When the user does a search the page shows up as [filename.html] but,
> using IE, when the user clicks on the link the browser decides to ignore
> the content type and display it as HTML because Bill thinks he knows
> better than to believe the content type.
>
> This is (one of many situations) where the Opera web browser comes in very
> handy. It has configuration options either to believe the MIME content
> type or to ignore it and believe the file extention instead.
>
> If you could actually quote us a URL then we could check this theory but
> until then it remains a theory.
>
> regards,
>         Malcolm.
>
>  [EMAIL PROTECTED]     http://users.ox.ac.uk/~malcolm/

--
Daniel Escobar
Software Engineer
Cox Radio Interactive
678.860.2749 (cellular)
404.979.7860 (office)




_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
To unsubscribe, send a message to <[EMAIL PROTECTED]> with a 
subject of unsubscribe
FAQ: http://htdig.sourceforge.net/FAQ.html

Re: [htdig] search results and tittle

Reply via email to