> >This is a common mistake that information creators think 'is a good 
> >thing'...  The web got popular for a number of reasons - one of them being 
> >"full text indexing of all content" (including headers/footers/etc).
> 
> Why?  There is no useful information in headers/footers.  By nature of 
> using a templating system, they are the same on every page in a given 
> section.  Including them in search results only increases the noise and the 
> amount of information that needs to be indexed.

Only says you - a user of your site may find the headers and footers to be very useful.

> >The point is that, it is the user of the system that wants to find the 
> >information - not the author telling you what you can and cant search 
> >for.   Classic example -> books used to have (and still do) an index in 
> >the last couple of pages of the book, yet the user could never find what 
> >they were looking for; until the book made it onto CDROM at which point 
> >full-text-searching was possible.
> 
> Almost every piece of a book is useful to search.  But what good would it 
> be to search for a chapter heading?  That information is already given to 
> you in the table of contents.

An index is a whole different beast to a table of contents.  And as you say, every 
peice of a book is useful to search -> thus why bother only indexing content that the 
author considers valid?  Why not just index _everything_, then let the user decide...

> >-> Full text searching is a _much better_ solution to search problems than 
> >indexing on what YOU think is the information they want.
> >
> >Mathew
> >
> >PS.  This means, use a spider.. or even better use google via a "site:..." 
> >search.
> 
> Google PageRank is very good at searching a broad sample of sites.  It's 
> not so good for individual sites.

You are kidding right?  The algorithms the Google use, already take into account 
common content on multiple pages.  OpenOffice.org use Google as their own site 
specific search engine.  As do a number of sites.  The only real problem with using 
Google is that they only spider the web every few weeks, thus if you update more 
frequently than that, you may have a problem.

Mathew


-------------------------------------------------------
This SF.Net email is sponsored by the new InstallShield X.
>From Windows to Linux, servers to mobile, InstallShield X is the one
installation-authoring solution that does it all. Learn more and
evaluate today! http://www.installshield.com/Dev2Dev/0504
_______________________________________________
Html-template-users mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/html-template-users

Reply via email to