On Thu, 16 Jan 2003, Michael D. Crawford wrote:

> I have a page about C++ programming that gets a lot of traffic referred from 
> search engines when people enter various queries that include "c++".
> 
> But analog's SEARCHQUERY report says that the search queries just include "c" 
> without the "++".
> 

Trying to grab the search queries in an automated way is a messy business.
It's really not good enough to show the raw search queries, because you tend
to get a lot of cruft in there, so a single query would occur on many
different lines with slight variations. So some munging has to take place,
and it's necessarily a bit heuristic. "+" is excluded because it's usually
not significant. It's used to tell the search engines "I definitely want
this word", or even just as a vague separator.

-- 
Stephen Turner, Cambridge, UK    http://homepage.ntlworld.com/adelie/stephen/
 "The question of whether a computer can think is no more interesting than
  the question of whether a submarine can swim."  (Edsger W. Dijkstra)

+------------------------------------------------------------------------
|  TO UNSUBSCRIBE from this list:
|    http://lists.isite.net/listgate/analog-help/unsubscribe.html
|
|  Digest version: http://lists.isite.net/listgate/analog-help-digest/
|  Usenet version: news://news.gmane.org/gmane.comp.web.analog.general
|  List archives:  http://www.analog.cx/docs/mailing.html#listarchives
+------------------------------------------------------------------------

Reply via email to