Nevermind. I discovered a 'REFARGEXCLUDE *' statement in my config file
in an out-of-the-way place. When I removed it, things started working as
I understood. For the archives, I discovered this with the '-settings'
command line argument, which was very helpful.

Thanks for being available to assist me.

-Kevin

-----Original Message-----
From: [EMAIL PROTECTED]
[mailto:[EMAIL PROTECTED] On Behalf Of Zembower, Kevin
Sent: Friday, January 04, 2008 12:35 PM
To: [email protected]
Subject: [analog-help] Need help getting SEARCHQUERY working

I'm trying to get the SEARCHQUERY and SEARCHWORD sections working on my
Analog 6.0 system. I have this section in my Analog config file:

SEARCHQUERY ON
SEARCHWORD ON
CONFIGFILE SearchEngines.txt

SearchEngines.txt was downloaded from Mike Shor's site at
http://www.owen.vanderbilt.edu/mike.shor/diversions/analog. It contains
this line:

SEARCHENGINE    http://*.google.com/*   q,as_q,as_oq,as_epq,query

When run like this:
../analog-6.0/analog +gweb1.analog.cfg +F071201 +T071231
+O/var/www/centernet/htdocs/webstats/web1/200712/index.html

my output only contains a Search Query report that lists 19 pages
requested, even though the web logs of just the last day of my period
includes 128 requests from Google, like this sample:
[EMAIL PROTECTED]:/opt/analog/conf.d$ fgrep http://www.google.com/search
../logdata/web1/access_log.20071231 |wc -l
128
[EMAIL PROTECTED]:/opt/analog/conf.d$ fgrep http://www.google.com/search
../logdata/web1/access_log.20071231 |head 
ABTS-NCR-Dynamic-013.35.163.122.airtelbroadband.in - -
[31/Dec/2007:00:55:13 -0500] "GET
/igwg/presentations/Monday/SubplenB/PromotionMale.pdf HTTP/1.1" 200
44424
"http://www.google.com/search?q=graduate+housewives+in+india&hl=en&rlz=1
T4GGLJ_en-GBIN214IN214&start=20&sa=N" "Mozilla/4.0 (compatible; MSIE
7.0; Windows NT 5.1)"
85.185.229.106 - - [31/Dec/2007:00:55:43 -0500] "GET /pubs/sp/20/20.pdf
HTTP/1.0" 200 466095
"http://www.google.com/search?hl=fa&q=AIDS%2BPDF&btnG=%D8%AC%D8%B3%D8%AA
%D8%AC%D9%88%D9%8A+Google&lr=" "Mozilla/4.0 (compatible; MSIE 6.0;
Windows NT 5.1; SV1)"
66.249.85.131 - - [31/Dec/2007:00:57:48 -0500] "GET
/asia/bangladesh/nsdp.shtml HTTP/1.1" 200 20061
"http://www.google.com/search?q=child+delivery+video&hl=en&start=70&sa=N
" "Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1; SV1; .NET CLR
1.1.4322)"
c-98-204-115-120.hsd1.dc.comcast.net - - [31/Dec/2007:00:58:02 -0500]
"GET /pubs/ HTTP/1.1" 200 30575
"http://www.google.com/search?q=jhccp&ie=utf-8&oe=utf-8&aq=t&rls=org.moz
illa:en-US:official&client=firefox-a" "Mozilla/5.0 (Macintosh; U; Intel
Mac OS X; en-US; rv:1.8.1.11) Gecko/20071127 Firefox/2.0.0.11"
pool-71-182-79-153.ptldor.fios.verizon.net - - [31/Dec/2007:01:23:25
-0500] "GET /quality/expo.shtml HTTP/1.1" 200 10440
"http://www.google.com/search?hl=en&q=putting+quality+first&btnG=Search";
"Mozilla/5.0 (Windows; U; Windows NT 5.1; en-US; rv:1.8.1.11)
Gecko/20071127 Firefox/2.0.0.11"
d47-69-11-157.nap.wideopenwest.com - - [31/Dec/2007:01:30:50 -0500] "GET
/pubs/fg/02/04-chapter2.pdf HTTP/1.1" 206 2810476
"http://www.google.com/search?sourceid=navclient&aq=t&ie=UTF-8&rls=GGIC,
GGIC:2007-39,GGIC:en&q=audience+segmentation" "Mozilla/4.0 (compatible;
MSIE 6.0; Windows NT 5.1; SV1; .NET CLR 1.1.4322)"
d47-69-11-157.nap.wideopenwest.com - - [31/Dec/2007:01:35:13 -0500] "GET
/pubs/fg/02/04-chapter2.pdf HTTP/1.1" 206 2568116
"http://www.google.com/search?sourceid=navclient&aq=t&ie=UTF-8&rls=GGIC,
GGIC:2007-39,GGIC:en&q=audience+segmentation" "Mozilla/4.0 (compatible;
MSIE 6.0; Windows NT 5.1; SV1; .NET CLR 1.1.4322)"
ABTS-NCR-Dynamic-011.123.163.122.airtelbroadband.in - -
[31/Dec/2007:01:52:42 -0500] "GET
/igwg/presentations/Monday/SubplenC/MenAgainst.pdf HTTP/1.1" 200 654982
"http://www.google.com/search?hl=en&rls=WZPA%2CWZPA%3A2007-25%2CWZPA%3Ae
n&q=women+population+of+india" "Mozilla/4.0 (compatible; MSIE 6.0;
Windows NT 5.1; SV1)"
CPE-144-137-203-75.sa.bigpond.net.au - - [31/Dec/2007:02:02:10 -0500]
"GET /pubs/ci/2/index.shtml HTTP/1.1" 200 19646
"http://www.google.com/search?hl=en&lr=&as_qdr=all&q=Elizabeth+Omolo&btn
G=Search" "Mozilla/4.0 (compatible; MSIE 7.0; Windows NT 5.1;
FunWebProducts; .NET CLR 2.0.50727; .NET CLR 1.1.4322)"
194.225.53.2 - - [31/Dec/2007:02:08:25 -0500] "GET /topics/oral.shtml
HTTP/1.0" 200 14876
"http://www.google.com/search?q=oral+contraceptive&hl=en&start=10&sa=N";
"Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1; SV1)"
[EMAIL PROTECTED]:/opt/analog/conf.d$

None of these query referrer requests, such as
"graduate+housewives+in+india" or "AIDS%2BPDF" (the first two in the
fgrep above), show up in the Search Query section. All other aspects of
the Analog report lead me to believe that it ran normally. The output
when it ran was:
[EMAIL PROTECTED]:/opt/analog/conf.d$ ../analog-6.0/analog +gweb1.analog.cfg
+F071201 +T071231
+O/var/www/centernet/htdocs/webstats/web1/200712/index.html
../analog-6.0/analog: analog version 6.0/Unix
../analog-6.0/analog: Warning R: In Directory Report, turning off pie
chart of
  only one wedge
  (For help on all errors and warnings, see docs/errors.html)
[EMAIL PROTECTED]:/opt/analog/conf.d$

Also, the report contains over a million successful requests and almost
7,000 referrers from http://www.google.com/search, leading me to believe
that it's reading the correct web log files.

Can anyone help me set up my Analog config files to correctly generate
the Search Query and Search Word sections?

Thanks so much for your advice and suggestions.

-Kevin

Kevin Zembower
Internet Services Group manager
Center for Communication Programs
Bloomberg School of Public Health
Johns Hopkins University
111 Market Place, Suite 310
Baltimore, Maryland  21202
410-659-6139 

+-----------------------------------------------------------------------
-
|  TO UNSUBSCRIBE from this list:
|    http://lists.meer.net/mailman/listinfo/analog-help
|
|  Analog Documentation: http://analog.cx/docs/Readme.html
|  List archives:  http://www.analog.cx/docs/mailing.html#listarchives
|  Usenet version: news://news.gmane.org/gmane.comp.web.analog.general
+-----------------------------------------------------------------------
-

+------------------------------------------------------------------------
|  TO UNSUBSCRIBE from this list:
|    http://lists.meer.net/mailman/listinfo/analog-help
|
|  Analog Documentation: http://analog.cx/docs/Readme.html
|  List archives:  http://www.analog.cx/docs/mailing.html#listarchives
|  Usenet version: news://news.gmane.org/gmane.comp.web.analog.general
+------------------------------------------------------------------------

Reply via email to