According to [EMAIL PROTECTED]: > Thanks for your initial response. I have tried running htdig with -vvvv > (again) as you suggest. > An extract of the result is below. Can anyone confirm for me what it is > saying. > My specific problem is trying to return results based on the 'postcode' > written into the meta tag. > If you go to www.edinburgh.gov.uk, choose search from the top menu and enter > 'postcode' you will not get back any results _FOR which the only match is > from the META tag_ (You will get lots of results where postcode appears in > the body.) > Thanks in advance for any help that can be provided, > Mike > > pick: www.edinburgh.gov.uk, # servers = 1 > 28:28:2:http://www.edinburgh.gov.uk/CEC/Recreation/Libraries/Local_Organisat > ions/local_Action_Group.html: Retrieval command for > http://www.edinburgh.gov.uk/CEC/Recreation/Libraries/Local_Organisations/loc > al_Action_Group.html: GET > http://www.edinburgh.gov.uk/CEC/Recreation/Libraries/Local_Organisations/loc > al_Action_Group.html HTTP/1.0 > User-Agent: htdig/3.1.2 ('')
Version 3.1.2 is very old! You should upgrade to the 3.1.6 snapshot in http://www.htdig.org/files/snapshots/, which has tons of bug fixes, including closing some nasty security holes. > Referer: > http://www.edinburgh.gov.uk/CEC/Recreation/Libraries/Local_Organisations/loc > al_A.html > Host: www.edinburgh.gov.uk > > Header line: HTTP/1.1 200 OK > Header line: Date: Fri, 09 Nov 2001 09:40:15 GMT > Header line: Server: Apache/1.3.14 (Win32) > Header line: Last-Modified: Mon, 05 Nov 2001 17:32:57 GMT > Translated Mon, 05 Nov 2001 17:32:57 GMT to 05 Nov 2001 (101) > And converted to Mon, 05 Nov 2001 > Header line: ETag: "0-fe1-3be6cd49" > Header line: Accept-Ranges: bytes > Header line: Content-Length: 4065 > Header line: Connection: close > Header line: Content-Type: text/html > Header line: > returnStatus = 0 > Read 4065 from document > Read a total of 4065 bytes > Tag: HTML>, matched -1 > Tag: HEAD>, matched -1 > Tag: META name='htdig-keywords' content='EH7 postcode 5QY postcode EH75QY > postcode disabled community care disability mentally handicapped Learning > disabilities PEOPLE WITH DISABILITIES Learning disabilities'>, matched 20 At debug level 4 (-vvvv) you should be getting output for each word parsed here. The reason you aren't is because version 3.1.2 didn't handle single quotes in meta tags. That wasn't fixed until two versions later. There have been lots more parser fixes since then. Just another example of why problem reports should ALWAYS indicate the htdig version number. -- Gilles R. Detillieux E-mail: <[EMAIL PROTECTED]> Spinal Cord Research Centre WWW: http://www.scrc.umanitoba.ca/~grdetil Dept. Physiology, U. of Manitoba Phone: (204)789-3766 Winnipeg, MB R3E 3J7 (Canada) Fax: (204)789-3930 _______________________________________________ htdig-general mailing list <[EMAIL PROTECTED]> To unsubscribe, send a message to <[EMAIL PROTECTED]> with a subject of unsubscribe FAQ: http://htdig.sourceforge.net/FAQ.html

