Good news for once. For the record what was happening was that for some reason the first page of one section of my site (which is a database driven A-Z list) had a list of URLS (letter A - Z) in it, but coded slightly strangely. The parameter to choose the right page should have been delimited with an ampersand character, but the HTML entity had been used instead. Most if not all browsers translate this entity into the correct ampersand character when requesting the URL, so everything appears okay. However ht://dig does not translate this character, so the database script does not recognise the parameter and returns the page for the letter A by default (not sure why but it doesnt matter) Of course ht://dig still thinks that these are 26 seperate pages, and indexes them all, even though the content is identical, and returns the same link to the browser from htsearch. Of course what confused me was that then the browser translates the 'malformed' URL again, and returns the correct page when you click on the link in the search results - not the page that is shown in the excerpt.
Hope everyone understood that, and that it helps someone in the future. NB What do the developers think about this issue? Is it something that should be checked in future versions? Ht://dig is 'supposed' to behave the same as a browser and in this case it doesn't. Mike Who we are ...... What we do ..... The City of Edinburgh Council: www.edinburgh.gov.uk www.edinburgh.gov.uk/events - for what's on www.edinburgh.gov.uk/libraries - renew your books on line and check our catalogue www.edinburgh.gov.uk/atoz - for your online guide to Council Services ********************************************************************** This Email and files transmitted with it are confidential and are intended for the sole use of the individual or organisation to whom they are addressed. If you have received this Email in error please notify the sender immediately and delete it without using, copying, storing, forwarding or disclosing its contents to any other person. The Council has endeavoured to scan this Email message and attachments for computer viruses and will not be liable for any losses incurred by the recipient. ********************************************************************** ------------------------------------------------------- This sf.net email is sponsored by:ThinkGeek Welcome to geek heaven. http://thinkgeek.com/sf _______________________________________________ ht://Dig general mailing list: <[EMAIL PROTECTED]> ht://Dig FAQ: http://htdig.sourceforge.net/FAQ.html List information (subscribe/unsubscribe, etc.) https://lists.sourceforge.net/lists/listinfo/htdig-general

