The original synonyms file contains many common spelling mistakes (e.g. "accomodate"), and many common abbreviations (e.g. "feb" for "February"). In my synonyms file I have tried to avoid both, and only to include spellings which are correct either in the USA or the UK.
My synonyms file is an alternative, and I think should be most useful in the searching of English-language, non-USA sites. I am fairly confident that it contains no duplication. I am, of course, happy to receive corrections and additions. I agree that I should have had "battleax" not "battle-ax" (this is the only punctuation). Is "tonite" regarded as a corrrect, alternative spelling in the USA, I don't have a Webster's to check it? As for combining the two synonym files, that is up to you, but I took the decision to exclude spellings that are incorrect both side of the Atlantic. -- David Adams Computing Services Southampton University ----- Original Message ----- From: "Gilles Detillieux" <[EMAIL PROTECTED]> To: "David Adams" <[EMAIL PROTECTED]> Cc: "ht://Dig mailing list" <[EMAIL PROTECTED]> Sent: Tuesday, January 29, 2002 5:02 PM Subject: [htdig] Re: Synonym file > According to David Adams: > > News that version 3.1.6 is to be officially released has impelled me to send > > you as a contribution the synonym file that has been used on the University > > of Southampton search pages > > (http://www.search.soton.ac.uk/soton/) > > for the last several months. It consists of alternative (mostly UK and > > USA) spellings of English language words. > > > > Thus a search for "Archaeology Centre for Coloured Fibres" > > results in: > > (archaeology or archeology) and (centre or center) and for and (coloured or > > colored) and (fibres or fibers) > > > > It is not complete (for example it does not include "archaeological"), but > > it has proved very useful to us. > > Thanks for the contribution. I've only had time for a quick look at it, > but I did notice at least one punctuation character in it (a hyphen) > which htfuzzy doesn't strip out, but which htdig/htsearch do, so the > synonyms file shouldn't have any punctuation. Could you tell me, so > I don't have to check for myself, whether your synonyms file includes > all the items in the original file? If so, yours could be a direct > replacement. Otherwise, we may want to merge the two. > > In either case, we should also check to make sure no word appears > more than once in the file, because htfuzzy synonyms doesn't properly > handle duplicates. I believe the original synonyms file does contain > a few duplicates, which cause some incomplete entries in the database, > as the last occurrence of a word overrides previous ones. > > > I won't have a new version of Doc2html for you for some time, though the > > present version does need improving, because I want to revise the > > documentation and try out some alternative conversion utitlities. I don't > > have time for either at present. > > That's OK. The one in 3.1.6 right now is up to date as far as submitted > patches to the last version you released, and is (I think) quite solid. > > -- > Gilles R. Detillieux E-mail: <[EMAIL PROTECTED]> > Spinal Cord Research Centre WWW: http://www.scrc.umanitoba.ca/~grdetil > Dept. Physiology, U. of Manitoba Phone: (204)789-3766 > Winnipeg, MB R3E 3J7 (Canada) Fax: (204)789-3930 > > _______________________________________________ > htdig-general mailing list <[EMAIL PROTECTED]> > To unsubscribe, send a message to <[EMAIL PROTECTED]> with a subject of unsubscribe > FAQ: http://htdig.sourceforge.net/FAQ.html > _______________________________________________ htdig-general mailing list <[EMAIL PROTECTED]> To unsubscribe, send a message to <[EMAIL PROTECTED]> with a subject of unsubscribe FAQ: http://htdig.sourceforge.net/FAQ.html

