Ah, thanks. This is VERY useful. I sent another email which crossed
with this, but you might have fun playing with my listing anyway.
The more I looked at it, the more interesting this got!
In fact, I've spent most of today tinkering with this, I got so
carried away. Your information will be most useful if more
dictionaries turn out to use this system. So far, most of them seem to
be ASCII listings which need either:
1. Simple character set translation, or
2. Simple search and replace for symbols, e.g. vowel&'\' replaced by
accented vowel or vowel&'^' replaced by vowel with circumflex or
vowel&':' replaced by vowel¨aut or dierisis, for example.
Drat, why did I get into this in the first place...I have a feeling
I'll be programming around this all week now :-))
--
Dilwyn Jones
----- Original Message -----
From: "Robert Newson" <[EMAIL PROTECTED]>
To: <[EMAIL PROTECTED]>
Sent: Sunday, July 31, 2005 3:48 PM
Subject: Re: [ql-users] dictionaries
Dilwyn Jones wrote:
...
One thing I need help on is on what appears to be a simple ASCII
compression scheme on some of the unix-sourced word lists. I'm
assuming they're from Unix systems because the end of line
character
is only a linefeed, no carriage returns. I need to find out if the
following is a known standard or not:
(snip)
Found it. It is ispell
(http://www.lasr.cs.ucla.edu/geoff/ispell.html). Each letter is a
code for prefixes and suffixes that are permitable in the given
language - every language can have its own affix file to define what
each letter means!
--
No virus found in this outgoing message.
Checked by AVG Anti-Virus.
Version: 7.0.338 / Virus Database: 267.9.7/60 - Release Date: 28/07/2005
_______________________________________________
QL-Users Mailing List
http://www.q-v-d.demon.co.uk/smsqe.htm