Re: Illume dictionary for Dutch (Nederlands)

2008-11-27 Thread Pander
Is it possible to put comments in the .dic file? If so, in what format? E.g. only the first couple of lines which start with a #. Carsten Haitzler (The Rasterman) wrote: On Thu, 20 Nov 2008 10:55:02 +0100 (CET) Pander [EMAIL PROTECTED] babbled: any dictionary should not care about gsm

Re: Illume dictionary for Dutch (Nederlands)

2008-11-27 Thread The Rasterman
On Fri, 28 Nov 2008 00:20:38 +0100 Pander [EMAIL PROTECTED] babbled: Is it possible to put comments in the .dic file? If so, in what format? E.g. only the first couple of lines which start with a #. no. it doesn't support comments. Carsten Haitzler (The Rasterman) wrote: On Thu, 20 Nov 2008
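
Since the .dic file does not support comments, one workaround is to keep comments in a separate annotated source list and drop them when generating the file. The following is a minimal sketch, assuming the annotated source marks comments with a leading `#`; the function name and the sample lines are hypothetical, and nothing here reflects the actual .dic format, which the thread does not describe.

```python
def strip_comments(lines):
    """Yield only the lines that do not start with '#'."""
    for line in lines:
        if not line.startswith("#"):
            yield line


# Hypothetical annotated source list; word lines pass through unchanged.
annotated = ["# hypothetical header comment\n", "fiets\n", "huis\n"]
clean = list(strip_comments(annotated))
```

The comments stay available to maintainers in the source list, while the generated file contains word lines only.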

Re: Illume dictionary for Dutch (Nederlands)

2008-11-27 Thread Guillaume Chereau
On Thu, 2008-11-20 at 10:14 +1100, Carsten Haitzler wrote: (japanese) sakana - さかな | 魚 | 肴 | 坂な | 茶菓な | 阪な | 差かな | 左かな | 差かな | 査かな | 鎖かな | サカナ | sakana Hi raster, I am curious how we can pass Unicode characters like those to applications via X. I thought the keyboard could only send keycodes.

Re: Illume dictionary for Dutch (Nederlands)

2008-11-20 Thread Pander
Hi all, I intend to generate the following: - a full list utf-8 (for 8 bit SMS and regular use, default) - b full list utf-8 GSM 03.38[1] (for 7 bit SMS) - c truncated list utf-8 (for 8 bit SMS and regular use) - d truncated list utf-8 GSM 03.38[1] (for 7 bit SMS, default) [1] These utf-8

Re: Illume dictionary for Dutch (Nederlands)

2008-11-20 Thread Rui Miguel Silva Seabra
I have no idea... I might only make a new version with utf-8 encoded characters. :) On Thu, Nov 20, 2008 at 10:40:46AM +0100, Pander wrote: Hi all, I intend to generate the following: - a full list utf-8 (for 8 bit SMS and regular use, default) - b full list utf-8 GSM 03.38[1] (for 7 bit

Re: Illume dictionary for Dutch (Nederlands)

2008-11-20 Thread Pander
Small correction to my text: "Note that more characters must be" should read "Note that certain special characters are in GSM 03.38 which are not in extended ASCII". Nevertheless, one complete utf-8 dictionary could be used by most applications, also SMS. The conversion I do for GSM 03.38 could also be done

Re: Illume dictionary for Dutch (Nederlands)

2008-11-20 Thread The Rasterman
On Thu, 20 Nov 2008 10:55:02 +0100 (CET) Pander [EMAIL PROTECTED] babbled: any dictionary should not care about gsm encodings. it should be just a utf8 dictionary file. it is the job of the sms app to convert normal utf8 unicode to whatever encoding is used by the network, and back. :) Small

Illume dictionary for Dutch (Nederlands)

2008-11-19 Thread Pander
Hi all, Together with http://opentaal.org , I'm working on a special Illume dictionary for Dutch word completion. It will be available in the near future. Of course this particular word list is very long and contains about 250,000 words and has a typical long tail. Many words or compositions

Re: Illume dictionary for Dutch (Nederlands)

2008-11-19 Thread Rui Miguel Silva Seabra
On Wed, Nov 19, 2008 at 11:25:22PM +0100, Pander wrote: Hi all, Together with http://opentaal.org , I'm working on a special Illume dictionary for Dutch word completion. It will be available in the near future. Of course this particular word list is very long and contains about 250,000

Re: Illume dictionary for Dutch (Nederlands)

2008-11-19 Thread The Rasterman
On Wed, 19 Nov 2008 23:25:22 +0100 Pander [EMAIL PROTECTED] babbled: Hi all, Together with http://opentaal.org , I'm working on a special Illume dictionary for Dutch word completion. It will be available in the near future. Of course this particular word list is very long and contains

Re: Illume dictionary for Dutch (Nederlands)

2008-11-19 Thread Marco Trevisan (Treviño)
Pander wrote: Of course this particular word list is very long and contains about 250,000 words and has a typical long tail. Many words or compositions occur seldom in everyday use. What would be a good cut off point in number of words, also in terms of performance? The
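
One way to reason about a cut-off point for a long-tailed list is to keep the most frequent words until they cover a chosen share of all occurrences. The sketch below assumes per-word frequency counts are available; the function name, the coverage parameter, and the sample counts are all hypothetical and do not come from the OpenTaal data.

```python
def cutoff_by_coverage(freqs, coverage):
    """Keep the most frequent words until `coverage` of all occurrences is reached.

    freqs: mapping of word -> occurrence count.
    coverage: fraction between 0 and 1.
    """
    total = sum(freqs.values())
    kept = []
    running = 0
    # Most frequent first; ties are broken alphabetically for stable output.
    for word, count in sorted(freqs.items(), key=lambda kv: (-kv[1], kv[0])):
        if running >= coverage * total:
            break
        kept.append(word)
        running += count
    return kept


# Hypothetical counts, for illustration only.
sample = {"de": 50, "het": 30, "een": 15, "zeldzaam": 5}
common = cutoff_by_coverage(sample, 0.9)
```

With the sample counts above, a 0.9 coverage target keeps the three most frequent words and drops the rare one. Whether a coverage target or a fixed word count works better on the device is a question the thread leaves open.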

Re: Illume dictionary for Dutch (Nederlands)

2008-11-19 Thread Rui Miguel Silva Seabra
On Thu, Nov 20, 2008 at 03:02:41AM +0100, Marco Trevisan (Treviño) wrote: Pander wrote: Of course this particular word list is very long and contains about 250,000 words and has a typical long tail. Many words or compositions occur seldom in everyday use. What would be a good