Re: [fpc-devel] Unicode Letters

2008-07-17 Thread theo
ik schrieb: In hebrew (at least) the punctuation is a different char that comes after the letter, but painted like it was part of the letter, so you can parse each word and ignore non letter value (it arrives in different range in the unicode table). Yes, the question is: which are non

Re: [fpc-devel] Unicode Letters

2008-07-17 Thread Felipe Monteiro de Carvalho
On Wed, Jul 16, 2008 at 4:09 PM, theo [EMAIL PROTECTED] wrote: Is there a better way to do what I need? I think that writing such a routine which compares the char to a table will be the best solution. -- Felipe Monteiro de Carvalho ___ fpc-devel

[fpc-devel] Unicode Letters

2008-07-16 Thread theo
Is there a way to separate unicode letters from punctuation and the like? The reason is simple: I would like to separate words in a text for a spell-checker. I see there are tables which list unicode categories http://www.sql-und-xml.de/unicode-database/#kategorien Is there already something for

Re: [fpc-devel] Unicode Letters

2008-07-16 Thread ik
In hebrew (at least) the punctuation is a different char that comes after the letter, but painted like it was part of the letter, so you can parse each word and ignore non letter value (it arrives in different range in the unicode table). Ido On Wed, Jul 16, 2008 at 10:09 PM, theo [EMAIL