subject:"\[fpc\-devel\] Unicode Letters"

Re: [fpc-devel] Unicode Letters

2008-07-17 Thread theo

ik schrieb: In hebrew (at least) the punctuation is a different char that comes after the letter, but painted like it was part of the letter, so you can parse each word and ignore non letter value (it arrives in different range in the unicode table). Yes, the question is: which are non

Re: [fpc-devel] Unicode Letters

2008-07-17 Thread Felipe Monteiro de Carvalho

On Wed, Jul 16, 2008 at 4:09 PM, theo [EMAIL PROTECTED] wrote: Is there a better way to do what I need? I think that writing such a routine which compares the char to a table will be the best solution. -- Felipe Monteiro de Carvalho ___ fpc-devel

[fpc-devel] Unicode Letters

2008-07-16 Thread theo

Is there a way to separate unicode letters from punctuation and the like? The reason is simple: I would like to separate words in a text for a spell-checker. I see there are tables which list unicode categories http://www.sql-und-xml.de/unicode-database/#kategorien Is there already something for

Re: [fpc-devel] Unicode Letters

2008-07-16 Thread ik

In hebrew (at least) the punctuation is a different char that comes after the letter, but painted like it was part of the letter, so you can parse each word and ignore non letter value (it arrives in different range in the unicode table). Ido On Wed, Jul 16, 2008 at 10:09 PM, theo [EMAIL