ik schrieb:
In hebrew (at least) the punctuation is a different char that comes
after the letter, but painted like it was part of the letter, so you
can parse each word and ignore non letter value (it arrives in
different range in the unicode table).
Yes, the question is: which are non
On Wed, Jul 16, 2008 at 4:09 PM, theo [EMAIL PROTECTED] wrote:
Is there a better way to do what I need?
I think that writing such a routine which compares the char to a table
will be the best solution.
--
Felipe Monteiro de Carvalho
___
fpc-devel
Is there a way to separate unicode letters from punctuation and the like?
The reason is simple: I would like to separate words in a text for a
spell-checker.
I see there are tables which list unicode categories
http://www.sql-und-xml.de/unicode-database/#kategorien
Is there already something for
In hebrew (at least) the punctuation is a different char that comes
after the letter, but painted like it was part of the letter, so you
can parse each word and ignore non letter value (it arrives in
different range in the unicode table).
Ido
On Wed, Jul 16, 2008 at 10:09 PM, theo [EMAIL