On 9/26/12 11:12 PM, Martin Wierschin wrote:
> I'm trying to split CJK text using the kind of word boundaries detected by
> -[NSAttributedString doubleClickAtIndex:]. That method does the job
> correctly, but only if the system preferences have the Word Break mode set to
> Japanese. I need to ensure this kind of word splitting independent of the
> user's system preferences.
>
> It was my understanding that I could use CFStringTokenizer for this task, but
> it doesn't seem to be working. Test code that produces improper results:

I have no idea whether the system frameworks expose functions for this. Since
the system evidently knows how to do it, it could (and should). If you end up
needing to do it on your own:

There are the Kinsoku rules, which are the line-wrap rules for Japanese. Semantically similar rules exist for Chinese and Korean. A simple implementation is not too difficult; see here for a quick overview:

http://en.wikipedia.org/wiki/Line_breaking_rules_in_East_Asian_languages
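
To give an idea of what a simple implementation of such rules could look like, here is a minimal sketch in Python (chosen only because it is compact; the logic ports directly to Objective-C). Everything in it is an assumption for illustration: the names NO_LINE_START, NO_LINE_END, can_break_before and wrap are hypothetical, and the two character sets are deliberately incomplete samples rather than the full Kinsoku tables. Note that it only decides where a line break is permitted; it does not do dictionary-based word segmentation.

```python
# Illustrative, incomplete character sets (assumptions, not the full tables).
# Characters that must not begin a line: closing brackets, punctuation,
# the prolonged sound mark, the iteration mark and small kana.
NO_LINE_START = set("、。，．！？）」』】〕］｝〉》ー々ゃゅょっャュョッ")
# Characters that must not end a line: opening brackets.
NO_LINE_END = set("（「『【〔［｛〈《")


def can_break_before(text, i):
    """Return True if a break is permitted between text[i - 1] and text[i]."""
    if i <= 0 or i >= len(text):
        return False
    return text[i] not in NO_LINE_START and text[i - 1] not in NO_LINE_END


def wrap(text, width):
    """Greedily wrap text to at most `width` characters per line.

    When the natural break position is forbidden, back up to the nearest
    permitted one. If the line contains no permitted break at all, break at
    `width` anyway (how to handle that case is a policy choice).
    """
    if width < 1:
        raise ValueError("width must be at least 1")
    lines = []
    start = 0
    while len(text) - start > width:
        end = start + width
        while end > start + 1 and not can_break_before(text, end):
            end -= 1
        if not can_break_before(text, end):
            end = start + width  # no permitted break found; force one
        lines.append(text[start:end])
        start = end
    lines.append(text[start:])
    return lines


if __name__ == "__main__":
    for line in wrap("「これは、テストです。」", 4):
        print(line)
```

The sketch counts characters rather than measuring glyph widths, and it pushes characters down to the next line instead of letting punctuation hang past the margin; both are simplifications you would probably want to revisit in a real text layout.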

Regards
Markus
--
__________________________________________
Markus Spoettl