Re: [rust-dev] unicode support and core

Graydon Hoare Fri, 23 Dec 2011 19:04:01 -0800

On 11-12-23 12:03 PM, Marijn Haverbeke wrote:

I'm also curious what people think are "the important parts" of unicode.


Character classification is very important, and should be in core I
think (if only to encourage people to actually use it instead of
rolling their own... badly).

Yeah. I looked at ways of doing a minimalist build of libicu today and
it just gets really, really gross. Also it's a lot of layers of
indirection for something that ought to be pretty fast (core lexing
routines and such). So I just did a python-conversion monstrosity into
rust code. Adds about 80kb to libcore optimized and gets us the general
categories and a couple important derived properties (XID_Start /
Continue, Alphabetic).
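
For illustration only, a table-driven property lookup of the kind described above might look roughly like the sketch below. This is not the generated libcore code: it is written in present-day Rust syntax, the names XID_START_SAMPLE, in_table and is_xid_start_sample are hypothetical, and the table is a tiny hand-picked subset of XID_Start covering a few ranges below U+0100. A real table would be generated from the Unicode data files and be far larger.

```rust
use std::cmp::Ordering;

// Hypothetical, heavily truncated sample of XID_Start, stored as sorted,
// non-overlapping, inclusive (first, last) code point ranges.
// A generated table would cover the whole code space.
const XID_START_SAMPLE: &[(char, char)] = &[
    ('A', 'Z'),
    ('a', 'z'),
    ('\u{aa}', '\u{aa}'),
    ('\u{b5}', '\u{b5}'),
    ('\u{ba}', '\u{ba}'),
    ('\u{c0}', '\u{d6}'),
    ('\u{d8}', '\u{f6}'),
];

// Binary search over the sorted ranges: a range compares as Greater when it
// lies entirely above `c`, Less when entirely below, Equal when it holds `c`.
fn in_table(c: char, table: &[(char, char)]) -> bool {
    table
        .binary_search_by(|&(lo, hi)| {
            if c < lo {
                Ordering::Greater
            } else if c > hi {
                Ordering::Less
            } else {
                Ordering::Equal
            }
        })
        .is_ok()
}

// Classification against the sample table only; anything outside the
// sample ranges is reported as false, which is NOT true of real XID_Start.
fn is_xid_start_sample(c: char) -> bool {
    in_table(c, XID_START_SAMPLE)
}
```

A lexer would call something like is_xid_start_sample(c) on the first character of a candidate identifier. Storing ranges rather than one entry per code point is one way to keep such tables compact, and the lookup stays a plain binary search with no indirection through an external library.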

Encodings are something people will occasionally need, but a much less
important thing. This doesn't have to be in core, I think. (And, if I
understand correctly, much of libicu is encoding tables.)

Agreed. I think it's fine if we keep this stuff in a "full" binding to
libicu outside core. I'll keep updating the char API and munged unicode
data tables as needed, but this seems semi-workable.

(We'll probably need NFKC and a couple other bits in core, but hopefully
not *too* much.)


-Graydon
_______________________________________________
Rust-dev mailing list
[email protected]
https://mail.mozilla.org/listinfo/rust-dev
