Hi group.

I did a port of ICU character detection to javascript. (the same charset 
detection used in chromium e.g)

NPM module: charset-detector
Repo: https://github.com/btd/node-charset-detector

I did it for several reasons: to replace my binding for icu4c in one of my 
projects, to learn a bit about encodings and as i have some free time.

I know that exists already jschardet - which is a port of mozilla charset 
detector, but 
by license MIT (my) vs LGPL
by speed my ~30% faster (i did simple benchmarks of utf8 detection and 
several multibyte encodings)
by list of supported encodings: jschardet knows more =) (if the ICU would 
have it, I would have also ported =)


-- 
Job board: http://jobs.nodejs.org/
New group rules: 
https://gist.github.com/othiym23/9886289#file-moderation-policy-md
Old group rules: 
https://github.com/joyent/node/wiki/Mailing-List-Posting-Guidelines
--- 
You received this message because you are subscribed to the Google Groups 
"nodejs" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To post to this group, send email to [email protected].
To view this discussion on the web visit 
https://groups.google.com/d/msgid/nodejs/9e7b1967-818e-4ebf-9ef0-e0b3ecb5b05f%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Reply via email to