java-nio-charset-enhanced -- Milestone 4 is released

Ulf Zibis Fri, 27 Mar 2009 13:29:58 -0700

Hi folks,

milestone 4 of charset enhancement is released.

- I reduced the jar-footprint, concerning entire single-byte needs,compared to original JDK 6 binaries, down to 7 %, which also shouldperform class loading, (not to forget: encoder maps are lazyinitialized), even though there are added 21 specialized coder algorithms.- In this release there is only 1 class <SingleByteCharset> for allsingle-byte charsets, which reads decoder mapping + all names includingaliases from a small data file (69..731 Bytes, average 250 Bytes). Thisis possible, because numerous charsets can inherit their mappings (2562-byte chars) from each other, and empty or 1:1 ranges (especially\u0000..\u007F) are filled by constructor.- Additionally a set of 7 Decoder and 14 Encoder classes do there work,specially speed + memory optimised for the charsets, having diversecharacter spreading and frequency of occurrence. A special MapCalculatorclass for playing with different parameters is provided in the test package.- The aliases and historical names should no more statically andentirely loaded, provided and linked from StandardCharsets class. Theyadditionally could be easy edited in files standard-charsets andextended-charsets (refer Bug Id: 6795538<http://bugs.sun.com/bugdatabase/view_bug.do?bug_id=6795538>). If someday they are defined entirely upper-case, they could be omittedcompletely, as they are redundantly case-standardised existing in theFastCharsetProvider lookup maps. Determining the 'contains()' referencesby this way would be also reasonable (refer Bug Id: 6761481<http://bugs.sun.com/bugdatabase/view_bug.do?bug_id=6761481>), butcontainment of ASCII is already calculated automatically.


See my projects home: ---> https://java-nio-charset-enhanced.dev.java.net/

I believe, these techniques could also be used for most multi-bytecharsets, especially inheriting maps to reduce entire charsets footprint.

_Outlook Milestone 5_ : Final performance optimisation by dedicatedinlining, exception catching, surrogate handling etc..*Urgently waiting* for ChristianThalinger's optimization of "widening conversions".



Happy easter,

-Ulf

P.S.: I'm on the way, providing changesets slice by slice for OpenJDK 7.

BTW: Is there a way to add author and/or contributor annotation in thesources to _honour the investigation of external collaborators_ (almost1 year in my case)?

java-nio-charset-enhanced -- Milestone 4 is released

Reply via email to