toCharArray()

Alan Bateman Thu, 28 Apr 2011 04:16:11 -0700

Xueming Shen wrote:

 Hi
This is motivated by Neil's request to optimize the common-case UTF8 path for native ZipFile.getEntry calls [1]. As I said in my reply email [2], I believe a better approach might be to "patch" the UTF8 charset directly to implement the sun.nio.cs.ArrayDecoder/Encoder interface to speed up the coding operation for array-based encoding/decoding under certain circumstances, as we did for all single-byte charsets in #6636323 [3]. I have an old blog [4] that has some data for this optimization.
The original plan was to do the same thing for our new UTF8 [5] as well in JDK7, but then (excuse, excuse) I was just too busy to come back to this topic till 2 days ago. After two days of small tweaking here and there and testing those possible corner cases I can think of, I'm happy with the result and think it might be worth sending it out for a codereview for JDK7, knowing we only have a couple of days left.

I skimmed through the webrev and I agree this is a better approach. I will try to do a detailed review before Monday. It would be great if others on the list could jump in and help too, as we are running out of time.

Neil - I don't know if you've had a chance to look at Sherman's changes, but I think it's better than checking if mUTF-8 can be used. If you agree, then would you have time to run the tests that alerted you to this performance regression? There's a patch file in the webrev.


-Alan.

Re: Codereview request: CR 7040220 java/char_encodin Optimize UTF-8 charset for String.getBytes()/toCharArray()
