Re: The use of UTIUtil.toUsingCharset?

Sung-Gu Tue, 04 Feb 2003 02:36:50 -0800

Hi Oleg,
Again... well..
Ok... let me try to make you understand it again.  HmmHmm...


BTW, sorry to bother you that I haven't got you to get it right away
at that time even with a diagram and still...  :(

Actually, that's very easy...
And not that important unless it's not going to be support multilinqual.

As you see the diagram, bytes informations created from the original charset
should be restored.  That's all.

There isn't any uni-one to support the various charsets.(Let you regard it!)
Then, once it was tranformed, it should be tranformed back to the original.
That makes the transformed one to the original one.

If you understand this theory, you should try to understand why there would
need have some container for protocol processing and document browser
thingy.

You're supposed to find a way for tranfomation from and to Unicode to me?
It was your question?
But I don't get it really what you ask to me.
Because it's not for. There isn't that kinda of things.
(You should suppose it and agree with me!)

Then... why is that required?  Please, guess what for?
I said that it's required to support multi-langual!
Then why don't you try to do that?  Just simple.

Well, I'm afraid you still doesn't understand it...

Sung-Gu

P.S.: This time I just ignored any technical issues on my article.
  I hope that may help you...


> ----- Original Message -----
> From: "Kalnichevski, Oleg" <[EMAIL PROTECTED]>
> To: "Commons HttpClient Project"
<[EMAIL PROTECTED]>
> Sent: Tuesday, January 28, 2003 5:37 PM
> Subject: RE: The use of UTIUtil.toUsingCharset?
>
>
> Sung-Gu
> You are right. The examples I presented are meaningless. They are
meaningless, because URIUtil.toUsingCharset method is meaningless in the
very first place. I did my best to explain why
>
> Again, please give me an example (or better a unit test) demonstrating a
meaningful transformation of one Unicode string into another Unicode string
using the method in question
>
> Oleg
>
> -----Original Message-----
> From: Sung-Gu [mailto:[EMAIL PROTECTED]]
> Sent: Montag, 27. Januar 2003 06:01
> To: Commons HttpClient Project
> Subject: Re: The use of UTIUtil.toUsingCharset?
>
>
> Hi,
>
> I'm sorry that I wasn't reaching your point...
> You're interested in only single-byte encodings with Unicode.
> I haven't realized it...
>
> That's why you haven't seen the correct use and display of that method.
> I guessed so though. (So, I tried to display byte code values)
>
> And I'd like to comment you that your below examples're not
> correct to use...   They're meaning-less...
> For display (what you want I guess), you should use code set
> or char set supported by your operating system or ISO-8859-1.
> For UTF-8 is capable to use only by purposes of transformation
> for storage and transmission.
> The case you want to use Unicode for display, ISO-10464 is
> fully supported and transformation to UTF-8 should be applied
> from UCS....
>
> I made it as TODO comment for simple diagram 2 in text file.
>  It was not my right previous issue.
> (As you know, I'm intersted in double-byte encodings...
>  and it would be general way to solve character encoding)
> I'll do it sometime later...
>
> Sung-Gu
>
> ----- Original Message -----
> From: <[EMAIL PROTECTED]>
> Subject: Re: The use of UTIUtil.toUsingCharset?
>
>
> Please take no offense, but URIUtil.toUsingCharset method still does not
> make even slightest sense to me. Your example shows how to invoke this
> method but does not explain what it is useful for, apart from garbling
> unicode strings
>
> Have a look at a simpler example. Here I attempt to (supposedly) convert
> "Zurich" from one encoding into another. However, as you can see
> URIUtil.toUsingCharset() always produces garbage
>
> ===================================================================
> public static void main(String[] args) throws Exception
> {
>   System.out.println(
>     URIUtil.toUsingCharset("Zurich", "UTF-8", "US-ASCII"));
>   System.out.println(
>     URIUtil.toUsingCharset("Zurich", "ASCII", "UTF-8"));
>   System.out.println(
>     URIUtil.toUsingCharset("Zurich", "UTF-8", "ISO-8859-1"));
>   System.out.println(
>     URIUtil.toUsingCharset("Zurich", "ISO-8859-1", "UTF-8"));
> }
>
>
> Output:
>
> Zi��i��rich
> Z?rich
> ZA&#131;A�rich
> Zi��
>
> =================================================================
>
> Java uses 16 bit to represent characters. Therefore the concept of
character
> encoding is only applicable when working with arrays of bytes, 8 bit
units,
> that represent a sequence of characters. One indeed needs to take
character
> encoding into account when converting from byte[] to String or visa versa.
> However, converting from Unicode String to an array of bytes to a Unicode
> String using different encoding (especially in one method call), in my
> opinion, does not produce any sensible results.
>
> If you see things differently, please help me understand what
> URIUtil.toUsingCharset() can be USEFUL for
>
> Cheers
>
> Oleg

---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Re: The use of UTIUtil.toUsingCharset?

Reply via email to