Re: [Users] utf-8 subject in outgoing messages
Should work. Can you test it and advise if it works? Then I'll apply your patch. On Nov 25, 2010, at 18:01, Piotr Isajew wrote: > On Thu, Nov 25, 2010 at 11:19:32AM +0100, Piotr Isajew wrote: >> 0x16 0xea: looks like encoding indication for me, doesn't match utf-8 >> above, I'm not sure, why > > 0xEA is an indication of UTF-8 charset (i'm not sure why it differs > from the value I quoted before) and 0x16 is a number of octets in > subject including charset octet and null-terminator. > > I think the fix would need to modify mms_pack_well_known_field(), > create separate case for MMS_HEADER_SUBJECT and push those two octets > into os before call to wsp_pack_text() > > Paul, what do you think about this? > > ___ > Users mailing list > Users@mbuni.org > http://lists.mbuni.org/mailman/listinfo/users ___ Users mailing list Users@mbuni.org http://lists.mbuni.org/mailman/listinfo/users
Re: [Users] utf-8 subject in outgoing messages
On Thu, Nov 25, 2010 at 11:19:32AM +0100, Piotr Isajew wrote: > 0x16 0xea: looks like encoding indication for me, doesn't match utf-8 > above, I'm not sure, why 0xEA is an indication of UTF-8 charset (i'm not sure why it differs from the value I quoted before) and 0x16 is a number of octets in subject including charset octet and null-terminator. I think the fix would need to modify mms_pack_well_known_field(), create separate case for MMS_HEADER_SUBJECT and push those two octets into os before call to wsp_pack_text() Paul, what do you think about this? pgpy0fK3uKDQd.pgp Description: PGP signature ___ Users mailing list Users@mbuni.org http://lists.mbuni.org/mailman/listinfo/users
Re: [Users] utf-8 subject in outgoing messages
Maybe it's specific for MM1 with which I'm so concerned, that I often forget about existence of other protocols ;-) However, fast google in the morning led me to the following: 1. OMA-MMS-ENC-V1_2: (BNF) Subject-value = Encoded-string-value Encoded-string-value = Text-string | Value-length Char-set Text-string The Char-set values are registered by IANA as MIBEnum value. UTF-8 character-set encoding SHOULD be supported in Encoded-string-value. If the MMS Client uses UTF-8 character-set encoding, the Char-set parameter SHOULD be used to indicate its usage. Encoding according to [RFC2047] MAY be supported in the MMS Client and/or MMS Proxy-Relay. Encoding according to [RFC2047] SHOULD only be used without "Value-length Char-set" parameters. [RFC2047] encoding for UTF-8 character- set encoding MAY be supported in the MMS Client and/or MMS Proxy-Relay. Note: The usage of Unicode character-set encoding is recommended. The supported set of actual character-sets in the MMS Client is up to the implementation. The MMS Client must not rely on the MMS Proxy-Relay doing any character-set transformation. 2. WAP-203-WSP: Table 42. Character Set Assignment: utf-8: 0x6a 3. tcpdumps (my comments is just guessing for now): incoming to mbuni from handset with utf-8 (I hope so - polish characters properly decoded) encoded subject: 0x96: start of subject field 0x16 0xea: looks like encoding indication for me, doesn't match utf-8 above, I'm not sure, why 0x20: space ... maybe UA inserts it 0xc5 0xbb: first UTF-8 encoded character outgoing from mbuni to handset (UTF-8 chars are badly decoded on handset): 0x96: start of subject field 0x7f: don't know what's this 0xc5 0xbb: first UTF-8 encoded character On Thu, Nov 25, 2010 at 12:56:13PM +0300, Paul Bagyenda wrote: > How should the charset information be encoded? > On Nov 25, 2010, at 12:34, Piotr Isajew wrote: > > > Hi, > > > > Is there any way to properly use Send MMS service to send MMS message > > with UTF-8 encoded subject? > > > > I tried both passing raw utf-8 string as 'subject' parameter of get > > request and encoding it according to RFC-2047 before. > > > > From tcpdump it looks for me that no charset information is encoded in > > Subject field. > > > > Regards, > > > > Piotr > > ___ > > Users mailing list > > Users@mbuni.org > > http://lists.mbuni.org/mailman/listinfo/users > > ___ > Users mailing list > Users@mbuni.org > http://lists.mbuni.org/mailman/listinfo/users > > pgp6gl1nSXx3c.pgp Description: PGP signature ___ Users mailing list Users@mbuni.org http://lists.mbuni.org/mailman/listinfo/users
Re: [Users] utf-8 subject in outgoing messages
How should the charset information be encoded? On Nov 25, 2010, at 12:34, Piotr Isajew wrote: > Hi, > > Is there any way to properly use Send MMS service to send MMS message > with UTF-8 encoded subject? > > I tried both passing raw utf-8 string as 'subject' parameter of get > request and encoding it according to RFC-2047 before. > > From tcpdump it looks for me that no charset information is encoded in > Subject field. > > Regards, > > Piotr > ___ > Users mailing list > Users@mbuni.org > http://lists.mbuni.org/mailman/listinfo/users ___ Users mailing list Users@mbuni.org http://lists.mbuni.org/mailman/listinfo/users
[Users] utf-8 subject in outgoing messages
Hi, Is there any way to properly use Send MMS service to send MMS message with UTF-8 encoded subject? I tried both passing raw utf-8 string as 'subject' parameter of get request and encoding it according to RFC-2047 before. From tcpdump it looks for me that no charset information is encoded in Subject field. Regards, Piotr pgp26k5bC3mPw.pgp Description: PGP signature ___ Users mailing list Users@mbuni.org http://lists.mbuni.org/mailman/listinfo/users