Re: [Users] utf-8 subject in outgoing messages

2010-11-25 Thread Paul Bagyenda
Should work. Can you test it and advise if it works? Then I'll apply your patch.
On Nov 25, 2010, at 18:01, Piotr Isajew wrote:

> On Thu, Nov 25, 2010 at 11:19:32AM +0100, Piotr Isajew wrote:
>> 0x16 0xea: looks like an encoding indication to me; it doesn't match
>> the utf-8 value above, and I'm not sure why
> 
> 0xEA is an indication of the UTF-8 charset (I'm not sure why it differs
> from the value I quoted before) and 0x16 is the number of octets in
> the subject, including the charset octet and the null terminator.
> 
> I think the fix would need to modify mms_pack_well_known_field(),
> adding a separate case for MMS_HEADER_SUBJECT that pushes those two
> octets into os before the call to wsp_pack_text().
> 
> Paul, what do you think about this?

___
Users mailing list
Users@mbuni.org
http://lists.mbuni.org/mailman/listinfo/users


Re: [Users] utf-8 subject in outgoing messages

2010-11-25 Thread Piotr Isajew
On Thu, Nov 25, 2010 at 11:19:32AM +0100, Piotr Isajew wrote:
> 0x16 0xea: looks like an encoding indication to me; it doesn't match
> the utf-8 value above, and I'm not sure why

0xEA is an indication of the UTF-8 charset (I'm not sure why it differs
from the value I quoted before) and 0x16 is the number of octets in
the subject, including the charset octet and the null terminator.

I think the fix would need to modify mms_pack_well_known_field(),
adding a separate case for MMS_HEADER_SUBJECT that pushes those two
octets into os before the call to wsp_pack_text().
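
For illustration, here is a minimal Python sketch of the byte layout
described above. It is not mbuni code (the real change would be in C,
inside mms_pack_well_known_field()), and all function names below are
made up for this example. It assumes that the charset octet is the WSP
short-integer form of the MIBenum (0x6A with the high bit set gives
0xEA, which would explain the difference from the 0x6a in the WSP
table), that the length counts every octet after it, and that a 0x7F
Quote octet precedes text whose first octet is >= 0x80, as the WSP
Text-string rule requires.

```python
UTF8_MIBENUM = 0x6A   # utf-8 in WAP-203-WSP Table 42 (IANA MIBenum 106)
SUBJECT_FIELD = 0x96  # Subject header octet, as seen in the tcpdumps


def encode_uintvar(n: int) -> bytes:
    """WSP uintvar: 7 bits per octet, high bit set on all but the last."""
    out = [n & 0x7F]
    n >>= 7
    while n:
        out.append((n & 0x7F) | 0x80)
        n >>= 7
    return bytes(reversed(out))


def encode_value_length(n: int) -> bytes:
    """Short-length (0..30) as one octet, else Length-quote (31) + uintvar."""
    if n <= 30:
        return bytes([n])
    return bytes([31]) + encode_uintvar(n)


def pack_text_string(raw: bytes) -> bytes:
    """Text-string: optional Quote (0x7F), the text, then a NUL terminator."""
    quote = b"\x7f" if raw and raw[0] >= 0x80 else b""
    return quote + raw + b"\x00"


def pack_subject(subject: str) -> bytes:
    """Subject field as 'Value-length Char-set Text-string' with UTF-8."""
    charset = bytes([UTF8_MIBENUM | 0x80])  # short-integer form: 0xEA
    body = charset + pack_text_string(subject.encode("utf-8"))
    return bytes([SUBJECT_FIELD]) + encode_value_length(len(body)) + body


if __name__ == "__main__":
    print(pack_subject("Żółw").hex(" "))
```

Note that if the Quote octet is emitted, it has to be counted in the
length octet as well.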

Paul, what do you think about this?





Re: [Users] utf-8 subject in outgoing messages

2010-11-25 Thread Piotr Isajew
Maybe it's specific to MM1, which I'm so focused on that I often
forget other protocols exist ;-) However, a quick Google search this
morning led me to the following:

1. OMA-MMS-ENC-V1_2:

(BNF)

Subject-value = Encoded-string-value

Encoded-string-value = Text-string | Value-length Char-set Text-string

The Char-set values are registered by IANA as MIBEnum value.  UTF-8
character-set encoding SHOULD be supported in Encoded-string-value. If
the MMS Client uses UTF-8 character-set encoding, the Char-set
parameter SHOULD be used to indicate its usage.  Encoding according to
[RFC2047] MAY be supported in the MMS Client and/or MMS
Proxy-Relay. Encoding according to [RFC2047] SHOULD only be used
without "Value-length Char-set" parameters. [RFC2047] encoding for
UTF-8 character-set encoding MAY be supported in the MMS Client
and/or MMS Proxy-Relay.  Note: The usage of Unicode character-set
encoding is recommended. The supported set of actual character-sets in
the MMS Client is up to the implementation. The MMS Client must not
rely on the MMS Proxy-Relay doing any character-set transformation.

2. WAP-203-WSP:

Table 42. Character Set Assignment:

utf-8: 0x6a

3. tcpdump captures (my comments are just guesses for now):

Incoming to mbuni from a handset, with a UTF-8 encoded subject (I hope
so; the Polish characters are decoded properly):

0x96:  start of subject field
0x16 0xea: looks like an encoding indication to me; it doesn't match
   the utf-8 value above, and I'm not sure why
0x20:  space; maybe the UA inserts it
0xc5 0xbb: first UTF-8 encoded character


Outgoing from mbuni to the handset (UTF-8 characters are decoded
incorrectly on the handset):

0x96:  start of subject field
0x7f:  I don't know what this is
0xc5 0xbb: first UTF-8 encoded character
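
A small Python sketch of how a Subject value could be decoded under the
Encoded-string-value rule quoted in item 1 may help in reading the two
dumps. It rests on assumptions taken from the WSP encoding rules rather
than from the mbuni source: a first octet of 0..30 is a short
Value-length (31 introduces a uintvar length), a charset octet with the
high bit set is a short-integer (so 0xEA would be 0x6A, i.e. utf-8),
and 0x7F is the Text-string Quote octet that precedes text starting
with an octet >= 0x80. The function names and the sample bytes are made
up for the example; the samples only resemble the dumps.

```python
def decode_uintvar(data: bytes, pos: int):
    """Read a WSP uintvar starting at pos; return (value, next_pos)."""
    value = 0
    while True:
        octet = data[pos]
        pos += 1
        value = (value << 7) | (octet & 0x7F)
        if not octet & 0x80:
            return value, pos


def parse_encoded_string(data: bytes):
    """Return (charset MIBenum or None, raw text octets).

    data is everything after the 0x96 Subject field octet.
    """
    if data[:1] == b"\x00":           # empty Text-string
        return None, b""
    pos = 0
    charset = None
    if data[0] <= 31:                 # 'Value-length Char-set Text-string'
        if data[0] == 31:             # Length-quote followed by a uintvar
            _length, pos = decode_uintvar(data, 1)
        else:                         # Short-length in a single octet
            pos = 1
        if not data[pos] & 0x80:
            raise ValueError("sketch handles short-integer charsets only")
        charset = data[pos] & 0x7F    # 0xEA -> 0x6A (utf-8)
        pos += 1
    if data[pos] == 0x7F:             # skip the optional Quote octet
        pos += 1
    end = data.index(0, pos)          # text runs up to the NUL terminator
    return charset, data[pos:end]


if __name__ == "__main__":
    incoming = bytes.fromhex("0aea20c5bbc3b3c5827700")  # with charset
    outgoing = bytes.fromhex("7fc5bbc3b3c5827700")      # bare Text-string
    print(parse_encoded_string(incoming))
    print(parse_encoded_string(outgoing))
```

Read this way, the outgoing form is a bare Text-string with no charset
indication, which would leave the handset to guess the encoding.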




On Thu, Nov 25, 2010 at 12:56:13PM +0300, Paul Bagyenda wrote:
> How should the charset information be encoded?
> On Nov 25, 2010, at 12:34, Piotr Isajew wrote:
> 
> > Hi,
> > 
> > Is there any way to properly use the Send MMS service to send an MMS
> > message with a UTF-8 encoded subject?
> > 
> > I tried both passing a raw UTF-8 string as the 'subject' parameter of
> > the GET request and encoding it according to RFC 2047 first.
> > 
> > From tcpdump it looks to me as if no charset information is encoded in
> > the Subject field.
> > 
> > Regards,
> > 
> > Piotr




Re: [Users] utf-8 subject in outgoing messages

2010-11-25 Thread Paul Bagyenda
How should the charset information be encoded?
On Nov 25, 2010, at 12:34, Piotr Isajew wrote:

> Hi,
> 
> Is there any way to properly use the Send MMS service to send an MMS
> message with a UTF-8 encoded subject?
> 
> I tried both passing a raw UTF-8 string as the 'subject' parameter of
> the GET request and encoding it according to RFC 2047 first.
> 
> From tcpdump it looks to me as if no charset information is encoded in
> the Subject field.
> 
> Regards,
> 
> Piotr


[Users] utf-8 subject in outgoing messages

2010-11-25 Thread Piotr Isajew
Hi,

Is there any way to properly use the Send MMS service to send an MMS
message with a UTF-8 encoded subject?

I tried both passing a raw UTF-8 string as the 'subject' parameter of
the GET request and encoding it according to RFC 2047 first.

From tcpdump it looks to me as if no charset information is encoded in
the Subject field.
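
For reference, the two forms of the 'subject' value mentioned above can
be produced roughly as in the Python sketch below. The subject text is
a made-up example, and only the parameter value is built: the Send MMS
URL and its other parameters are not given here, so they are left out
rather than guessed.

```python
from email.header import Header, decode_header
from urllib.parse import quote

subject = "Żółw"  # hypothetical subject containing Polish characters

# Form 1: the raw UTF-8 string, percent-encoded for use in a GET query.
raw_param = quote(subject)

# Form 2: an RFC 2047 encoded-word, then percent-encoded for the query.
rfc2047_word = Header(subject, "utf-8").encode()
rfc2047_param = quote(rfc2047_word)

if __name__ == "__main__":
    print(raw_param)
    print(rfc2047_word)
    print(rfc2047_param)
```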

Regards,

Piotr

