Then I don't understand what should be the issue here :-) ? Thanks, Alex
Am 01.04.2012 um 23:15 schrieb spameden: > Exactly what I've said :) > > If your source text is in utf8 you need to specify charset=utf8 and coding=2. > > 2012/4/2 Alexander Malysh <amal...@kannel.org> > Hi, > > cyrillic can only be send with ucs2 therefore coding=2. > > Kannel behavior for coding=2 and 3 is simple: don't touch it it's binary and > up to user to encode it BUT > if you need that kannel converts some charset to ucs2 for you then just use > two params: > charset=YOUR_CHARSET > coding=2 > > Then kannel will do it for you. > > Thanks, > Alex > > Am 31.03.2012 um 00:45 schrieb chad selph: > >> I understand that coding=2 stands for UCS-2 but the problem I'm pointing out >> is that it doesn't actually re-encode the UTF8 bytes into actual UCS-2 >> bytes. This is inconsistent because it will convert utf8 to GSM, or to >> Latin-1 (if the alt-charset is set to Latin1). >> >> As far as the "charset" parameter: from my understand of the docs, it's >> actually irrelevant to the SMPP stuff, this is just for you to tell smsbox >> which percent encoding your text is in (URLs only support ascii). It >> defaults to UTF-8 in the newer versions and this is what prefer to use. But >> the important thing is that it has no relevance to the data_coding that gets >> sent over SMPP. >> >> >> On Fri, Mar 30, 2012 at 3:20 PM, spameden <spame...@gmail.com> wrote: >> utf8 + coding=0 never worked for me for cyrillic text messages. >> >> the only combination is coding=2 & charset=utf8, otherwise I'm getting >> bollocks on mobile screen. >> >> according to the kannel's documentation, coding is: >> >> coding number >> Optional. Sets the coding >> scheme bits in DCS field. >> Accepts values 0 to 2, for 7bit, >> 8bit or UCS-2. If unset, defaults >> to 7 bits unless a udh is defined, >> which sets coding to 8bits. >> >> so coding=2 stands for UCS-2 message. >> >> >> 2012/3/31 chad selph <chad.se...@gmail.com> >> I'm trying to figure out how to send different data encodings from Kannel >> 1.5.0 over SMPP. The SMPP Spec lists the following options for data_coding >> field: >> >> 0 0 0 0 0 0 0 0 SMSC Default Alphabet >> 0 0 0 0 0 0 0 1 IA5(CCITTT.50)/ASCII(ANSIX3.4) >> 0 0 0 0 0 0 1 0 Octet unspecified (8-bit binary) >> 0 0 0 0 0 0 1 1 Latin1(ISO-8859-1) >> 0 0 0 0 0 1 0 0 Octet unspecified (8-bit binary) >> 0 0 0 0 0 1 0 1 JIS(X0208-1990) >> 0 0 0 0 0 1 1 0 Cyrllic(ISO-8859-5) >> 0 0 0 0 0 1 1 1 Latin/Hebrew (ISO-8859-8) >> 0 0 0 0 1 0 0 0 UCS2(ISO/IEC-10646) >> ... and some others. >> >> To initiate MT messages, we're using the sendsms http interface on smsbox >> (the one here: >> http://www.kannel.org/download/1.5.0/userguide-1.5.0/userguide.html#AEN4623 >> ). It looks like the only relevant parameter into the sendsms is the >> "coding" parameter, which can only be 0, 1, or 2. "0" causes data_coding 0, >> 1 causes 4, and 2 causes 8. I don't see a way to set data_coding to 3, for >> example, in order to do Latin-1. >> >> Another thing is that only 0 causes the message text to get encoded from >> UTF-8 (input encoding from http) into the correct encoding. For example, >> sending the UTF-8 data with coding=2 does not re-encode the message into >> USC-2, but just sends your UTF-8 bytes as if they were UCS-2 but sending >> utf8 data with coding=0 does re-encode them into GSM. >> >> These things seem to me to be incorrect behavior, however given the wide use >> of kannel I figured I should make sure I'm not missing something obvious >> before I draft a patch to attempt to fix them. Am I missing something? >> >> > >