Disclaimer: These archives are mirrored from smsforum.net in 2007 before the forum got closed. Please only part of the forum is available here.
For any clarifications regarding these archives you can contact us at http://www.telecomspace.com/contact.

      TELECOMSPACE HOME PAGE         TELECOM DISCUSSION FORUM          CONTACT

+  SMS Forum Online Discussion
|-+  SMS Technologies
| |-+  ANSI-41 (CDMA & TDMA) Related (Moderator: SMS Forum Support)
| | |-+  Encoding method for sms : OCTET
« previous next »
Pages: [1] Go Down Print
Author Topic: Encoding method for sms : OCTET  (Read 1168 times)
milo
Full Member
***
Offline Offline

Posts: 12


« on: April 17, 2006, 09:13:27 UTC »

Hi all,

Anyone knows OCTET encoding method? We are considering to use UTF-8 or Octet encoding method for SMSC, but I can't find standard for OCTET.

Highly appreciate all your help.

Thanks,
milo
Logged
itsnomihere
Sr. Member
****
Offline Offline

Posts: 44


« Reply #1 on: April 17, 2006, 16:43:38 UTC »

http://smsforum.net/smf/index.php?topic=2270.msg6028#msg6028
this post might help
Logged
shad
Sr. Member
****
Offline Offline

Posts: 54


« Reply #2 on: April 17, 2006, 17:14:57 UTC »

Hi Milo

(This is an opinion, not a definite explanation:)

TSB-58 / C.R1001 shows this encoding as "octet, unspecified". I think this deliberately doesn't have a single associated standard - you are free to use any structure you like, so long as the SME at the other end can work out how to decode it. An example would be WAP, where the teleservice ID defines how the message is coded.

For simple (english character) text, you might be better off using 7-bit ASCII.

:-)
shad

Logged
SMS Forum Support
SMS Forum Support
Administrator
Sr. Member
*****
Offline Offline

Posts: 1754


WWW
« Reply #3 on: April 17, 2006, 18:52:44 UTC »

octet unspecified is a raw binary octet stream.. there is no encoding method behind it.. its a simple stream of raw octets of data and is used to carry binary messages. Its never used to carry readable text.
Logged

Regards,
   Cormac Long
   Webmaster & Technical Enquiry Moderator,
   SMS Forum

milo
Full Member
***
Offline Offline

Posts: 12


« Reply #4 on: April 18, 2006, 03:29:57 UTC »

Dear all,

Thanks so much for all your kind help.

Due to Nokia handset doesn't support UTF-8, just Octet. So, our SMSC has to transform all kind encoded messages (UTF-8, UCS2, ...) to Octet before sending to MT terminal.

If SMSC transform UTF-8 message (MO) to Octet (MT), I wonder it will be kept original and display correctly or not? (I am mentioning to vietnamese language).

Thanks,
milo
Logged
SMS Forum Support
SMS Forum Support
Administrator
Sr. Member
*****
Offline Offline

Posts: 1754


WWW
« Reply #5 on: April 18, 2006, 23:37:25 UTC »

In CDMA or GSM.. Vietnamese is encoded in UCS2.. 2-octet unicode pairs.. big endian with no BOM (byte order marker). The most common means of an ESME sending this encoding is in UCS2, having the SMSC merely copy the raw data as-is to the MT message (no conversion needed).

the above is what most SMSCs will do and similarly ESMEs that are using Vietnamese and other Asian/Arabic (non-Latin) character sets.

Logged

Regards,
   Cormac Long
   Webmaster & Technical Enquiry Moderator,
   SMS Forum

milo
Full Member
***
Offline Offline

Posts: 12


« Reply #6 on: April 20, 2006, 02:52:39 UTC »

As I know, Nokia support only OCTET for signed languages (Latin language with sign such as French, German, Vietnamese, ...).

So I am wondering OCTET can be used editing and displaying correctly all character of Latin signed language or not?
« Last Edit: April 20, 2006, 02:57:36 UTC by milo » Logged
SMS Forum Support
SMS Forum Support
Administrator
Sr. Member
*****
Offline Offline

Posts: 1754


WWW
« Reply #7 on: April 20, 2006, 16:33:38 UTC »

You are wrong there.. Nokia support the standard 7-bit GSM alphabet for Latin languages.. English, French etc and UCS2 for the Asian languages.. CJK, Thai, Cyrillic etc. the very same applies for other manufacturers as it is strictly defined in GSM standards.

OCTET unspecified is not used to display text.. its encoded as GSM 7-bit or as UCS2. There are no exceptions to this.

UCS2 can be used to display Latin1.. 'A' is encoded as 00 41 .. (2 octets), 0x41 is the standard ASCII code for 'A'
Logged

Regards,
   Cormac Long
   Webmaster & Technical Enquiry Moderator,
   SMS Forum

Pages: [1] Go Up Print 
« previous next »
Jump to:  


Login with username, password and session length