Character Sets; Ascii Character Set Management; Gsm Character Set Management; Ucs2 Character Set Management - Motorola g20 Developer's Manual

At commands
Hide thumbs Also See for g20:
Table of Contents

Advertisement

2.8

CHARACTER SETS

The following lists references to various tables that provide conversions between the different character sets.
From \ To
ETSI 03.38
ISO/IEC 10646
ISO/IEC 8859-1
ISO-8859-1
For the full content of a specific conversion table, refer to Appendix A, Character Set Tables.
2.8.1

ASCII Character Set Management

The American Standard Code for Information Interchange (ASCII) is a standard seven-bit code that was proposed by ANSI in
1963, and finalized in 1968. ASCII was established to achieve compatibility between various types of data processing
equipment. Later-day standards that document ASCII include ISO-14962-1997 and ANSI-X3.4-1986 (R1997).
2.8.2

GSM Character Set Management

GSM is the default alphabet, as described in section 8.7 (GSM character table) .
g20 can store messages coded in any alphabet on the SIM, irrespective of support of an individual alphabet.
The default alphabet is based on 7bit characters.
For more information, refer to ETSI GSM 3.38 v561.
2.8.3

UCS2 Character Set Management

UCS is the first officially standardized coded character set, eventually to include the characters of all the written languages in
the world, as well as all mathematical and other symbols.
Unicode can be characterized as the (restricted) 2-octet form of UCS on (the most general) implementation level 3, with the
addition of a more precise specification of the bi-directional behavior of characters, as used in the Arabic and Hebrew scripts.
The 65,536 positions in the 2-octet form of UCS are divided into 256 rows with 256 cells in each. The first octet of a character
representation denotes the row number, the second the cell number. The first row (row 0) contains exactly the same characters
as ISO/IEC 8859-1. The first 128 characters are thus the ASCII characters. The octet representing an ISO/IEC 8859-1 character
is easily transformed to the representation in UCS by placing a 0 octet in front of it. UCS includes the same control characters
as ISO/IEC 8859 (also in row 0).
98-08901C68-O

Table 2. References to Character Set Conversion Tables

GSM
GSM
Table CS5
ASCII
UTF8
UCS2
Table CS4
ASCII
UTF8
Table CS7
Table CS2
Table CS2
Table CS3
Product Features
UCS2
ISO-8859-1
Table CS1
Table CS3
Table CS6
15

Advertisement

Table of Contents
loading

Table of Contents