Example K.3-1. Example of Long Text Value Representation in the Chinese Language with Explicit Escape Sequences Between GB2312 G0 and GB2312 G1
Chinese (ISO 2022 IR 58) and ASCII (ISO 646) character sets can be used intermingled with explicit escape sequences between them. The Chinese character set ISO IR 58 is invoked to the G1 area, and the ASCII character set is invoked the G0 area. The following is an example of a Long Text value representation that includes ASCII and Chinese character set. Every line must start in ASCII, end in ASCII.
Specific Character Set:
(0008,0005) \ISO 2022 IR 58
Character String:
1) 第一行文字。
2) 第二行文字。
3) 第三行文字。
Encoded String:
1) ESC 02/04 02/09 04/01 第一行文字。ESC 02/08 04/02
2) ESC 02/04 02/09 04/01 第二行文字。ESC 02/08 04/02
3) ESC 02/04 02/09 04/01 第三行文字。ESC 02/08 04/02
Character encoded representation (GB2312):
0x31 0x2e
0x1B 0x24 0x29 0x41
0xB5 0xDA 0xD2 0xBB 0xD0 0xD0 0xCE 0xC4 0xD7 0xD6 0xA1 0xA3
0x1B 0x28 0x42
0x0D 0x0A
0x32 0x2e
0x1B 0x24 0x29 0x41
0xB5 0xDA 0xB6 0xFE 0xD0 0xD0 0xCE 0xC4 0xD7 0xD6 0xA1 0xA3
0x1B 0x28 0x42
0x0D 0x0A
0x33 0x2e
0x1B 0x24 0x29 0x41
0xB5 0xDA 0xC8 0xFD 0xD0 0xD0 0xCE 0xC4 0xD7 0xD6 0xA1 0xA3
0x1B 0x28 0x42
0x0D 0x0A 0x20
The underlined byte codes correspond to double byte characters, the bold byte codes to escape sequences.
Table K.3-1. Character Sets and Escape Sequences used in the Examples of Person Name
|
Character Set Description |
Component Group |
Value of (0008,0005) Defined Term |
ISO registration number |
Standard for Code Extension |
ESC Sequence |
Code Element |
Character Set: Purpose of Use |
|---|---|---|---|---|---|---|---|
|
Chinese |
First: Phonetic |
Value 1: none |
ISO-IR 6 |
G0 |
ISO 646: |
||
|
Second: Ideographic |
Value 1: ISO 2022 IR 58 |
ISO-IR58 |
ISO 2022 |
ESC 02/04 02/09 04/01 |
G1 |
ISO 2022 CN: Chinese |
|
|
Third: Alphabetic (English name) |
Value 1: none |
ISO-IR 6 |
ISO 2022 |
ESC 02/08 04/02 |
G0 |
ISO 646: For delimiters |