Feature #4066

closed

Encoding GBK needs update

Added by oCameLo (oCameLo oTnTh) about 14 years ago. Updated over 13 years ago.

Status:
Rejected
Target version:

Description

=begin
When GBK was released in 1995, it included 95 characters that were not in Unicode 1.1. As of Windows 7, CP936 still assigns these characters to Unicode Private Use Area (PUA) code points.

GBK isn't an official standard, so I don't think it will be updated anymore. GB18030, however, is official, and its subset of one-byte and two-byte characters is sometimes also referred to as GBK. In GB18030-2005, 81 of the characters that had been assigned to the PUA are now defined in Unicode.

(Reference: http://en.wikipedia.org/wiki/GBK#History)

Actually, the remaining 14 characters are now defined in Unicode, too. Please see the light grey and light yellow cells in gbk_fe05.gif.

All 95 characters are now defined in Unicode (see gbk_mod.htm), so I think we should add them to gbk-tbl.rb. This shouldn't cause any compatibility issues, at least on the Ruby side.
=end
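
To check how a given GBK byte sequence is currently handled, something like the following might help. It is only a rough sketch: the helper name `classify_gbk` and the constant `BMP_PUA` are made up for illustration and are not part of Ruby or of gbk-tbl.rb. It assumes the PUA code points in question fall in the BMP range U+E000..U+F8FF.

```ruby
# Unicode Private Use Area within the Basic Multilingual Plane.
# (Assumption: the PUA code points discussed here fall in this range.)
BMP_PUA = (0xE000..0xF8FF)

# Hypothetical helper: transcodes GBK bytes to UTF-8 and reports whether
# the result is a regular code point, a PUA code point, or has no mapping
# in the transcoding table at all.
def classify_gbk(bytes)
  str  = bytes.pack("C*").force_encoding(Encoding::GBK)
  utf8 = str.encode(Encoding::UTF_8)
  utf8.each_codepoint.any? { |cp| BMP_PUA.cover?(cp) } ? :pua : :assigned
rescue Encoding::UndefinedConversionError, Encoding::InvalidByteSequenceError
  :unmapped
end

# An ordinary GBK character (the two bytes of "中") for comparison.
p classify_gbk([0xD6, 0xD0])
```

Feeding the byte pairs listed in gbk_mod.htm to this helper should show which of the 95 characters are `:unmapped` or `:pua` with the current table, and which would change with the attached diff.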


Files

gbk-tbl.95_chars.diff (4.27 KB), oCameLo (oCameLo oTnTh), 11/18/2010 03:54 AM
gbk_fe05.gif (33.9 KB), oCameLo (oCameLo oTnTh), 11/18/2010 03:54 AM
gbk_mod.htm (6.36 KB), oCameLo (oCameLo oTnTh), 11/18/2010 03:54 AM