Project

General

Profile

Actions

Feature #4073

closed

HKSCS-2008

Added by oCameLo (oCameLo oTnTh) over 13 years ago. Updated almost 13 years ago.

Status:
Closed
Assignee:
-
Target version:
[ruby-core:33256]

Description

=begin
I suspect that Big5-HKSCS in Ruby missed out some mappings during use it, so I extracted a Big5-HKSCS conversion table from c_951.nls (HKSCS-2001, [1]) and used it to check big5-hkscs-tbl.rb. Except the characters were assigned to PUA in HKSCS-2001, the tbl file missed out 8 characters:

 cF9E9 => u255E # '╞'
 cF9EA => u256A # '╪'
 cF9EB => u2561 # '╡'
 cF9F9 => u2550 # '═'
 cF9FA => u256D # '╭'
 cF9FB => u256E # '╮'
 cF9FC => u2570 # '╰'
 cF9FD => u256F # '╯'

And these 8 characters were included in CP951 but not in the tbl file:

 cA15A => u2574 # '╴'
 cA1C3 => uFFE3 # ' ̄'
 cA1C5 => u02CD # 'ˍ'
 cA1FE => uFF0F # '/'
 cA240 => uFF3C # '\'
 cA2CC => u5341 # '十'
 cA2CE => u5345 # '卅'
 cA3E1 => u20AC # '€'

HKSCS is just a supplementary set of Big5, but they didn't finger out which kind of Big5 base on. Consider for compatibility, I think CP951 is a good choice. Also, HKSCS-2008 ([2]) was released for a long time. So I tried to combine CP951 and HKSCS-2008 together to make a new tbl file, but I met some problems.

First, HKSCS-2008 included 3 ligatures:

 c8862 => <00CA,0304>
 c8864 => <00CA,030C>
 c88A3 => <00EA,0304>
 c88A5 => <00EA,030C>

Does Ruby support this, if so, how to code?

Then, these mappings were included in CP951:

 cA2CC, cA451 => u5341
 cA2CE, cA4CA => u5345
 cA2A5, cF9E9 => u255E
 cA2A6, cF9EA => u256A
 cA2A7, cF9EB => u2561
 cA2A4, cF9F9 => u2550
 cA27E, cF9FA => u256D
 cA2A1, cF9FB => u256E
 cA2A2, cF9FC => u2570
 cA2A3, cF9FD => u256F

 u5341 => cA451
 u5345 => cA4CA
 u255E => cF9E9
 u256A => cF9EA
 u2561 => cF9EB
 u2550 => cF9F9
 u256D => cF9FA
 u256E => cF9FB
 u2570 => cF9FC
 u256F => cF9FD

Same question, does Ruby support this, if so, how to code?

Thanks.

[1] http://www.microsoft.com/hk/hkscs/
[2] http://www.ogcio.gov.hk/ccli/chs/hkscs/mapping_table_2008.html
=end


Files

big5_hkscs_2008.patch.gz (311 KB) big5_hkscs_2008.patch.gz oCameLo (oCameLo oTnTh), 11/22/2010 07:18 PM

Related issues 1 (0 open1 closed)

Related to Ruby master - Feature #1784: More encoding (Big5 series) support?Closedduerst (Martin Dürst)07/16/2009Actions
Actions

Also available in: Atom PDF

Like0
Like0Like0Like0Like0Like0Like0Like0Like0Like0