Project

General

Profile

Actions

Bug #9415

closed

Strings#codepoints doesn't respect BOM on UTF-{16,32} pseudo encodings

Added by nobu (Nobuyoshi Nakada) over 8 years ago. Updated about 8 years ago.

Status:
Closed
Priority:
Normal
Target version:
[ruby-dev:<unknown>]

Description

String#codepointsUTF-16UTF-32でのBOMを考慮していません。

$ ruby -e 'puts "%x" % "\u{feff}".encode("UTF-16BE").force_encoding("UTF-16").codepoints'
feff
$ ruby -e 'puts "%x" % "\u{feff}".encode("UTF-16LE").force_encoding("UTF-16").codepoints'
fffe

String#ordなども同様です。

$ ruby -e 'printf "%x\n", "\u{feff}".encode("UTF-16BE").force_encoding("UTF-16").ord'
feff
$ ruby -e 'printf "%x\n", "\u{feff}".encode("UTF-16LE").force_encoding("UTF-16").ord'
fffe

Related issues 1 (0 open1 closed)

Related to Ruby master - Bug #8940: printing UTF-32 crashs rubyClosed09/23/2013Actions
Actions #1

Updated by nobu (Nobuyoshi Nakada) over 8 years ago

  • ruby -v changed from r44601 to -

チケット #9415 が Nobuyoshi Nakada によって報告されました。


Bug #9415: Strings#codepoints doesn't respect BOM on UTF-{16,32} pseudo encodings
https://bugs.ruby-lang.org/issues/9415

  • 作成者: Nobuyoshi Nakada
  • ステータス: Open
  • 優先度: Normal
  • 担当者: Yui NARUSE
  • カテゴリ: M17N
  • 対象バージョン: current: 2.2.0
  • ruby -v: r44601
  • Backport: 1.9.3: UNKNOWN, 2.0.0: UNKNOWN, 2.1: REQUIRED

String#codepointsUTF-16UTF-32でのBOMを考慮していません。

$ ruby -e 'puts "%x" % "\u{feff}".encode("UTF-16BE").force_encoding("UTF-16").codepoints'
feff
$ ruby -e 'puts "%x" % "\u{feff}".encode("UTF-16LE").force_encoding("UTF-16").codepoints'
fffe

String#ordなども同様です。

$ ruby -e 'printf "%x\n", "\u{feff}".encode("UTF-16BE").force_encoding("UTF-16").ord'
feff
$ ruby -e 'printf "%x\n", "\u{feff}".encode("UTF-16LE").force_encoding("UTF-16").ord'
fffe

--
http://bugs.ruby-lang.org/

Actions #2

Updated by nobu (Nobuyoshi Nakada) over 8 years ago

  • Status changed from Open to Closed
  • % Done changed from 0 to 100

Applied in changeset r44606.


string.c: respect BOM

Actions #3

Updated by nobu (Nobuyoshi Nakada) over 8 years ago

  • Related to Bug #8940: printing UTF-32 crashs ruby added

Updated by usa (Usaku NAKAMURA) over 8 years ago

  • Backport changed from 1.9.3: UNKNOWN, 2.0.0: UNKNOWN, 2.1: REQUIRED to 1.9.3: REQUIRED, 2.0.0: REQUIRED, 2.1: REQUIRED

Updated by naruse (Yui NARUSE) about 8 years ago

  • Backport changed from 1.9.3: REQUIRED, 2.0.0: REQUIRED, 2.1: REQUIRED to 1.9.3: REQUIRED, 2.0.0: REQUIRED, 2.1: DONE

r45074

Actions

Also available in: Atom PDF