Bug #7090: UTF-16LE String#<< append 0x0 for certain codepoints - Ruby - Ruby Issue Tracking System


Added by stefan (Stefan Lang) over 13 years ago. Updated over 13 years ago.

Status: Closed
Assignee:
Target version:
ruby -v: ruby 1.9.3p194 (2012-04-20) [x86_64-linux]
Backport:
[ruby-core:47751]

Description

IMO, the behaviour with the UTF-8 string is correct.

$ ri193 'String#<<'
= String#<<

(from ruby core)

str << integer       -> str
str.concat(integer)  -> str
str << obj           -> str
str.concat(obj)      -> str

Append---Concatenates the given object to str. If the object is a
Integer, it is considered as a codepoint, and is converted to a character
before concatenation.

a = "hello "
a << "world"   #=> "hello world"
a.concat(33)   #=> "hello world!"
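
For illustration, the documented behaviour can be tried with a non-ASCII codepoint as well. This is a minimal sketch, assuming a Ruby where the string is UTF-8; the extra U+20AC line is not part of the ri text.

```ruby
# Sketch only: mirrors the ri example, then appends a non-ASCII codepoint.
a = "hello ".dup.force_encoding("UTF-8")  # unfrozen UTF-8 copy of the literal
a << "world"      # String argument: plain concatenation
a.concat(33)      # Integer argument: 33 is the codepoint of "!"
a << 0x20AC       # Integer argument: U+20AC, the euro sign (added for illustration)

p a.codepoints.last(2)  # => [33, 8364]
p a.encoding            # the receiver's encoding is unchanged
```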

AFAIK, a Ruby 1.9 string can be viewed as either 1) a sequence of raw bytes,
or 2) a sequence of codepoints.
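
The two views can be compared directly with String#bytes and String#codepoints. A small sketch, using an arbitrary two-character string chosen only for illustration: the codepoints stay the same across encodings while the bytes differ.

```ruby
# Sketch: same codepoints, different bytes, depending on the encoding.
utf8  = "h\u00E9"                 # "h" plus U+00E9; the \u escape makes the literal UTF-8
utf16 = utf8.encode("UTF-16LE")   # same text, re-encoded

p utf8.codepoints    # => [104, 233]
p utf16.codepoints   # => [104, 233]
p utf8.bytes         # => [104, 195, 169]
p utf16.bytes        # => [104, 0, 233, 0]
```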

Except maybe for regexes, Ruby has no higher-level concept of a "character"
than a codepoint, so I don't know what "and is converted to
a character before concatenation" is supposed to mean.

If we take the sequence-of-codepoints view, then "str << integer" simply
appends a codepoint.

If we take the sequence-of-bytes view, then "str << integer" converts
the codepoint into the sequence of bytes that represents it
in str.encoding and appends that sequence of bytes.
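
The bytes view can be spelled out as a sketch. The helper append_codepoint_bytes below is hypothetical (it is not part of Ruby) and only makes the "encode, then append the bytes" step explicit; on a Ruby where "str << integer" behaves as described here, the two results should agree for a UTF-16LE string.

```ruby
# Hypothetical helper, not part of Ruby: makes the "sequence of bytes" view explicit.
def append_codepoint_bytes(str, codepoint)
  # Encode the codepoint in the receiver's encoding (via a UTF-8 intermediate).
  encoded = [codepoint].pack("U").encode(str.encoding)
  # Concatenate the raw bytes, then restore the original encoding tag.
  (str.b + encoded.b).force_encoding(str.encoding)
end

base       = "ab".encode("UTF-16LE")
via_helper = append_codepoint_bytes(base, 0x20AC)
via_shovel = base.dup << 0x20AC    # the built-in String#<< with an Integer

p via_helper.bytes           # => [97, 0, 98, 0, 172, 32]
p via_helper == via_shovel   # expected to be true when << follows the bytes view
```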

Updated by stefan (Stefan Lang) over 13 years ago
#1 [ruby-core:47753]

Updated by stefan (Stefan Lang) over 13 years ago
#2 [ruby-core:47754]

Updated by naruse (Yui NARUSE) over 13 years ago
#3
