Bug #7156: Invalid byte sequence in US-ASCII when using URI from std lib - Ruby - Ruby Issue Tracking System

Actions

Copy link

Bug #7156

closed

Invalid byte sequence in US-ASCII when using URI from std lib

Bug #7156: Invalid byte sequence in US-ASCII when using URI from std lib

Added by t0d0r (Todor Dragnev) over 13 years ago. Updated over 7 years ago.

Status:

Rejected

Assignee:

naruse (Yui NARUSE)

Target version:

ruby -v:

1.9.3

Backport:

2.3: UNKNOWN, 2.4: UNKNOWN, 2.5: UNKNOWN

[ruby-core:47966]

Description

Invalid byte sequence in US-ASCII on ruby 1.9.3

I receive that error when trying to open url with bulgarian text (utf-8: "История"). It seems that the problem is in uri/common.rb from ruby standard library...

adding str.force_encoding(Encoding::BINARY) to following method fix the problem

class URI::Parser
def escape(str, unsafe = @regexp[:UNSAFE])
unless unsafe.kind_of?(Regexp)
# perhaps unsafe is String object
unsafe = Regexp.new("[#{Regexp.quote(unsafe)}]", false)
end
str.force_encoding(Encoding::BINARY) # FIX
str.gsub(unsafe) do
us = $&
tmp = ''
us.each_byte do |uc|
tmp << sprintf('%%%02X', uc)
end
tmp
end.force_encoding(Encoding::US_ASCII)
end
end

One more suggestion - maybe US_ASCII must be replaced to Encoding::BINARY too?

Files

bulgarian.rb (61 Bytes) bulgarian.rb

mame (Yusuke Endoh), 11/06/2012 08:43 PM

Actions

Copy link

Also available in: PDF Atom

Project

General

Profile

Ruby

Custom queries

Bug #7156

Invalid byte sequence in US-ASCII when using URI from std lib

Updated by meta (mathew murphy) over 13 years ago Actions
Copy link
#1 [ruby-core:48011]

Updated by mame (Yusuke Endoh) over 13 years ago Actions
Copy link
#2 [ruby-core:48972]

Updated by ko1 (Koichi Sasada) over 13 years ago Actions
Copy link
#3 [ruby-core:52322]

Updated by ko1 (Koichi Sasada) over 13 years ago Actions
Copy link
#4 [ruby-core:52414]

Updated by naruse (Yui NARUSE) over 8 years ago Actions
Copy link
#5

Updated by naruse (Yui NARUSE) over 7 years ago Actions
Copy link
#6 [ruby-core:89492]

Project

General

Profile

Ruby

Custom queries

Bug #7156

Invalid byte sequence in US-ASCII when using URI from std lib

Updated by meta (mathew murphy) over 13 years ago ActionsCopy link #1 [ruby-core:48011]

Updated by mame (Yusuke Endoh) over 13 years ago ActionsCopy link #2 [ruby-core:48972]

Updated by ko1 (Koichi Sasada) over 13 years ago ActionsCopy link #3 [ruby-core:52322]

Updated by ko1 (Koichi Sasada) over 13 years ago ActionsCopy link #4 [ruby-core:52414]

Updated by naruse (Yui NARUSE) over 8 years ago ActionsCopy link #5

Updated by naruse (Yui NARUSE) over 7 years ago ActionsCopy link #6 [ruby-core:89492]

Updated by meta (mathew murphy) over 13 years ago Actions
Copy link
#1 [ruby-core:48011]

Updated by mame (Yusuke Endoh) over 13 years ago Actions
Copy link
#2 [ruby-core:48972]

Updated by ko1 (Koichi Sasada) over 13 years ago Actions
Copy link
#3 [ruby-core:52322]

Updated by ko1 (Koichi Sasada) over 13 years ago Actions
Copy link
#4 [ruby-core:52414]

Updated by naruse (Yui NARUSE) over 8 years ago Actions
Copy link
#5

Updated by naruse (Yui NARUSE) over 7 years ago Actions
Copy link
#6 [ruby-core:89492]