Backport #8183

CGI.unescapeHTML does not decode &#X3042;

Added by Kazuhiro NISHIYAMA over 2 years ago. Updated about 2 years ago.

[ruby-dev:47203]
Status:Closed
Priority:Normal
Assignee:Tomoyuki Chikanaga

Description

When a numeric character reference is written with an uppercase X, as in &#X3042;, CGI.unescapeHTML does not convert it back to "あ".

According to the "Hexadecimal numeric character reference" entry in
http://www.w3.org/TR/html5/syntax.html#character-references

which must be followed by either a "x" (U+0078) character or a "X" (U+0058) character,

so I would like the uppercase form to be accepted as well.
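
To illustrate the requested behavior, here is a minimal sketch of decoding hexadecimal numeric character references with either "x" or "X". The helper name decode_hex_refs is hypothetical and this is not the code in lib/cgi/util.rb; it only handles the hexadecimal form and leaves out-of-range references untouched.

```ruby
# Hypothetical helper for illustration only; not the implementation in lib/cgi/util.rb.
# Decodes hexadecimal numeric character references such as &#x3042; or &#X3042;.
def decode_hex_refs(string)
  # [xX] accepts both the lowercase and the uppercase marker, as the HTML5 syntax allows.
  string.gsub(/&#[xX]([0-9a-fA-F]+);/) do
    match = Regexp.last_match
    code = match[1].hex
    # Leave the reference as-is when it is beyond the Unicode range.
    code <= 0x10FFFF ? [code].pack("U") : match[0]
  end
end

puts decode_hex_refs("&#x3042; &#X3042;")
```

With this sketch both spellings are decoded to the same character, which is what the report asks CGI.unescapeHTML to do.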

Associated revisions

Revision 40181
Added by Takeyuki FUJIOKA about 2 years ago

  • lib/cgi/util.rb (CGI::unescapeHTML): fix Hexadecimal numeric character. [Bug #8183]

Revision 40315
Added by Tomoyuki Chikanaga about 2 years ago

merge revision(s) 40181: [Backport #8183]

* lib/cgi/util.rb (CGI::unescapeHTML): fix Hexadecimal numeric character.

[Bug #8183]

History

#1 Updated by Zachary Scott over 2 years ago

  • Status changed from Open to Assigned

#2 Updated by Takeyuki FUJIOKA about 2 years ago

This is a bug. I will fix it.

#3 Updated by Takeyuki FUJIOKA about 2 years ago

  • Status changed from Assigned to Closed
  • % Done changed from 0 to 100

This issue was solved with changeset r40181.
Kazuhiro, thank you for reporting this issue.
Your contribution to Ruby is greatly appreciated.
May Ruby be with you.



#4 Updated by Tomoyuki Chikanaga about 2 years ago

  • Tracker changed from Bug to Backport
  • Project changed from Ruby trunk to Backport200
  • Category deleted (lib)
  • Status changed from Closed to Assigned
  • Assignee changed from Takeyuki FUJIOKA to Tomoyuki Chikanaga

#5 Updated by Tomoyuki Chikanaga about 2 years ago

  • Status changed from Assigned to Closed

This issue was solved with changeset r40315.
Kazuhiro, thank you for reporting this issue.
Your contribution to Ruby is greatly appreciated.
May Ruby be with you.


