Bug #2423

REXML edge case with <a>A &gt; B</a>

Added by Kurt Werle about 2 years ago. Updated 10 months ago.

[ruby-core:27008]
Status:Open Start date:12/03/2009
Priority:Normal Due date:
Assignee:- % Done:

0%

Category:-
Target version:-
ruby -v:ruby 1.8.6 (2008-08-11 patchlevel 287) [universal-darwin9.0]

Description

The attached file: rexml_test.rb was written in the context of a rails app, and used ./script/runner to execute.

It uses Nokogiri by way of comparison to REXML.

There are 4 test cases, and only the last one fails: REXML trying to parse the contents of <c>A &gt; B</c>
It turns out, surprisingly, that the line <!DOCTYPE x SYSTEM "licenseOut.xsd"> is required for this to fail - without it, all tests will work.
========================================
% gem list nokogiri

*** LOCAL GEMS ***

nokogiri (1.4.0, 1.3.3, 1.3.2, 1.2.3, 1.1.0, 1.0.7)
========================================
My output (Mac OS X, Leopard):
% ./script/runner rexml_test.rb
.../monkeypatches.rb:13: warning: already initialized constant RAILS_DEFAULT_LOGGER
HI.  You're using libxml2 version 2.6.16 which is over 4 years old and has
plenty of bugs.  We suggest that for maximum HTML/XML parsing pleasure, you
upgrade your version of libxml2 and re-install nokogiri.  If you like using
libxml2 version 2.6.16, but don't like this warning, please define the constant
I_KNOW_I_AM_USING_AN_OLD_AND_BUGGY_VERSION_OF_LIBXML2 before requring nokogiri.

/Users/lkwerle/.gem/ruby/1.8/gems/nokogiri-1.4.0/lib/nokogiri/xml/builder.rb:272: warning: parenthesize argument(s) for future version
noko_value: A bigger than B
return_string: A bigger than B
noko_value: A > B
/System/Library/Frameworks/Ruby.framework/Versions/1.8/usr/lib/ruby/1.8/rexml/entity.rb:76:in `unnormalized': You have a nil object when you didn't expect it! (NoMethodError)
The error occurred while evaluating nil.record_entity_expansion
	from /System/Library/Frameworks/Ruby.framework/Versions/1.8/usr/lib/ruby/1.8/rexml/doctype.rb:139:in `entity'
	from /System/Library/Frameworks/Ruby.framework/Versions/1.8/usr/lib/ruby/1.8/rexml/text.rb:325:in `unnormalize'
	from /System/Library/Frameworks/Ruby.framework/Versions/1.8/usr/lib/ruby/1.8/rexml/text.rb:323:in `each'
	from /System/Library/Frameworks/Ruby.framework/Versions/1.8/usr/lib/ruby/1.8/rexml/text.rb:323:in `unnormalize'
	from /System/Library/Frameworks/Ruby.framework/Versions/1.8/usr/lib/ruby/1.8/rexml/text.rb:174:in `value'
	from /System/Library/Frameworks/Ruby.framework/Versions/1.8/usr/lib/ruby/1.8/rexml/element.rb:452:in `text'
	from rexml_test.rb:32
	from /Library/Ruby/Site/1.8/rubygems/custom_require.rb:31:in `eval'
	from /.../vendor/rails/railties/lib/commands/runner.rb:46
	from /Library/Ruby/Site/1.8/rubygems/custom_require.rb:31:in `gem_original_require'
	from /Library/Ruby/Site/1.8/rubygems/custom_require.rb:31:in `require'
	from ./script/runner:3

rexml_test.rb - ./script/runner rexml_test.rb # in the context of rails 2.3.2 (828 Bytes) Kurt Werle, 12/03/2009 10:48 am

rexml_test.rb - modified rexml test (847 Bytes) Aaron Patterson, 12/03/2009 12:20 pm

History

Updated by Aaron Patterson about 2 years ago

Can you try with a script that runs outside rails?

The script you've attached doesn't work.  I modified it to what I *think* it should be, but nothing breaks on 1.8.7 or 1.8.6.  I've attached the test script I used.

Also available in: Atom PDF