REXML edge case with <a>A > B</a>
The attached file: rexml_test.rb was written in the context of a rails app, and used ./script/runner to execute.
It uses Nokogiri by way of comparison to REXML.
There are 4 test cases, and only the last one fails: REXML trying to parse the contents of A > B
It turns out, surprisingly, that the line is required for this to fail - without it, all tests will work.
% gem list nokogiri
*** LOCAL GEMS ***
nokogiri (1.4.0, 1.3.3, 1.3.2, 1.2.3, 1.1.0, 1.0.7)¶
My output (Mac OS X, Leopard):
% ./script/runner rexml_test.rb
.../monkeypatches.rb:13: warning: already initialized constant RAILS_DEFAULT_LOGGER
HI. You're using libxml2 version 2.6.16 which is over 4 years old and has
plenty of bugs. We suggest that for maximum HTML/XML parsing pleasure, you
upgrade your version of libxml2 and re-install nokogiri. If you like using
libxml2 version 2.6.16, but don't like this warning, please define the constant
I_KNOW_I_AM_USING_AN_OLD_AND_BUGGY_VERSION_OF_LIBXML2 before requring nokogiri.
/Users/lkwerle/.gem/ruby/1.8/gems/nokogiri-1.4.0/lib/nokogiri/xml/builder.rb:272: warning: parenthesize argument(s) for future version
noko_value: A bigger than B
return_string: A bigger than B
noko_value: A > B
unnormalized': You have a nil object when you didn't expect it! (NoMethodError) The error occurred while evaluating nil.record_entity_expansion from /System/Library/Frameworks/Ruby.framework/Versions/1.8/usr/lib/ruby/1.8/rexml/doctype.rb:139:in entity'
unnormalize' from /System/Library/Frameworks/Ruby.framework/Versions/1.8/usr/lib/ruby/1.8/rexml/text.rb:323:in each'
unnormalize' from /System/Library/Frameworks/Ruby.framework/Versions/1.8/usr/lib/ruby/1.8/rexml/text.rb:174:in value'
text' from rexml_test.rb:32 from /Library/Ruby/Site/1.8/rubygems/custom_require.rb:31:in eval'
gem_original_require' from /Library/Ruby/Site/1.8/rubygems/custom_require.rb:31:in require'
Updated by tenderlovemaking (Aaron Patterson) about 13 years ago
Can you try with a script that runs outside rails?
The script you've attached doesn't work. I modified it to what I think it should be, but nothing breaks on 1.8.7 or 1.8.6. I've attached the test script I used.