Bug #15779
closedAfter NoMemoryError, ruby freezes and takes 100% CPU
Added by buzztaiki (Taiki Sugawara) over 6 years ago. Updated about 4 years ago.
Description
Run following reproduce code, ruby freezes and takes 100% CPU.
require 'open-uri'
begin
  "a" * 10000000000
ensure
  p open('https://www.ruby-lang.org/')
end
But interestingly, the following code does not reproduce this issue.
require 'open-uri'
begin
  begin
    "a" * 10000000000
  rescue NoMemoryError
    raise
  end
ensure
  p open('https://www.ruby-lang.org/')
end
It was also reproduced when put sleep 100 in ensure clause, run it and hit Ctrl-C. But puts 'XXX' does not reproduce it.
Files
| strace.txt (377 KB) strace.txt | strace log | buzztaiki (Taiki Sugawara), 04/29/2019 01:34 AM | 
        
           Updated by buzztaiki (Taiki Sugawara) over 6 years ago
          
          
        
        
          
            Actions
          
          #1
            [ruby-core:92343]
          Updated by buzztaiki (Taiki Sugawara) over 6 years ago
          
          
        
        
          
            Actions
          
          #1
            [ruby-core:92343]
        
      
      Sorry, I wote bug report in Japanese to ruby-core.
In English:
Subject: After NoMemoryError, ruby freezes and takes 100% CPU
Description:
Run following reproduce code, ruby freezes and takes 100% CPU.
require 'open-uri'
begin
  "a" * 10000000000
ensure
  p open('https://www.ruby-lang.org/')
end
But interestingly, the following code does not reproduce this issue.
require 'open-uri'
begin
  begin
    "a" * 10000000000
  rescue NoMemoryError
    raise
  end
ensure
  p open('https://www.ruby-lang.org/')
end
It was also reproduced when put sleep 100 in ensure clause. But puts 'XXX' does not reproduce it.
        
           Updated by buzztaiki (Taiki Sugawara) over 6 years ago
          
          
        
        
          
            Actions
          
          #2
          Updated by buzztaiki (Taiki Sugawara) over 6 years ago
          
          
        
        
          
            Actions
          
          #2
        
      
      - Subject changed from NoMemoryError 後の ensure で ruby が CPU 100% で固まる現象 to After NoMemoryError, ruby freezes and takes 100% CPU
- Description updated (diff)
        
           Updated by buzztaiki (Taiki Sugawara) over 6 years ago
          
          
        
        
          
            Actions
          
          #3
            [ruby-core:92344]
          Updated by buzztaiki (Taiki Sugawara) over 6 years ago
          
          
        
        
          
            Actions
          
          #3
            [ruby-core:92344]
        
      
      I rewrote this issue to English.
        
           Updated by buzztaiki (Taiki Sugawara) over 6 years ago
          
          
        
        
          
            Actions
          
          #4
            [ruby-core:92471]
          Updated by buzztaiki (Taiki Sugawara) over 6 years ago
          
          
        
        
          
            Actions
          
          #4
            [ruby-core:92471]
        
      
      - File strace.txt strace.txt added
In this situation, I want ruby to finish GC quickly or to crash.
In my production code, this issue has been avoided by not creating huge objects and by using rescue and re-raise.
I attach strace log (strace -ttt -o strace.txt ruby a.rb).
It seems to cause loop of Resource temporarily unavailable.
...
1556496315.537693 read(6, 0x55794d88b333, 5) = -1 EAGAIN (Resource temporarily unavailable)
1556496315.538029 read(4, 0x7ffeaf4b5f40, 8) = -1 EAGAIN (Resource temporarily unavailable)
1556496315.538245 read(4, 0x7ffeaf4b5f40, 8) = -1 EAGAIN (Resource temporarily unavailable)
1556496315.538404 read(4, 0x7ffeaf4b5f40, 8) = -1 EAGAIN (Resource temporarily unavailable)
1556496315.538524 read(4, 0x7ffeaf4b5f40, 8) = -1 EAGAIN (Resource temporarily unavailable)
1556496315.538612 read(4, 0x7ffeaf4b5f40, 8) = -1 EAGAIN (Resource temporarily unavailable)
1556496315.538682 read(4, 0x7ffeaf4b5f40, 8) = -1 EAGAIN (Resource temporarily unavailable)
1556496315.538751 read(4, 0x7ffeaf4b5f40, 8) = -1 EAGAIN (Resource temporarily unavailable)
1556496315.538820 read(4, 0x7ffeaf4b5f40, 8) = -1 EAGAIN (Resource temporarily unavailable)
1556496315.538889 read(4, 0x7ffeaf4b5f40, 8) = -1 EAGAIN (Resource temporarily unavailable)
1556496315.538958 read(4, 0x7ffeaf4b5f40, 8) = -1 EAGAIN (Resource temporarily unavailable)
1556496315.539027 read(4, 0x7ffeaf4b5f40, 8) = -1 EAGAIN (Resource temporarily unavailable)
1556496315.539094 read(4, 0x7ffeaf4b5f40, 8) = -1 EAGAIN (Resource temporarily unavailable)
1556496315.539161 read(4, 0x7ffeaf4b5f40, 8) = -1 EAGAIN (Resource temporarily unavailable)
1556496315.539320 read(4, 0x7ffeaf4b5f40, 8) = -1 EAGAIN (Resource temporarily unavailable)
1556496315.539401 read(4, 0x7ffeaf4b5f40, 8) = -1 EAGAIN (Resource temporarily unavailable)
1556496315.539470 read(4, 0x7ffeaf4b5f40, 8) = -1 EAGAIN (Resource temporarily unavailable)
...
        
           Updated by buzztaiki (Taiki Sugawara) over 6 years ago
          
          
        
        
          
            Actions
          
          #5
            [ruby-core:92472]
          Updated by buzztaiki (Taiki Sugawara) over 6 years ago
          
          
        
        
          
            Actions
          
          #5
            [ruby-core:92472]
        
      
      I test this reproduction code at ruby 2.5.5 (ruby 2.5.5p157 (2019-03-15 revision 67260) [x86_64-linux]). In this version, this issue is not reproduced (ruby is not frozen).
~/opt/ruby/2.5.5/bin/ruby a.rb
#<StringIO:0x0000560b317f53b8 @base_uri=#<URI::HTTPS https://www.ruby-lang.org/>, @meta={"server"=>"Cowboy", "strict-transport-security"=>"max-age=31536000", "content-type"=>"text/html", "etag"=>"W/\"91e880089832f50e7e7bf1371600cc60\"", "x-frame-options"=>"SAMEORIGIN", "via"=>"1.1 vegur, 1.1 varnish", "content-length"=>"424", "accept-ranges"=>"bytes", "date"=>"Mon, 29 Apr 2019 01:11:21 GMT", "age"=>"1800", "connection"=>"keep-alive", "x-served-by"=>"cache-hnd18749-HND", "x-cache"=>"HIT", "x-cache-hits"=>"1", "x-timer"=>"S1556500281.267699,VS0,VE6", "vary"=>"Accept-Encoding"}, @metas={"server"=>["Cowboy"], "strict-transport-security"=>["max-age=31536000"], "content-type"=>["text/html"], "etag"=>["W/\"91e880089832f50e7e7bf1371600cc60\""], "x-frame-options"=>["SAMEORIGIN"], "via"=>["1.1 vegur", "1.1 varnish"], "content-length"=>["424"], "accept-ranges"=>["bytes"], "date"=>["Mon, 29 Apr 2019 01:11:21 GMT"], "age"=>["1800"], "connection"=>["keep-alive"], "x-served-by"=>["cache-hnd18749-HND"], "x-cache"=>["HIT"], "x-cache-hits"=>["1"], "x-timer"=>["S1556500281.267699,VS0,VE6"], "vary"=>["Accept-Encoding"]}, @status=["200", "OK"]>
Traceback (most recent call last):
a.rb: failed to allocate memory (NoMemoryError)
        
           Updated by wanabe (_ wanabe) over 6 years ago
          
          
        
        
          
            Actions
          
          #6
            [ruby-core:92486]
          Updated by wanabe (_ wanabe) over 6 years ago
          
          
        
        
          
            Actions
          
          #6
            [ruby-core:92486]
        
      
      I think it is from r58380.
before r58380, rb_memerror calls rb_longjmp via rb_exc_raise and clear raised_flag.
But now rb_memerror keeps raised_flag and just calls EC_JUMP_TAG.
rb_threadptr_execute_interrupts should raise Interrupt exception on hitting ctrl-c,
but just return immediately because th->raised_flag is truthy.
I've confirmed above behavior with the script to use miniruby.
begin
  "a" * 10000000000
ensure
  Thread.new(Thread.current) do |t|
    Thread.pass
    t.raise Interrupt
    sleep 0.01
    STDERR.puts "shutting down..."
    Process.kill :KILL, $$
  end
  sleep
end
        
           Updated by wanabe (_ wanabe) over 6 years ago
          
          
        
        
          
            Actions
          
          #7
            [ruby-core:92487]
          Updated by wanabe (_ wanabe) over 6 years ago
          
          
        
        
          
            Actions
          
          #7
            [ruby-core:92487]
        
      
      How about this?
diff --git a/vm.c b/vm.c
index 41064f07c3..be394b0dae 100644
--- a/vm.c
+++ b/vm.c
@@ -1895,7 +1895,7 @@ vm_exec(rb_execution_context_t *ec, int mjit_enable_p)
     }
     else {
        result = ec->errinfo;
-        rb_ec_raised_reset(ec, RAISED_STACKOVERFLOW);
+        rb_ec_raised_reset(ec, RAISED_STACKOVERFLOW | RAISED_NOMEMORY);
         while ((result = vm_exec_handle_exception(ec, state, result, &initial)) == Qundef) {
             /* caught a jump, exec the handler */
             result = vm_exec_core(ec, initial);
        
           Updated by buzztaiki (Taiki Sugawara) over 6 years ago
          
          
        
        
          
            Actions
          
          #8
            [ruby-core:92685]
          Updated by buzztaiki (Taiki Sugawara) over 6 years ago
          
          
        
        
          
            Actions
          
          #8
            [ruby-core:92685]
        
      
      Thank you for your reply!
In my environment, this patch with ruby 2.6.3 has been fixed this issue (ruby was not frozen)!!
~/opt/ruby/bug15779_v2_6_3_fix/bin/ruby ~/tmp/a.rb
#<StringIO:0x000055914e06fda8 @base_uri=#<URI::HTTPS https://www.ruby-lang.org/>, @meta={"server"=>"Cowboy", "strict-transport-security"=>"max-age=31536000", "content-type"=>"text/html", "etag"=>"W/\"91e88008983
2f50e7e7bf1371600cc60\"", "x-frame-options"=>"SAMEORIGIN", "via"=>"1.1 vegur, 1.1 varnish", "content-length"=>"424", "accept-ranges"=>"bytes", "date"=>"Thu, 16 May 2019 17:04:03 GMT", "age"=>"1082", "connection"=>"keep-alive", "x-served-by"=>"cache-hnd18751-HND", "x-cache"=>"HIT", "x-cache-hits"=>"1", "x-timer"=>"S1558026243.322961,VS0,VE0", "vary"=>"Accept-Encoding"}, @metas={"server"=>["Cowboy"], "strict-transport-security"=>["max-age=31536000"], "content-type"=>["text/html"], "etag"=>["W/\"91e880089832f50e7e7bf1371600cc60\""], "x-frame-options"=>["SAMEORIGIN"], "via"=>["1.1 vegur", "1.1 varnish"], "content-length"=>["424"], "accept-ranges"=>["bytes"], "date"=>["Thu, 16 May 2019 17:04:03 GMT"], "age"=>["1082"], "connection"=>["keep-alive"], "x-served-by"=>["cache-hnd18751-HND"], "x-cache"=>["HIT"], "x-cache-hits"=>["1"], "x-timer"=>["S1558026243.322961,VS0,VE0"], "vary"=>["Accept-Encoding"]}, @status=["200", "OK"]>
Traceback (most recent call last):
/home/taiki/tmp/a.rb: failed to allocate memory (NoMemoryError)
cat ~/tmp/a.rb
require 'open-uri'
begin
  "a" * 10000000000
ensure
  p open('https://www.ruby-lang.org/')
end
        
           Updated by jeremyevans0 (Jeremy Evans) over 4 years ago
          
          
        
        
          
            Actions
          
          #9
            [ruby-core:104321]
          Updated by jeremyevans0 (Jeremy Evans) over 4 years ago
          
          
        
        
          
            Actions
          
          #9
            [ruby-core:104321]
        
      
      This is still an issue in the master branch. I've submitted a pull request for it: https://github.com/ruby/ruby/pull/4577. The pull request uses @wanabe's fix and a test based on @buzztaiki's example.
        
           Updated by jeremyevans (Jeremy Evans) about 4 years ago
          
          
        
        
          
            Actions
          
          #10
          Updated by jeremyevans (Jeremy Evans) about 4 years ago
          
          
        
        
          
            Actions
          
          #10
        
      
      - Status changed from Open to Closed
Applied in changeset git|a6dc48f86a8ee7b1f713105b5c7136824d6b5720.
Fix infinite loop in ensure after NoMemoryError
VM patch from wanabe.
Test based on example from buzztaiki (Taiki Sugawara).
Test fails when compiles with -DRUBY_DEBUG, as that can
can use rb_bug instead of NoMemoryError, which doesn't
allow testing this case. Test also fails on MingW, as
RangeError is used instead of NoMemoryError. Skip the
test in either case.
Fixes [Bug #15779]