Bug #13526
closedSegmentation fault at 0x0055c2e58e8920 ruby 2.3.1p112 (2016-04-26 revision 54768) [x86_64-linux]
Description
see attached log
Files
Updated by wanabe (_ wanabe) over 7 years ago
vlad-leonov (Vladislav Leonov) wrote:
-- Ruby level backtrace information ----------------------------------------
/home/vlad/.rbenv/versions/2.3.1/lib/ruby/gems/2.3.0/gems/puma-3.8.2/lib/puma/thread_pool.rb:120:in block in spawn_thread'
Ruby level backtrace shows there are some threads.
-- C level backtrace information -------------------------------------------
/home/vlad/.rbenv/versions/2.3.1/bin/ruby(rb_vm_bugreport+0x535) [0x55c2e5985175] vm_dump.c:688
/home/vlad/.rbenv/versions/2.3.1/bin/ruby(rb_bug_context+0xd1) [0x55c2e5961f71] error.c:435
/home/vlad/.rbenv/versions/2.3.1/bin/ruby(sigsegv+0x3e) [0x55c2e585c3ee] signal.c:890
/lib/x86_64-linux-gnu/libpthread.so.0 [0x7fb325847390]
/home/vlad/.rbenv/versions/2.3.1/bin/ruby(autoload_reset+0xa3) [0x55c2e58acd43] variable.c:2129
/home/vlad/.rbenv/versions/2.3.1/bin/ruby(rb_ensure+0xf0) [0x55c2e57991f0] eval.c:906
/home/vlad/.rbenv/versions/2.3.1/bin/ruby(rb_autoload_load+0x156) [0x55c2e58b3156] variable.c:2190
/home/vlad/.rbenv/versions/2.3.1/bin/ruby(rb_const_get_0.constprop.24+0x104) [0x55c2e58b3304] variable.c:2243
C level backtrace suggests that the issue may be related to autoload.
I found same issue on rails/rails repository. https://github.com/rails/rails/issues/27455
Here is a reduced script.
This code causes SEGV with a 60-70% possibility as far as I see.
sleep if $load
$load = true
n = 10
threads = Array.new(n) do
Thread.new do
begin
autoload :Foo, "#{File.dirname($0)}/#{$0}"
Thread.pass
Foo
ensure
Thread.pass
end
end
end
Thread.pass while threads.all?(&:stop?)
100.times { Thread.pass }
Updated by usa (Usaku NAKAMURA) over 7 years ago
Updated by usa (Usaku NAKAMURA) over 7 years ago
It seems that the HEAD of ruby_2_3 still has this problem but trunk nor ruby_2_4 don't.
Updated by wanabe (_ wanabe) over 7 years ago
- File bug13526.log bug13526.log added
I'm sorry to reproduce with trunk at my environment.
Log attached.
Updated by usa (Usaku NAKAMURA) over 7 years ago
- Backport changed from 2.2: UNKNOWN, 2.3: REQUIRED, 2.4: UNKNOWN to 2.2: UNKNOWN, 2.3: REQUIRED, 2.4: REQUIRED
wanabe (_ wanabe) wrote:
I'm sorry to reproduce with trunk at my environment.
Oh, thank you.
Updated by wanabe (_ wanabe) over 7 years ago
- File bug13526_dp.log bug13526_dp.log added
- File bug13526_dp.patch bug13526_dp.patch added
git bisect
shows that this issue is appeared at r52332 [Bug #11384].
I ran https://bugs.ruby-lang.org/issues/13526#note-1 script with attached debug-print patch and got the log.
- Thread 0x000055eacc7089d0 had "waitq: 0x7f737581fa18" on the stack. (line 4)
- Thread 0x000055eacc7089d0 was killed and its stack was shrinked. (line 20)
-
rb_autoload_reset()
tried to delete "waitq: 0x7f737581fa18" from list. (line 22) - But the node have been out of stack and corrupted!
And the log alerts us that check_autoload_required()
returns inconsistent values.
I guess this is caused by the flow: rb_autoload_str()
-> rb_const_set(mod, id, Qundef)
-> const_tbl_update()
-> autoload_delete()
-> st_delete(RCLASS_IV_TBL(mod), &autoload, &val)
.
(But maybe other issue...)
Updated by normalperson (Eric Wong) over 7 years ago
s.wanabe@gmail.com wrote:
git bisect
shows that this issue is appeared at r52332 [Bug #11384].I ran https://bugs.ruby-lang.org/issues/13526#note-1 script with attached debug-print patch and got the log.
Thank you! Fix coming.
Updated by Anonymous over 7 years ago
- Status changed from Open to Closed
Applied in changeset trunk|r58587.
variable.c: cleanup waitq upon thread death
- variable.c (autoload_reset): use idempotent list_del_init
(autoload_sleep): moved code from rb_autoload_load
(autoload_sleep_done): cleanup for use with rb_ensure
(rb_autoload_load): ensure list delete happens in case the
thread dies during sleep - test/ruby/bug-13526.rb: new script for separate execution
- test/ruby/test_autoload.rb (test_bug_13526): new test
[ruby-core:81016] [Bug #13526]
Updated by normalperson (Eric Wong) over 7 years ago
Eric Wong normalperson@yhbt.net wrote:
s.wanabe@gmail.com wrote:
git bisect
shows that this issue is appeared at r52332 [Bug #11384].I ran https://bugs.ruby-lang.org/issues/13526#note-1 script with attached debug-print patch and got the log.
Thank you! Fix coming.
OK, committed as r58587 with that script.
Everyone: In the future, feel free to Cc: me regarding any
problems I may have (even tangentially) caused. I am on many
mailing lists so I prioritize stuff I'm Cc-ed on. Thanks.
Updated by wanabe (_ wanabe) over 7 years ago
normalperson (Eric Wong) wrote:
OK, committed as r58587 with that script.
Thank you!
It works perfectly as far as I can see.
Everyone: In the future, feel free to Cc: me regarding any
problems I may have (even tangentially) caused. I am on many
mailing lists so I prioritize stuff I'm Cc-ed on. Thanks.
Sorry. I've commented by Redmine with my web browser, so I could not Cc: you.
So I add you as the watcher. Does Redmine Cc this comment to you?
Updated by normalperson (Eric Wong) over 7 years ago
s.wanabe@gmail.com wrote:
normalperson (Eric Wong) wrote:
OK, committed as r58587 with that script.
Thank you!
It works perfectly as far as I can see.
Thank you for confirming. Your small reproduction script was
very helpful.
Everyone: In the future, feel free to Cc: me regarding any
problems I may have (even tangentially) caused. I am on many
mailing lists so I prioritize stuff I'm Cc-ed on. Thanks.Sorry. I've commented by Redmine with my web browser, so I could not Cc: you.
So I add you as the watcher. Does Redmine Cc this comment to you?
Yes, adding as redmine watcher works for me. Thanks.
Updated by nagachika (Tomoyuki Chikanaga) over 7 years ago
- Backport changed from 2.2: UNKNOWN, 2.3: REQUIRED, 2.4: REQUIRED to 2.2: UNKNOWN, 2.3: REQUIRED, 2.4: DONE
ruby_2_4 r58637 merged revision(s) 58587,58588.
Updated by usa (Usaku NAKAMURA) over 7 years ago
- Backport changed from 2.2: UNKNOWN, 2.3: REQUIRED, 2.4: DONE to 2.2: UNKNOWN, 2.3: DONE, 2.4: DONE
ruby_2_3 r59219 merged revision(s) 58587,58588.
Updated by Eregon (Benoit Daloze) about 7 years ago
Note that the original example only works if it's a file under the current directory invoked like "ruby test.rb",
Otherwise "#{File.dirname($0)}/#{$0}"
does not refer to the current file.
I fixed it in r60854, to use File.expand_path(__FILE__)
.
The bug still happens on 2.4.1 whether the file exists or not, so it seems fine to test the case where the file does exist.
The bug seems caused by the interaction of autoload threads and the shutdown triggered by the main thread.