Bug #20286
closedTracePoint does not emit `thread_end` event when thread exits with exception
Description
Using TracePoint to trace thread_begin
and thread_end
events fails to emit the thread_end
event when an exception (e.g., Interrupt) is raised within a thread. This behavior occurs because the exception handling bypasses the internal thread finishing logic, including trace point and fiber scheduler cleanup code. This issue affects the ability to accurately monitor thread lifecycle events in scenarios involving exception handling or abrupt thread terminations.
Steps to Reproduce:¶
- Set up
TracePoint
to tracethread_begin
andthread_end
events. - Create a new thread that raises an exception.
- Join the thread and observe that only the
thread_begin
event is emitted without a correspondingthread_end
event.
Example Code¶
TracePoint.trace(:thread_begin, :thread_end) do |tp|
p [tp.event, tp.lineno, tp.path, tp.defined_class, tp.method_id]
end
thread = Thread.new do
raise Interrupt
end
thread.join
Current Behavior:¶
The TracePoint
emits the thread_begin
event but fails to emit the thread_end
event when an exception is raised within the thread, indicating an incomplete tracing of thread lifecycle events.
I've confirmed this as far back as Ruby 2.6.
> ruby ./test.rb
[:thread_begin, 0, nil, nil, nil]
#<Thread:0x000000010384b5a8 ./test.rb:5 run> terminated with exception (report_on_exception is true):
./test.rb:6:in `block in <main>': Interrupt (Interrupt)
./test.rb:6:in `block in <main>': Interrupt (Interrupt)
Expected Behavior:¶
The TracePoint
should emit both thread_begin
and thread_end
events, accurately reflecting the lifecycle of the thread, even when an exception is raised within the thread.
> ruby ./test.rb
[:thread_begin, 0, nil, nil, nil]
[:thread_end, 0, nil, nil, nil]
#<Thread:0x0000000105435db8 ./test.rb:5 run> terminated with exception (report_on_exception is true):
./test.rb:6:in 'block in <main>': Interrupt (Interrupt)
./test.rb:6:in 'block in <main>': Interrupt (Interrupt)
Possible Fix¶
Changing the implementation of thread_do_start
to have what amounts to an "ensure" block.
static void
thread_do_start(rb_thread_t *th)
{
native_set_thread_name(th);
VALUE result = Qundef;
rb_execution_context_t *ec = th->ec;
int state;
EXEC_EVENT_HOOK(ec, RUBY_EVENT_THREAD_BEGIN, th->self, 0, 0, 0, Qundef);
EC_PUSH_TAG(ec);
if ((state = EC_EXEC_TAG()) == TAG_NONE) {
switch (th->invoke_type) {
case thread_invoke_type_proc:
result = thread_do_start_proc(th);
break;
case thread_invoke_type_ractor_proc:
result = thread_do_start_proc(th);
rb_ractor_atexit(th->ec, result);
break;
case thread_invoke_type_func:
result = (*th->invoke_arg.func.func)(th->invoke_arg.func.arg);
break;
case thread_invoke_type_none:
rb_bug("unreachable");
}
}
EC_POP_TAG();
VALUE errinfo = ec->errinfo;
if (!NIL_P(errinfo) && !RB_TYPE_P(errinfo, T_OBJECT)) {
ec->errinfo = Qnil;
}
rb_fiber_scheduler_set(Qnil);
EXEC_EVENT_HOOK(th->ec, RUBY_EVENT_THREAD_END, th->self, 0, 0, 0, Qundef);
ec->errinfo = errinfo;
if (state)
EC_JUMP_TAG(ec, state);
th->value = result;
}
It's possible rb_fiber_scheduler_set(Qnil);
can emit an exception itself. How do we write the code to handle that case?
Updated by ioquatix (Samuel Williams) 9 months ago
Proposed fix: https://github.com/ruby/ruby/pull/10039
Updated by ioquatix (Samuel Williams) 9 months ago
- Assignee set to ioquatix (Samuel Williams)
Updated by ioquatix (Samuel Williams) 9 months ago
- Status changed from Open to Closed
- Backport changed from 3.0: UNKNOWN, 3.1: UNKNOWN, 3.2: UNKNOWN, 3.3: UNKNOWN to 3.1: REQUIRED, 3.2: REQUIRED, 3.3: REQUIRED
Updated by ioquatix (Samuel Williams) 9 months ago
- Backport changed from 3.1: REQUIRED, 3.2: REQUIRED, 3.3: REQUIRED to 3.0: REQUIRED, 3.1: REQUIRED, 3.2: REQUIRED, 3.3: REQUIRED
We can backport this to Ruby 3.0, however the PR will be a bit different. I am okay if we choose not to backport to 3.0 since it is EOL soon.
Updated by k0kubun (Takashi Kokubun) 6 months ago
- Backport changed from 3.0: REQUIRED, 3.1: REQUIRED, 3.2: REQUIRED, 3.3: REQUIRED to 3.0: REQUIRED, 3.1: REQUIRED, 3.2: REQUIRED, 3.3: DONE
ruby_3_3 a8b2317d16fa172edd3cd7e6fcb3bc694287d109 merged revision(s) 78d9fe69479d32214a52ad7291c3973f1b6b7f6f.
Updated by nagachika (Tomoyuki Chikanaga) 5 months ago
- Backport changed from 3.0: REQUIRED, 3.1: REQUIRED, 3.2: REQUIRED, 3.3: DONE to 3.0: REQUIRED, 3.1: REQUIRED, 3.2: DONE, 3.3: DONE
ruby_3_2 89de66dbb0d8454c9d69faa331d6e35f8b315cce merged revision(s) 78d9fe69479d32214a52ad7291c3973f1b6b7f6f, 04729fe68dceddab045be7324e26c2bb15aa62c7.