Feature #9632: [PATCH 0/2] speedup IO#close with linked-list from ccan - Ruby - Ruby Issue Tracking System

Actions

Copy link

Feature #9632

closed

[PATCH 0/2] speedup IO#close with linked-list from ccan

Feature #9632: [PATCH 0/2] speedup IO#close with linked-list from ccan

Added by normalperson (Eric Wong) about 12 years ago. Updated almost 9 years ago.

Status:

Closed

Assignee:

ko1 (Koichi Sasada)

Target version:

2.2.0

[ruby-core:61452]

Description

This imports the ccan linked-list (BSD-MIT licensed version of the Linux kernel
linked list). I cut out some of the unused str* code (only for debugging),
but it's still a big import of new code. Modifications to existing code is
minimal, and it makes the living_threads iteration functions simpler.

The improvement is great, and there may be future places where we could
use a doubly linked list.

= vm->living_threads:

before: st hash table had extra malloc overhead, and slow iteration due
to bad cache locality
after: guaranteed O(1) insert/remove performance (branchless!)
iteration is still O(n), but performance is improved in IO#close
due to less pointer chasing

= IO#close: further improvement with second linked list

before: IO#close is linear based on number of living threads
after: IO#close is linear based on number of waiting threads

No extra malloc is needed (only 2 new pointers in existing structs)
for a secondary linked-list for waiting FDs.

I chose the ccan linked list over BSD <sys/queue.h> for two reasons:

insertion and removal are both branchless
locality is improved if a struct may be a member of multiple lists

git://80x24.org/ruby.git threads-list

Files

Download all files

0002-speedup-IO-close-with-many-living-threads.patch (2.86 KB) 0002-speedup-IO-close-with-many-living-threads.patch		normalperson (Eric Wong), 03/13/2014 03:25 AM
0001-doubly-linked-list-from-ccan-to-manage-vm-living_thr.patch (68.1 KB) 0001-doubly-linked-list-from-ccan-to-manage-vm-living_thr.patch		normalperson (Eric Wong), 03/13/2014 03:25 AM

Updated by normalperson (Eric Wong) about 12 years ago Actions
Copy link
#1 [ruby-core:61593]

normalperson@yhbt.net wrote:

0001-doubly-linked-list-from-ccan-to-manage-vm-living_thr.patch (68.1 KB)

I'll de-duplicate the CC0 declaration files if allowed to commit this.
The original had symlinks, but I assume symlinks are not allowed in this
source tree for portability.

I really like the Linux-kernel-style of linked-list.

Updated by normalperson (Eric Wong) about 12 years ago Actions
Copy link
#2 [ruby-core:61759]

Eric Wong normalperson@yhbt.net wrote:

normalperson@yhbt.net wrote:

0001-doubly-linked-list-from-ccan-to-manage-vm-living_thr.patch (68.1 KB)

I'll de-duplicate the CC0 declaration files if allowed to commit this.
The original had symlinks, but I assume symlinks are not allowed in this
source tree for portability.

Updated 0001 patch with deduplicated license files:
http://bogomips.org/ruby.git/patch?id=b5401cdc6f72

I also renamed CCAN_INCLUDES to CCAN_LIST_INCLUDES in common.mk; in case
we import other modules from ccan[1].

[1] - http://ccodearchive.net/

Updated by normalperson (Eric Wong) about 12 years ago Actions
Copy link
#3 [ruby-core:61871]

normalperson@yhbt.net wrote:

Updated 0001 patch with deduplicated license files:
http://bogomips.org/ruby.git/patch?id=b5401cdc6f72

Any comment? My main concern is it's a large import of new code;
but it is also highly reusable. I'll commit in 2-4 weeks if no response.
The 0002 patch can wait longer.

Updated by normalperson (Eric Wong) almost 12 years ago Actions
Copy link
#4 [ruby-core:62519]

Eric Wong normalperson@yhbt.net wrote:

Any comment? My main concern is it's a large import of new code;
but it is also highly reusable. I'll commit in 2-4 weeks if no response.
The 0002 patch can wait longer.

Committed as r45913. Hopefully nothing breaks, I tested extensively
on my "production" server. Sorry for the delay, was busy.

Updated by ko1 (Koichi Sasada) almost 12 years ago Actions
Copy link
#5 [ruby-core:62523]

Sorry for late response.

Just curious (I'm not against of this change).

How performance improved?
Should we modify ccan/* files? Or should we sync with originals?
What mean the name "CCAN"?

Updated by ko1 (Koichi Sasada) almost 12 years ago Actions
Copy link
#6 [ruby-core:62524]

Should we use it on compile.c?

Updated by normalperson (Eric Wong) almost 12 years ago Actions
Copy link
#7 [ruby-core:62525]

ko1@atdot.net wrote:

How performance improved?

There is less pointer chasing for iteration:

Before: st_table_entry->rb_thread_t->st_table_entry->rb_thread_t ...
After: rb_thread->rb_thread ...

This is made possible by the container_of macro.

I plan to use container_of in method/constant/symbol table, too
(ihash in Feature #9614).

Should we modify ccan/* files? Or should we sync with originals?

I probably best to sync with originals. I removed parts of
ccan/str/str.h we are not using, but we can use more of str.h later.
I may also put ihash in CCAN so other projects may use it easily.
But I am not sure about the name "ihash".

What mean the name "CCAN"?

Comprehensive C Archive Network - ccodearchive.net

Should we use it on compile.c?

Maybe. I do not know compile.c well enough...
If we can reduce allocations and pointer chasing without regressions,
we should use it.

Updated by normalperson (Eric Wong) almost 12 years ago Actions
Copy link
#8 [ruby-core:62527]

Eric Wong normalperson@yhbt.net wrote:

Before: st_table_entry->rb_thread_t->st_table_entry->rb_thread_t ...

Sorry, bad picture for Before, this is more accurate:

st_table_entry -> st_table_entry -> st_table_entry
    |                 |                 |
    V                 V                 V
rb_thread_t       rb_thread_t       rb_thread_t

Updated by akr (Akira Tanaka) almost 12 years ago Actions
Copy link
#9 [ruby-core:62556]

2014-05-11 8:50 GMT+09:00 Eric Wong normalperson@yhbt.net:

Eric Wong normalperson@yhbt.net wrote:

Any comment? My main concern is it's a large import of new code;
but it is also highly reusable. I'll commit in 2-4 weeks if no response.
The 0002 patch can wait longer.

Committed as r45913. Hopefully nothing breaks, I tested extensively
on my "production" server. Sorry for the delay, was busy.

I found that doxygen produces many warnings in ccan/ directory.
http://www.rubyist.net/~akr/chkbuild/debian/ruby-trunk/log/20140510T235500Z.diff.html.gz

It seems the comments in ccan/ directory is not doxygen-compatible.

Anyone use doxygen?
If no one use it, we can drop doxygen support.
(It makes the CI faster.)¶

Tanaka Akira

Updated by normalperson (Eric Wong) almost 12 years ago Actions
Copy link
#10 [ruby-core:62558]

Tanaka Akira akr@fsij.org wrote:

I found that doxygen produces many warnings in ccan/ directory.
http://www.rubyist.net/~akr/chkbuild/debian/ruby-trunk/log/20140510T235500Z.diff.html.gz

It seems the comments in ccan/ directory is not doxygen-compatible.

Sorry about that.

Anyone use doxygen?
If no one use it, we can drop doxygen support.
(It makes the CI faster.)

I do not use it.

We may also fix the comments to be doxygen-compatible and send patches
upstream to ccan. But if nobody uses doxygen, we save time by dropping
it.

Updated by nobu (Nobuyoshi Nakada) almost 12 years ago Actions
Copy link
#11 [ruby-core:62559]

(2014/05/13 16:29), Eric Wong wrote:

Tanaka Akira akr@fsij.org wrote:

Anyone use doxygen?
If no one use it, we can drop doxygen support.
(It makes the CI faster.)

I do not use it.

I don't use it too (it's too time consuming)

We may also fix the comments to be doxygen-compatible and send patches
upstream to ccan. But if nobody uses doxygen, we save time by dropping
it.

Or adding ccan to EXCLUDE in template/Doxyfile.tmpl.

Updated by normalperson (Eric Wong) over 11 years ago Actions
Copy link
#12 [ruby-core:65022]

ko1@atdot.net wrote:

Should we use it on compile.c?

Yes, and probably gc.c, too. I think it would help improve readability
and remove some branches in our current code.

I have submitted patches for list_add_after, list_add_before and
list_swap functions:
https://lists.ozlabs.org/pipermail/ccan/2014-September/thread.html

I think this will be next year for Ruby 2.3.

Updated by Anonymous almost 9 years ago Actions
Copy link
#13

Status changed from Open to Closed

Applied in changeset trunk|r58812.

speed up IO#close with many threads

Today, it increases IO#close performance with many threads:

Execution time (sec)
name trunk after
vm_thread_close 4.276 3.018

Speedup ratio: compare with the result of `trunk' (greater is better)
name after
vm_thread_close 1.417

This speedup comes because rb_notify_fd_close only scans threads
inside rb_thread_io_blocking_region, not all threads in the VM.

In the future, this type data structure may allow us to notify
waiters of multiple FDs on a single thread (when using
Fibers).

thread.c (struct waiting_fd): declare
(rb_thread_io_blocking_region): use on-stack list waiter
(rb_notify_fd_close): walk vm->waiting_fds instead
(call_without_gvl): remove old field setting
(th_init): ditto
vm_core.h (typedef struct rb_vm_struct): add waiting_fds list
(typedef struct rb_thread_struct): remove waiting_fd field
(rb_vm_living_threads_init): initialize waiting_fds list

I am now kicking myself for not thinking about this 3 years ago
when I introduced ccan/list in [Feature #9632] to optimize this
same function :<

Actions

Copy link

Also available in: PDF Atom

Project

General

Profile

Ruby

Custom queries

Feature #9632

[PATCH 0/2] speedup IO#close with linked-list from ccan

Updated by normalperson (Eric Wong) about 12 years ago Actions
Copy link
#1 [ruby-core:61593]

Updated by normalperson (Eric Wong) about 12 years ago Actions
Copy link
#2 [ruby-core:61759]

Updated by normalperson (Eric Wong) about 12 years ago Actions
Copy link
#3 [ruby-core:61871]

Updated by normalperson (Eric Wong) almost 12 years ago Actions
Copy link
#4 [ruby-core:62519]

Updated by ko1 (Koichi Sasada) almost 12 years ago Actions
Copy link
#5 [ruby-core:62523]

Updated by ko1 (Koichi Sasada) almost 12 years ago Actions
Copy link
#6 [ruby-core:62524]

Updated by normalperson (Eric Wong) almost 12 years ago Actions
Copy link
#7 [ruby-core:62525]

Updated by normalperson (Eric Wong) almost 12 years ago Actions
Copy link
#8 [ruby-core:62527]

Updated by akr (Akira Tanaka) almost 12 years ago Actions
Copy link
#9 [ruby-core:62556]

Anyone use doxygen?
If no one use it, we can drop doxygen support.
(It makes the CI faster.)¶

Updated by normalperson (Eric Wong) almost 12 years ago Actions
Copy link
#10 [ruby-core:62558]

Updated by nobu (Nobuyoshi Nakada) almost 12 years ago Actions
Copy link
#11 [ruby-core:62559]

Updated by normalperson (Eric Wong) over 11 years ago Actions
Copy link
#12 [ruby-core:65022]

Updated by Anonymous almost 9 years ago Actions
Copy link
#13

Project

General

Profile

Ruby

Custom queries

Feature #9632

[PATCH 0/2] speedup IO#close with linked-list from ccan

Updated by normalperson (Eric Wong) about 12 years ago ActionsCopy link #1 [ruby-core:61593]

Updated by normalperson (Eric Wong) about 12 years ago ActionsCopy link #2 [ruby-core:61759]

Updated by normalperson (Eric Wong) about 12 years ago ActionsCopy link #3 [ruby-core:61871]

Updated by normalperson (Eric Wong) almost 12 years ago ActionsCopy link #4 [ruby-core:62519]

Updated by ko1 (Koichi Sasada) almost 12 years ago ActionsCopy link #5 [ruby-core:62523]

Updated by ko1 (Koichi Sasada) almost 12 years ago ActionsCopy link #6 [ruby-core:62524]

Updated by normalperson (Eric Wong) almost 12 years ago ActionsCopy link #7 [ruby-core:62525]

Updated by normalperson (Eric Wong) almost 12 years ago ActionsCopy link #8 [ruby-core:62527]

Updated by akr (Akira Tanaka) almost 12 years ago ActionsCopy link #9 [ruby-core:62556]

Anyone use doxygen? If no one use it, we can drop doxygen support. (It makes the CI faster.)¶

Updated by normalperson (Eric Wong) almost 12 years ago ActionsCopy link #10 [ruby-core:62558]

Updated by nobu (Nobuyoshi Nakada) almost 12 years ago ActionsCopy link #11 [ruby-core:62559]

Updated by normalperson (Eric Wong) over 11 years ago ActionsCopy link #12 [ruby-core:65022]

Updated by Anonymous almost 9 years ago ActionsCopy link #13

Updated by normalperson (Eric Wong) about 12 years ago Actions
Copy link
#1 [ruby-core:61593]

Updated by normalperson (Eric Wong) about 12 years ago Actions
Copy link
#2 [ruby-core:61759]

Updated by normalperson (Eric Wong) about 12 years ago Actions
Copy link
#3 [ruby-core:61871]

Updated by normalperson (Eric Wong) almost 12 years ago Actions
Copy link
#4 [ruby-core:62519]

Updated by ko1 (Koichi Sasada) almost 12 years ago Actions
Copy link
#5 [ruby-core:62523]

Updated by ko1 (Koichi Sasada) almost 12 years ago Actions
Copy link
#6 [ruby-core:62524]

Updated by normalperson (Eric Wong) almost 12 years ago Actions
Copy link
#7 [ruby-core:62525]

Updated by normalperson (Eric Wong) almost 12 years ago Actions
Copy link
#8 [ruby-core:62527]

Updated by akr (Akira Tanaka) almost 12 years ago Actions
Copy link
#9 [ruby-core:62556]

Anyone use doxygen?
If no one use it, we can drop doxygen support.
(It makes the CI faster.)¶

Updated by normalperson (Eric Wong) almost 12 years ago Actions
Copy link
#10 [ruby-core:62558]

Updated by nobu (Nobuyoshi Nakada) almost 12 years ago Actions
Copy link
#11 [ruby-core:62559]

Updated by normalperson (Eric Wong) over 11 years ago Actions
Copy link
#12 [ruby-core:65022]

Updated by Anonymous almost 9 years ago Actions
Copy link
#13