Project

General

Profile

Actions

Feature #20394

closed

Add an offset parameter to `String#to_i`

Added by byroot (Jean Boussier) about 1 month ago. Updated about 1 month ago.

Status:
Closed
Assignee:
-
Target version:
-
[ruby-core:117324]

Description

Context

I maintain the redis-client gem, and it comes with an optional swapable implementation in C that binds the hiredis C client, which used to performs up to 5 times faster in some cases.

I recently paired with @tenderlovemaking (Aaron Patterson) to try to close this gap, or even try to make the pure Ruby version faster, and we came up with several optimizations that now almost make both version on par (assuming YJIT is enabled).

An important source of performance loss, is that the Redis protocol is line based and to parse it in Ruby requires to slice a lot of small strings from the buffer. To give an example, here's how an Array with two String (["foo", "plop"]) is serialized in RESP3 (Redis protocol):

*2\r\n
$3\r\n
foo\r\n
$4\r\n
plop\r\n

From this you can understand that a big hotspot in the parser is essentially Integer(gets).

With @tenderlovemaking (Aaron Patterson) we managed to get a fairly significant perf boost by avoiding these string allocation using String#getbyte and basically implementing a rudimentary String#to_i(offset: ) in Ruby.

But while the gains are huge with YJIT enabled, they are much more tame with the interpreter. And it feels a bit wrong to have to implement this sorts of things for performance reasons.

String#to_i(offset: )

Similar to String#unpack(offset:) ([Feature #18254]), I believe String#to_i(offset: ) would be useful.

Alternative new String#unpack format

Another possibility would be to add a new format to String#pack String#unpack for decimal numbers. It sounds a bit weird at first, but given it supports things like Base64 and hexadecimal, perhaps it's not that much of a stretch?

Actions

Also available in: Atom PDF

Like0
Like1Like0Like0Like1Like0Like0Like0Like0Like0Like0Like0Like0Like0Like0Like0