Feature #13767: add something like python's buffer protocol to share memory between different narray like classes - Ruby - Ruby Issue Tracking System

Actions

Copy link

Feature #13767

closed

add something like python's buffer protocol to share memory between different narray like classes

Feature #13767: add something like python's buffer protocol to share memory between different narray like classes

Added by dsisnero (Dominic Sisneros) about 9 years ago. Updated almost 6 years ago.

Status:

Closed

Assignee:

Target version:

[ruby-core:<unknown>]

Description

In order for ruby to be used in more scientific or machine learning applications, it will be necessary to be able to use more memory efficient data structures. Python has a concept called Buffer Protocol that allows different representations to utilize the memory without copying the date with memory views. This is a proposal to add something similar to ruby as a c-api

https://jakevdp.github.io/blog/2014/05/05/introduction-to-the-python-buffer-protocol/

The Python buffer protocol, also known in the community as PEP 3118, is a framework in which Python objects can expose raw byte arrays to other Python objects. This can be extremely useful for scientific computing, where we often use packages such as NumPy to efficiently store and manipulate large arrays of data. Using the buffer protocol, we can let multiple objects efficiently manipulate views of the same data buffers, without having to make copies of the often large datasets.

Here, for example, we'll use Python's built-in array object to create an array:

In [1]:
import array
A = array.array('i', range(10))
A
Out[1]:
array('i', [0, 1, 2, 3, 4, 5, 6, 7, 8, 9])
Note that the array object is different than a Python list, in that it stores the data as a contiguous block of integers. For this reason, the data are stored much more compactly than a list, and certain operations can be performed much more quickly.

array objects by themselves are not particularly useful, but a similar type of object can be found in the numpy array. Because both Python's array and NumPy's ndarray objects implement the buffer protocol, it's possible to seamlessly pass data between them using views – that is, without the need to copy the raw data:

Related issues 1 (0 open — 1 closed)

Updated by shevegen (Robert A. Heiler) about 9 years ago Actions
Copy link
#1 [ruby-core:82180]

I am not sure I understand the proposal, also partially due to python's confusing naming scheme such as List and arrays. The API is also abysmal ... array.array().

I have nothing at all against anything that makes ruby more useful in scientific applications though. Perhaps we can get some ruby people that work on the sci-ruby and similar project to comment here. https://github.com/SciRuby

I think that actually one reason or ruby to not be used as much as python is that python has more developers in the scientific world. I have seen lots of people writing C++ software and then adding of course only one language, which tends to be python. This may have been perl in the past but python seemed to have gained the foothold some years ago.

Updated by dsisnero (Dominic Sisneros) almost 9 years ago Actions
Copy link
#2 [ruby-core:83199]

The naming theme is not what is important. What is important is to have an api to share memory buffers between libraries.

SEE: https://jeffknupp.com/blog/2017/09/15/python-is-the-fastest-growing-programming-language-due-to-a-feature-youve-never-heard-of/

Python's Buffer Protocol: The #1 Reason Python Is The Fastest Growing Programming Language Today
The buffer protocol was (and still is) an extremely low-level API for direct manipulation of memory buffers by other libraries. These are buffers created and used by the interpreter to store certain types of data (initially, primarily "array-like" structures where the type and size of data was known ahead of time) in contiguous memory.

The primary motivation for providing such an API is to eliminate the need to copy data when only reading, clarify ownership semantics of the buffer, and to store the data in contiguous memory (even in the case of multi-dimensional data structures), where read access is extremely fast. Those "other libraries" that would make use of the API would almost certainly be written in C and highly performance sensitive. The new protocol meant that if I create a NumPy array of ints, other libraries can directly access the underlying memory buffer rather than requiring indirection or, worse, copying of that data before it can be used.

Updated by mrkn (Kenta Murata) about 6 years ago Actions
Copy link
#3

Related to Feature #14722: python's buffer protocol clone added

Updated by mrkn (Kenta Murata) almost 6 years ago Actions
Copy link
#4 [ruby-core:100144]

Status changed from Open to Closed

Fixed by https://github.com/ruby/ruby/commit/890bc2cdde4097390f3b71dfeaa36dd92ee0afe2

Updated by Eregon (Benoit Daloze) almost 6 years ago Actions
Copy link
#5 [ruby-core:100188]

@mrkn (Kenta Murata) Is the native memory layout introduced in Ruby compatible with Python?
If it is/was it might be useful to e.g. be able to access a NumPy array from Ruby without copying (e.g. in PyCall.rb)

Actions

Copy link

Also available in: PDF Atom

Project

General

Profile

Ruby

Custom queries

Feature #13767

add something like python's buffer protocol to share memory between different narray like classes

Updated by shevegen (Robert A. Heiler) about 9 years ago Actions
Copy link
#1 [ruby-core:82180]

Updated by dsisnero (Dominic Sisneros) almost 9 years ago Actions
Copy link
#2 [ruby-core:83199]

Updated by mrkn (Kenta Murata) about 6 years ago Actions
Copy link
#3

Updated by mrkn (Kenta Murata) almost 6 years ago Actions
Copy link
#4 [ruby-core:100144]

Updated by Eregon (Benoit Daloze) almost 6 years ago Actions
Copy link
#5 [ruby-core:100188]

Project

General

Profile

Ruby

Custom queries

Feature #13767

add something like python's buffer protocol to share memory between different narray like classes

Updated by shevegen (Robert A. Heiler) about 9 years ago ActionsCopy link #1 [ruby-core:82180]

Updated by dsisnero (Dominic Sisneros) almost 9 years ago ActionsCopy link #2 [ruby-core:83199]

Updated by mrkn (Kenta Murata) about 6 years ago ActionsCopy link #3

Updated by mrkn (Kenta Murata) almost 6 years ago ActionsCopy link #4 [ruby-core:100144]

Updated by Eregon (Benoit Daloze) almost 6 years ago ActionsCopy link #5 [ruby-core:100188]

Updated by shevegen (Robert A. Heiler) about 9 years ago Actions
Copy link
#1 [ruby-core:82180]

Updated by dsisnero (Dominic Sisneros) almost 9 years ago Actions
Copy link
#2 [ruby-core:83199]

Updated by mrkn (Kenta Murata) about 6 years ago Actions
Copy link
#3

Updated by mrkn (Kenta Murata) almost 6 years ago Actions
Copy link
#4 [ruby-core:100144]

Updated by Eregon (Benoit Daloze) almost 6 years ago Actions
Copy link
#5 [ruby-core:100188]