To use Ruby for data processing widely, Apache Arrow support is important.
We can do the followings with Apache Arrow:
- Super fast large data interchange and processing
- Reading/writing data in several famous formats such as CSV and Apache Parquet
- Reading/writing partitioned large data on cloud storage such as Amazon S3
This talk describes the followings:
- What is Apache Arrow
- How to use Apache Arrow with Ruby
- How to integrate with Ruby 3.0 features such as MemoryView and Ractor
RubyKaigi Takeout 2021: https://rubykaigi.org/2021-takeout/presentations/ktou.html
RubyKaigi 2021 Takeout