Leveraging Apache Arrow for Zero-copy, Zero-serialization Cluster Shared Memory

Bibliographic Details
Title: Leveraging Apache Arrow for Zero-copy, Zero-serialization Cluster Shared Memory
Authors: Groet, Philip; Hoozemans, Joost; Grapentin, Andreas; Eberhardt, Felix; Al-Ars, Zaid; Hofstee, H. Peter
Publication Year: 2024
Collection: Computer Science
Subject Terms: Computer Science - Emerging Technologies
Description: This paper describes a distributed implementation of Apache Arrow that can leverage cluster-shared load-store addressable memory that is hardware-coherent only within each node. The implementation is built on the ThymesisFlow prototype that leverages the OpenCAPI interface to create a shared address space across a cluster. While Apache Arrow structures are immutable, simplifying their use in a cluster shared memory, this paper creates distributed Apache Arrow tables and makes them accessible in each node.
Comment: Presented at the 3rd Workshop on Heterogeneous Composable and Disaggregated Systems (HCDS 2024)
Document Type: Working Paper
Access URL: http://arxiv.org/abs/2404.03030
Accession Number: edsarx.2404.03030
Database: arXiv