Serving Hybrid-Cloud SQL Interactive Queries at Twitter

التفاصيل البيبلوغرافية
العنوان: Serving Hybrid-Cloud SQL Interactive Queries at Twitter
المؤلفون: Tang, Chunxu, Wang, Beinan, Wu, Huijun, Wang, Zhenzhao, Li, Yao, Channapattan, Vrushali, Luo, Zhenxiao, Kabra, Ruchin, Ghosh, Mainak, Navadiya, Nikhil Kantibhai, Mishra, Prachi, Mukhedkar, Prateek, Lu, Anneliese
سنة النشر: 2022
المجموعة: Computer Science
مصطلحات موضوعية: Computer Science - Databases
الوصف: The demand for data analytics has been consistently increasing in the past years at Twitter. In order to fulfill the requirements and provide a highly scalable and available query experience, a large-scale in-house SQL system is heavily relied on. Recently, we evolved the SQL system into a hybrid-cloud SQL federation system, compliant with Twitter's Partly Cloudy strategy. The hybrid-cloud SQL federation system is capable of processing queries across Twitter's data centers and the public cloud, interacting with around 10PB of data per day. In this paper, the design of the hybrid-cloud SQL federation system is presented, which consists of query, cluster, and storage federations. We identify challenges in a modern SQL system and demonstrate how our system addresses them with some important design decisions. We also conduct qualitative examinations and summarize instructive lessons learned from the development and operation of such a SQL system.
Comment: Submitted to ECSA 2021 post-proceedings
نوع الوثيقة: Working Paper
DOI: 10.1007/978-3-031-15116-3_1
URL الوصول: http://arxiv.org/abs/2207.04199
رقم الأكسشن: edsarx.2207.04199
قاعدة البيانات: arXiv
الوصف
DOI:10.1007/978-3-031-15116-3_1