Design and Implementation of Scheduling Pool Scheduling Algorithm Based on Reuse of Jobs in Spark

Bibliographic Details
Title: Design and Implementation of Scheduling Pool Scheduling Algorithm Based on Reuse of Jobs in Spark
Authors: Yan Zhou, Huang Chao-Qiang, Tang Jian-Chao, Yang Shu-qiang
Source: DSC
Publication Information: IEEE, 2016.
Publication Year: 2016
Subject Terms: Rate-monotonic scheduling, Earliest deadline first scheduling, Computer science, Distributed computing, 020206 networking & telecommunications, 02 engineering and technology, Dynamic priority scheduling, Flow shop scheduling, Reuse, computer.software_genre, Fair-share scheduling, Multiprocessor scheduling, 020401 chemical engineering, Two-level scheduling, 0202 electrical engineering, electronic engineering, information engineering, Operating system, 0204 chemical engineering, computer
Description: As an in-memory distributed computing framework, Spark is being adopted by a growing number of enterprises. Spark generally runs in multi-user, multi-job mode, in which a large amount of reuse may exist among jobs. Reuse here refers to the reuse of computation within jobs, and it can greatly shorten job execution time in Spark. This paper therefore proposes a scheduling pool scheduling algorithm based on reuse of jobs. The algorithm builds on Spark's original scheduling pool scheduling algorithm and takes advantage of the reusable parts. Experiments show that the new scheduling algorithm achieves reuse of jobs and improves the execution efficiency of the cluster.
Access URL: https://explore.openaire.eu/search/publication?articleId=doi_________::b5d9c428f786c0a6c9e8ea6f085e4572
https://doi.org/10.1109/dsc.2016.81
Accession Number: edsair.doi...........b5d9c428f786c0a6c9e8ea6f085e4572
Database: OpenAIRE