An Integrated Approach to Locality-Conscious Processor Allocation and Scheduling of Mixed-Parallel Applications

التفاصيل البيبلوغرافية
العنوان: An Integrated Approach to Locality-Conscious Processor Allocation and Scheduling of Mixed-Parallel Applications
المؤلفون: Ümit V. Çatalyürek, N. Vydyanathan, Sriram Krishnamoorthy, Tahsin Kurc, Gerald Sabin, P. Sadayappan, Joel H. Saltz
المصدر: IEEE Transactions on Parallel and Distributed Systems. 20:1158-1172
بيانات النشر: Institute of Electrical and Electronics Engineers (IEEE), 2009.
سنة النشر: 2009
مصطلحات موضوعية: Schedule, Job shop scheduling, Data parallelism, Computer science, Distributed computing, Locality, Processor scheduling, Task parallelism, Parallel computing, Directed acyclic graph, Scheduling (computing), Computational Theory and Mathematics, Hardware and Architecture, Signal Processing, Scalability, Concurrent computing, Resource allocation, Algorithm design, Critical path method
الوصف: Complex parallel applications can often be modeled as directed acyclic graphs of coarse-grained application tasks with dependences. These applications exhibit both task and data parallelism, and combining these two (also called mixed parallelism) has been shown to be an effective model for their execution. In this paper, we present an algorithm to compute the appropriate mix of task and data parallelism required to minimize the parallel completion time (makespan) of these applications. In other words, our algorithm determines the set of tasks that should be run concurrently and the number of processors to be allocated to each task. The processor allocation and scheduling decisions are made in an integrated manner and are based on several factors such as the structure of the task graph, the runtime estimates and scalability characteristics of the tasks, and the intertask data communication volumes. A locality-conscious scheduling strategy is used to improve intertask data reuse. Evaluation through simulations and actual executions of task graphs derived from real applications and synthetic graphs shows that our algorithm consistently generates schedules with a lower makespan as compared to Critical Path Reduction (CPR) and Critical Path and Allocation (CPA), two previously proposed scheduling algorithms. Our algorithm also produces schedules that have a lower makespan than pure task- and data-parallel schedules. For task graphs with known optimal schedules or lower bounds on the makespan, our algorithm generates schedules that are closer to the optima than other scheduling approaches.
تدمد: 1045-9219
URL الوصول: https://explore.openaire.eu/search/publication?articleId=doi_________::1ed70e72e025515f0ddf1564d9c4755f
https://doi.org/10.1109/tpds.2008.219
حقوق: CLOSED
رقم الأكسشن: edsair.doi...........1ed70e72e025515f0ddf1564d9c4755f
قاعدة البيانات: OpenAIRE