دورية أكاديمية

Wrangling distributed computing for high-throughput environmental science: An introduction to HTCondor.

التفاصيل البيبلوغرافية
العنوان: Wrangling distributed computing for high-throughput environmental science: An introduction to HTCondor.
المؤلفون: Erickson RA; Upper Midwest Environmental Sciences Center, United States Geological Survey, La Crosse, Wisconsin, United States of America., Fienen MN; Wisconsin Water Science Center, United States Geological Survey, Middelton, Wisconsin, United States of America., McCalla SG; Upper Midwest Environmental Sciences Center, United States Geological Survey, La Crosse, Wisconsin, United States of America., Weiser EL; Upper Midwest Environmental Sciences Center, United States Geological Survey, La Crosse, Wisconsin, United States of America., Bower ML; Upper Midwest Environmental Sciences Center, United States Geological Survey, La Crosse, Wisconsin, United States of America., Knudson JM; Upper Midwest Environmental Sciences Center, United States Geological Survey, La Crosse, Wisconsin, United States of America., Thain G; Department of Computer Science, University of Wisconsin-Madison, Madison, Winconsin, United States of America.
المصدر: PLoS computational biology [PLoS Comput Biol] 2018 Oct 03; Vol. 14 (10), pp. e1006468. Date of Electronic Publication: 2018 Oct 03 (Print Publication: 2018).
نوع المنشور: Journal Article; Research Support, U.S. Gov't, Non-P.H.S.
اللغة: English
بيانات الدورية: Publisher: Public Library of Science Country of Publication: United States NLM ID: 101238922 Publication Model: eCollection Cited Medium: Internet ISSN: 1553-7358 (Electronic) Linking ISSN: 1553734X NLM ISO Abbreviation: PLoS Comput Biol Subsets: MEDLINE
أسماء مطبوعة: Original Publication: San Francisco, CA : Public Library of Science, [2005]-
مواضيع طبية MeSH: Computational Biology* , Computing Methodologies* , Ecology* , Software*, High-Throughput Screening Assays ; Humans ; Internet ; Research
مستخلص: Biologists and environmental scientists now routinely solve computational problems that were unimaginable a generation ago. Examples include processing geospatial data, analyzing -omics data, and running large-scale simulations. Conventional desktop computing cannot handle these tasks when they are large, and high-performance computing is not always available nor the most appropriate solution for all computationally intense problems. High-throughput computing (HTC) is one method for handling computationally intense research. In contrast to high-performance computing, which uses a single "supercomputer," HTC can distribute tasks over many computers (e.g., idle desktop computers, dedicated servers, or cloud-based resources). HTC facilities exist at many academic and government institutes and are relatively easy to create from commodity hardware. Additionally, consortia such as Open Science Grid facilitate HTC, and commercial entities sell cloud-based solutions for researchers who lack HTC at their institution. We provide an introduction to HTC for biologists and environmental scientists. Our examples from biology and the environmental sciences use HTCondor, an open source HTC system.
Competing Interests: The authors have declared that no competing interests exist.
References: J Integr Bioinform. 2015 May 20;12(1):255. (PMID: 26527189)
Front Genet. 2011 Feb 24;2:4. (PMID: 22303303)
Proc Natl Acad Sci U S A. 2004 Aug 24;101(34):12422-7. (PMID: 15314227)
Ground Water. 2015 Mar-Apr;53(2):180-4. (PMID: 25644169)
Science. 2000 Feb 18;287(5456):1221, 1223. (PMID: 10712158)
PLoS Comput Biol. 2016 Jun 07;12(6):e1004867. (PMID: 27271528)
Ground Water. 2009 Nov-Dec;47(6):835-44. (PMID: 19486167)
Science. 1992 Apr 3;256(5053):44-7. (PMID: 17802588)
تواريخ الأحداث: Date Created: 20181004 Date Completed: 20190128 Latest Revision: 20190128
رمز التحديث: 20231215
مُعرف محوري في PubMed: PMC6169842
DOI: 10.1371/journal.pcbi.1006468
PMID: 30281592
قاعدة البيانات: MEDLINE
الوصف
تدمد:1553-7358
DOI:10.1371/journal.pcbi.1006468