دورية أكاديمية

A Chado case study: an ontology-based modular schema for representing genome-associated biological information.

التفاصيل البيبلوغرافية
العنوان: A Chado case study: an ontology-based modular schema for representing genome-associated biological information.
المؤلفون: Mungall CJ; Lawrence Berkeley National Laboratory, Lawrence Berkeley National Lab, Mail Stop 64R0121, Berkeley, CA 94720, USA. cjm@fruitfly.org, Emmert DB
مؤلفون مشاركون: FlyBase Consortium
المصدر: Bioinformatics (Oxford, England) [Bioinformatics] 2007 Jul 01; Vol. 23 (13), pp. i337-46.
نوع المنشور: Journal Article; Research Support, N.I.H., Extramural; Research Support, Non-U.S. Gov't
اللغة: English
بيانات الدورية: Publisher: Oxford University Press Country of Publication: England NLM ID: 9808944 Publication Model: Print Cited Medium: Internet ISSN: 1367-4811 (Electronic) Linking ISSN: 13674803 NLM ISO Abbreviation: Bioinformatics Subsets: MEDLINE
أسماء مطبوعة: Original Publication: Oxford : Oxford University Press, c1998-
مواضيع طبية MeSH: Algorithms* , Database Management Systems* , Databases, Genetic* , User-Computer Interface*, Chromosome Mapping/*methods , Computational Biology/*methods , Information Storage and Retrieval/*methods, Systems Integration
مستخلص: Motivation: A few years ago, FlyBase undertook to design a new database schema to store Drosophila data. It would fully integrate genomic sequence and annotation data with bibliographic, genetic, phenotypic and molecular data from the literature representing a distillation of the first 100 years of research on this major animal model system. In developing this new integrated schema, FlyBase also made a commitment to ensure that its design was generic, extensible and available as open source, so that it could be employed as the core schema of any model organism data repository, thereby avoiding redundant software development and potentially increasing interoperability. Our question was whether we could create a relational database schema that would be successfully reused.
Results: Chado is a relational database schema now being used to manage biological knowledge for a wide variety of organisms, from human to pathogens, especially the classes of information that directly or indirectly can be associated with genome sequences or the primary RNA and protein products encoded by a genome. Biological databases that conform to this schema can interoperate with one another, and with application software from the Generic Model Organism Database (GMOD) toolkit. Chado is distinctive because its design is driven by ontologies. The use of ontologies (or controlled vocabularies) is ubiquitous across the schema, as they are used as a means of typing entities. The Chado schema is partitioned into integrated subschemas (modules), each encapsulating a different biological domain, and each described using representations in appropriate ontologies. To illustrate this methodology, we describe here the Chado modules used for describing genomic sequences.
Availability: GMOD is a collaboration of several model organism database groups, including FlyBase, to develop a set of open-source software for managing model organism data. The Chado schema is freely distributed under the terms of the Artistic License (http://www.opensource.org/licenses/artistic-license.php) from GMOD (www.gmod.org).
معلومات مُعتمدة: G0500293 United Kingdom MRC_ Medical Research Council; 5P41 HG000739 United States HG NHGRI NIH HHS
تواريخ الأحداث: Date Created: 20070725 Date Completed: 20070827 Latest Revision: 20220129
رمز التحديث: 20240628
DOI: 10.1093/bioinformatics/btm189
PMID: 17646315
قاعدة البيانات: MEDLINE
الوصف
تدمد:1367-4811
DOI:10.1093/bioinformatics/btm189