Building a sequence map of the pig pan-genome from multiple de novoassemblies and Hi-C data

التفاصيل البيبلوغرافية
العنوان: Building a sequence map of the pig pan-genome from multiple de novoassemblies and Hi-C data
المؤلفون: Tian, Xiaomeng, Li, Ran, Fu, Weiwei, Li, Yan, Wang, Xihong, Li, Ming, Du, Duo, Tang, Qianzi, Cai, Yudong, Long, Yiming, Zhao, Yue, Li, Mingzhou, Jiang, Yu
المصدر: SCIENCE CHINA Life Sciences; 20240101, Issue: Preprints p1-14, 14p
مستخلص: Pigs were domesticated independently in the Near East and China, indicating that a single reference genome from one individual is unable to represent the full spectrum of divergent sequences in pigs worldwide. Therefore, 12 de novopig assemblies from Eurasia were compared in this study to identify the missing sequences from the reference genome. As a result, 72.5 Mb of non-redundant sequences (∼3% of the genome) were found to be absent from the reference genome (Sscrofa11.1) and were defined as pan-sequences. Of the pan-sequences, 9.0 Mb were dominant in Chinese pigs, in contrast with their low frequency in European pigs. One sequence dominant in Chinese pigs contained the complete genic region of the tazarotene-induced gene 3 (TIG3) gene which is involved in fatty acid metabolism. Using flanking sequences and Hi-C based methods, 27.7% of the sequences could be anchored to the reference genome. The supplementation of these sequences could contribute to the accurate interpretation of the 3D chromatin structure. A web-based pan-genome database was further provided to serve as a primary resource for exploration of genetic diversity and promote pig breeding and biomedical research.
قاعدة البيانات: Supplemental Index
الوصف
تدمد:16747305
18691889
DOI:10.1007/s11427-019-9551-7