A pangenome graph reference of 30 chicken genomes allows genotyping of large and complex structural variants.

TitleA pangenome graph reference of 30 chicken genomes allows genotyping of large and complex structural variants.
Publication TypeJournal Article
Year of Publication2023
AuthorsRice, ES, Alberdi, A, Alfieri, J, Athrey, G, Balacco, JR, Bardou, P, Blackmon, H, Charles, M, Cheng, HH, Fedrigo, O, Fiddaman, SR, Formenti, G, Frantz, LAF, M Gilbert, TP, Hearn, CJ, Jarvis, ED, Klopp, C, Marcos, S, Mason, AS, Velez-Irizarry, D, Xu, L, Warren, WC
JournalBMC Biol
Volume21
Issue1
Pagination267
Date Published2023 Nov 22
ISSN1741-7007
KeywordsAnimals, Chickens, Genome, Genomics, Genotype, Sequence Analysis, DNA
Abstract

BACKGROUND: The red junglefowl, the wild outgroup of domestic chickens, has historically served as a reference for genomic studies of domestic chickens. These studies have provided insight into the etiology of traits of commercial importance. However, the use of a single reference genome does not capture diversity present among modern breeds, many of which have accumulated molecular changes due to drift and selection. While reference-based resequencing is well-suited to cataloging simple variants such as single-nucleotide changes and short insertions and deletions, it is mostly inadequate to discover more complex structural variation in the genome.

METHODS: We present a pangenome for the domestic chicken consisting of thirty assemblies of chickens from different breeds and research lines.

RESULTS: We demonstrate how this pangenome can be used to catalog structural variants present in modern breeds and untangle complex nested variation. We show that alignment of short reads from 100 diverse wild and domestic chickens to this pangenome reduces reference bias by 38%, which affects downstream genotyping results. This approach also allows for the accurate genotyping of a large and complex pair of structural variants at the K feathering locus using short reads, which would not be possible using a linear reference.

CONCLUSIONS: We expect that this new paradigm of genomic reference will allow better pinpointing of exact mutations responsible for specific phenotypes, which will in turn be necessary for breeding chickens that meet new sustainability criteria and are resilient to quickly evolving pathogen threats.

DOI10.1186/s12915-023-01758-0
Alternate JournalBMC Biol
PubMed ID37993882
PubMed Central IDPMC10664547
Grant List2020-67015-31574 / / National Institute of Food and Agriculture /
2022-67015-36218 / / National Institute of Food and Agriculture /
817729 / / HORIZON EUROPE Research Infrastructures /