The Genome of C57BL/6J "Eve", the Mother of the Laboratory Mouse Genome Reference Strain.

Vishal Kumar Sarsani, The Jackson Laboratory
Narayanan Raghupathy, The Jackson Laboratory
Ian T Fiddes
Joel Armstrong
Francoise Thibaud-Nissen
Oraya Zinder, The Jackson Laboratory
Mohan Bolisetty, The Jackson Laboratory
Kerstin Howe
Doug Hinerfeld
Xiaoan Ruan, The Jackson Laboratory
Lucy B. Rowe, The Jackson Laboratory
Mary Barter, The Jackson Laboratory
Guruprasad Ananda, The Jackson Laboratory
Benedict Paten
George M. Weinstock, The Jackson Laboratory
Gary Churchill, The Jackson Laboratory
Michael V. Wiles, The Jackson Laboratory
Valerie A Schneider
Anuj Srivastava, The Jackson Laboratory
Laura G Reinholdt

We are grateful for the services provided by The Jackson Laboratory Genome Technologies and Computational Sciences Core.

Abstract

Isogenic laboratory mouse strains enhance reproducibility because individual animals are genetically identical. For the most widely used isogenic strain, C57BL/6, there exists a wealth of genetic, phenotypic, and genomic data, including a high-quality reference genome (GRCm38.p6). Twenty years after the first release of the mouse reference genome, C57BL/6J mice are at least 26 inbreeding generations removed from GRCm38, and the strain is now maintained by periodic reintroduction of cryorecovered mice derived from a single breeder pair, aptly named Adam and Eve. To provide an update to the mouse reference genome that more accurately represents the genome of today's C57BL/6J mice, we used long-read, short-read, and optical mapping technologies to generate a de novo assembly of the C57BL/6J Eve genome (B6Eve). Using these data, we have addressed recurring variants observed in previous mouse genomic studies. We have also identified structural variations, closed gaps in the mouse reference assembly, and revealed previously unannotated coding sequences. The B6Eve assembly explains discrepant observations that have been associated with GRCm38-based analyses and will inform a reference genome that is more representative of the C57BL/6J mice in use today.