Mouse Phenome Database: towards a more FAIR-compliant and TRUST-worthy data repository and tool suite for phenotypes and genotypes.

Molly A. Bogue, The Jackson Laboratory
Robyn L Ball, The Jackson Laboratory
Vivek M. Philip, The Jackson Laboratory
David O Walton, The Jackson Laboratory
Matthew H Dunn, The Jackson Laboratory
Georgi Kolishovski, The Jackson Laboratory
Anna Lamoureux, The Jackson Laboratory
Matthew Gerring, The Jackson Laboratory
Hongping Liang, The Jackson Laboratory
Jake Emerson, The Jackson Laboratory
Timothy M Stearns, The Jackson Laboratory
Hao He, The Jackson Laboratory
Gaurab Mukherjee, The Jackson Laboratory
John Bluis, The Jackson Laboratory
Sejal Desai, The Jackson Laboratory
Beth A. Sundberg, The Jackson Laboratory
Beena Kadakkuzha, The Jackson Laboratory
Govindarajan Kunde-Ramamoorthy, The Jackson Laboratory
Elissa J Chesler

This is an Open Access article distributed under the terms of the Creative Commons Attribution License (, which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.


The Mouse Phenome Database (MPD;; RRID:SCR_003212), supported by the US National Institutes of Health, is a Biomedical Data Repository listed in the Trans-NIH Biomedical Informatics Coordinating Committee registry. As an increasingly FAIR-compliant and TRUST-worthy data repository, MPD accepts phenotype and genotype data from mouse experiments and curates, organizes, integrates, archives, and distributes those data using community standards. Data are accompanied by rich metadata, including widely used ontologies and detailed protocols. Data are from all over the world and represent genetic, behavioral, morphological, and physiological disease-related characteristics in mice at baseline or those exposed to drugs or other treatments. MPD houses data from over 6000 strains and populations, representing many reproducible strain types and heterogenous populations such as the Diversity Outbred where each mouse is unique but can be genotyped throughout the genome. A suite of analysis tools is available to aggregate, visualize, and analyze these data within and across studies and populations in an increasingly traceable and reproducible manner. We have refined existing resources and developed new tools to continue to provide users with access to consistent, high-quality data that has translational relevance in a modernized infrastructure that enables interaction with a suite of bioinformatics analytic and data services.