Document Type
Article
Publication Date
1-1-2019
Keywords
JMG
JAX Source
Database (Oxford) 2019 Jan 1;2019:baz036
Volume
2019
ISSN
1758-0463
PMID
30888410
DOI
https://doi.org/10.1093/database/baz036
Grant
CA034196, OD020351, AA18776
Abstract
Genomic data interpretation often requires analyses that move from a gene-by-gene focus to a focus on sets of genes that are associated with biological phenomena such as molecular processes, phenotypes, diseases, drug interactions or environmental conditions. Unique challenges exist in the curation of gene sets beyond the challenges in curation of individual genes. Here we highlight a literature curation workflow whereby gene sets are curated from peer-reviewed published data into GeneWeaver (GW), a data repository and analysis platform. We describe the system features that allow for a flexible yet precise curation procedure. We illustrate the value of curation by gene sets through analysis of independently curated sets that relate to the integrated stress response, showing that sets curated from independent sources all share significant Jaccard similarity. A suite of reproducible analysis tools is provided in GW as services to carry out interactive functional investigation of user-submitted gene sets within the context of over 150 000 gene sets constructed from publicly available resources and published gene lists. A curation interface supports the ability of users to design and maintain curation workflows of gene sets, including assigning, reviewing and releasing gene sets within a curation project context.
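The Jaccard similarity mentioned in the abstract is the size of the intersection of two sets divided by the size of their union. The following is a minimal sketch of that calculation only; it is not GeneWeaver's implementation. The function name `jaccard` and the placeholder gene identifiers are hypothetical and are not taken from the article, and the sketch does not cover how statistical significance of the similarity is assessed.

```python
def jaccard(a, b):
    """Return the Jaccard similarity |A ∩ B| / |A ∪ B| of two collections."""
    a, b = set(a), set(b)
    union = a | b
    if not union:
        # Convention chosen for this sketch: two empty sets score 0.0.
        return 0.0
    return len(a & b) / len(union)


# Placeholder gene sets (hypothetical identifiers, not curated data).
set_a = {"GeneA", "GeneB", "GeneC", "GeneD"}
set_b = {"GeneA", "GeneB", "GeneE", "GeneF"}

# Two shared genes out of six distinct genes overall.
score = jaccard(set_a, set_b)
print(f"Jaccard similarity: {score:.3f}")
```

A score of 1 means the two sets are identical and a score of 0 means they share no genes, so independently curated sets for the same biological phenomenon would be expected to score well above what unrelated sets do.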
Recommended Citation
Bubier JA, Hill DP, Mukherjee G, Reynolds T, Baker E, Berger A, Emerson J, Blake JA, Chesler E. Curating gene sets: challenges and opportunities for integrative analysis. Database (Oxford) 2019 Jan 1;2019:baz036
Comments
The authors would like to thank Robert Burgess, Emily Spaulding and Beena Kadhakhuza for their suggestions and comments toward this work.
Open access under the terms of the Creative Commons Attribution License