WORMHOLE: Novel Least Diverged Ortholog Prediction through Machine Learning.
Document Type
Article
Publication Date
11-3-2016
JAX Source
PLos Comput Biol 2016 Nov 3; 12(11):e1005182
Volume
12
Issue
11
First Page
1005182
Last Page
1005182
ISSN
1553-7358
PMID
27812085
Grant
AG038070, CA34196
Abstract
The rapid advancement of technology in genomics and targeted genetic manipulation has made comparative biology an increasingly prominent strategy to model human disease processes. Predicting orthology relationships between species is a vital component of comparative biology. Dozens of strategies for predicting orthologs have been developed using combinations of gene and protein sequence, phylogenetic history, and functional interaction with progressively increasing accuracy. A relatively new class of orthology prediction strategies combines aspects of multiple methods into meta-tools, resulting in improved prediction performance. Here we present WORMHOLE, a novel ortholog prediction meta-tool that applies machine learning to integrate 17 distinct ortholog prediction algorithms to identify novel least diverged orthologs (LDOs) between 6 eukaryotic species-humans, mice, zebrafish, fruit flies, nematodes, and budding yeast. Machine learning allows WORMHOLE to intelligently incorporate predictions from a wide-spectrum of strategies in order to form aggregate predictions of LDOs with high confidence. In this study we demonstrate the performance of WORMHOLE across each combination of query and target species. We show that WORMHOLE is particularly adept at improving LDO prediction performance between distantly related species, expanding the pool of LDOs while maintaining low evolutionary distance and a high level of functional relatedness between genes in LDO pairs. We present extensive validation, including cross-validated prediction of PANTHER LDOs and evaluation of evolutionary divergence and functional similarity, and discuss future applications of machine learning in ortholog prediction. A WORMHOLE web tool has been developed and is available at http://wormhole.jax.org/. PLos Comput Biol 2016 Nov 3; 12(11):e1005182
Recommended Citation
Sutphin GL,
Mahoney J,
Sheppard K,
Walton D,
Korstanje R.
WORMHOLE: Novel Least Diverged Ortholog Prediction through Machine Learning. PLos Comput Biol 2016 Nov 3; 12(11):e1005182