Computational analysis of the yeast proteome: understanding and exploiting functional specificity in genomic data.
Data-Interpretation-Statistical, Databases-Genetic, Databases-Protein, Genome-Fungal, Genomics, Proteome, Proteomics, Saccharomyces-cerevisiae, Systems-Biology
see Reprint Collection (a pdf is available)
Methods Mol Biol 2009; 548:273-93.
Modern experimental techniques have produced a wealth of high-throughput data that has enabled the ongoing genomic revolution. As the field continues to integrate experimental and computational analyzes of this data, it is essential that performance evaluations of high-throughput results be carried out in a consistent and biologically informative manner. Here, we present an overview of evaluation techniques for high-throughput experimental data and computational methods, and we discuss a number of potential pitfalls in this process. These primarily involve the biological diversity of genomic data, which can be masked or misrepresented in overly simplified global evaluations. We describe systems for preserving information about biological context during dataset evaluation, which can help to ensure that multiple different evaluations are more directly comparable. This biological variety in high-throughput data can also be taken advantage of computationally through data integration and process specificity to produce richer systems-level predictions of cellular function. An awareness of these considerations can greatly improve the evaluation and analysis of any high-throughput experimental dataset.
Computational analysis of the yeast proteome: understanding and exploiting functional specificity in genomic data. Methods Mol Biol 2009; 548:273-93.