Faculty Research 2000 - 2009

Discrimination of Non-Protein-Coding Transcripts from Protein-Coding mRNA.

M C. Frith
T L. Bailey
T Kasukawa
F Mignone
S K. Kummerfeld
M Madera
S Sunkara
M Furuno
C J. Bult
J Quackenbush
C Kai
J Kawai
P Carninci
Y Hayashizaki
G Pesole
J S. Mattick

Document Type

Article

Publication Date

2006

First Page

40

Last Page

48

JAX Source

RNA Biol 2006 Jan; 3(1):40-48.

Abstract

Several recent studies indicate that mammals and other organisms produce large numbers of RNA transcripts that do not correspond to known genes. It has been suggested that these transcripts do not encode proteins, but may instead function as RNAs. However, discrimination of coding and non-coding transcripts is not straightforward, and different laboratories have used different methods, whose ability to perform this discrimination is unclear. In this study, we examine ten bioinformatic methods that assess protein-coding potential and compare their ability and congruency in the discrimination of non-coding from coding sequences, based on four underlying principles: open reading frame size, sequence similarity to known proteins or protein domains, statistical models of protein-coding sequence, and synonymous versus non-synonymous substitution rates. Despite these different approaches, the methods show broad concordance, suggesting that coding and non-coding transcripts can, in general, be reliably discriminated, and that many of the recently discovered extra-genic transcripts are indeed non-coding. Comparison of the methods indicates reasons for unreliable predictions, and approaches to increase confidence further. Conversely and surprisingly, our analyses also provide evidence that as much as approximately 10% of entries in the manually curated protein database Swiss-Prot are erroneous translations of actually non-coding transcripts.
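The first of the four principles named in the abstract, open reading frame size, can be illustrated with a short sketch. This is not the code used in the study: the function names, the forward-strand-only scan, the requirement for an ATG start and an in-frame stop, and the 100-codon cutoff are all assumptions chosen for illustration.

```python
from typing import Iterator

# Stop codons of the standard genetic code (DNA alphabet).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def orf_lengths(seq: str) -> Iterator[int]:
    """Yield the length, in codons, of each ORF on the forward strand.

    An ORF here starts at the first ATG seen in a frame and ends at the
    next in-frame stop codon; the stop codon is not counted. ORFs that
    run off the end of the sequence without a stop are ignored
    (an illustrative simplification).
    """
    seq = seq.upper().replace("U", "T")  # accept RNA or DNA input
    for frame in range(3):
        start = None
        for i in range(frame, len(seq) - 2, 3):
            codon = seq[i:i + 3]
            if start is None and codon == "ATG":
                start = i
            elif start is not None and codon in STOP_CODONS:
                yield (i - start) // 3
                start = None


def longest_orf_codons(seq: str) -> int:
    """Return the length in codons of the longest ORF, or 0 if none."""
    return max(orf_lengths(seq), default=0)


def looks_coding(seq: str, min_codons: int = 100) -> bool:
    """Crude coding call: is the longest ORF at least min_codons long?

    The default of 100 codons is a hypothetical cutoff, not a value
    taken from the abstract.
    """
    return longest_orf_codons(seq) >= min_codons


if __name__ == "__main__":
    transcript = "CC" + "ATG" + "GCT" * 120 + "TGA" + "CC"
    print(longest_orf_codons(transcript), looks_coding(transcript))
```

An ORF-length test alone misclassifies short genuine proteins and long chance ORFs, which is consistent with the abstract's point that comparing several independent methods increases confidence.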

Recommended Citation

Frith MC, Bailey TL, Kasukawa T, Mignone F, Kummerfeld SK, Madera M, Sunkara S, Furuno M, Bult CJ, Quackenbush J, Kai C, Kawai J, Carninci P, Hayashizaki Y, Pesole G, Mattick JS. Discrimination of Non-Protein-Coding Transcripts from Protein-Coding mRNA. RNA Biol 2006 Jan; 3(1):40-48.
