Faculty Research 2000 - 2009

CDS annotation in full-length cDNA sequence.

M Furuno
T Kasukawa
R Saito
J Adachi
H Suzuki
R Baldarelli
Y Hayashizaki
Y Okazaki

Document Type

Article

Publication Date

2003

Keywords

Codon, Comparative-Study, Computational-Biology, DNA-Complementary, Databases-Genetic, Forecasting, Genome, Human, Mice, Open-Reading-Frames, Proteins, RNA-Splicing, Research-Design, Selenocysteine, Software-Design, Translation-Genetic

First Page

1478

Last Page

1487

JAX Source

Genome Res 2003 Jun; 13(6B):1478-87,

Abstract

The identification of coding sequences (CDS) is an important step in the functional annotation of genes. CDS prediction for mammalian genes from genomic sequence is complicated by the vast abundance of intergenic sequence in the genome, and provides little information about how different parts of potential CDS regions are expressed. In contrast, mammalian gene CDS prediction from cDNA sequence offers obvious advantages, yet encounters a different set of complexities when performed on high-throughput cDNA (HTC) sequences, such as the set of 60,770 cDNAs isolated from full-length enriched libraries of the FANTOM2 project. We developed a CDS annotation strategy that uses a variety of different CDS prediction programs to annotate the CDS regions of FANTOM2 cDNAs. These include rsCDS, which uses sequence similarity to known proteins; ProCrest; Longest-ORF and Truncated-ORF, which are ab initio based predictors; and finally, DECODER and NCBI CDS predictor, which use a combination of both principles. Aided by graphical displays of these CDS prediction results in the context of other sequence similarity results for each cDNA, FANTOM2 CDS inspection by curators and follow-up quality control procedures resulted in high quality CDS predictions for a total of 14,345 FANTOM2 clones.

Recommended Citation

Furuno M, Kasukawa T, Saito R, Adachi J, Suzuki H, Baldarelli R, Hayashizaki Y, Okazaki Y. CDS annotation in full-length cDNA sequence. Genome Res 2003 Jun; 13(6B):1478-87,

Please contact the Joan Staats Library for information regarding this document.

COinS

Faculty Research 2000 - 2009

CDS annotation in full-length cDNA sequence.

Document Type

Publication Date

Keywords

First Page

Last Page

JAX Source

Abstract

Recommended Citation

Search

Browse

Links

Faculty Research 2000 - 2009

CDS annotation in full-length cDNA sequence.

Authors

Document Type

Publication Date

Keywords

First Page

Last Page

JAX Source

Abstract

Recommended Citation

Share

Search

Browse

Links