TY - JOUR
T1 - Evolutionary conservation suggests a regulatory function of AUG triplets in 5′-UTRs of eukaryotic genes
AU - Churbanov, Alexander
AU - Rogozin, Igor B.
AU - Babenko, Vladimir N.
AU - Ali, Hesham
AU - Koonin, Eugene V.
N1 - Funding Information:
We thank Yuri Wolf, Svetlana Shabalina, Fyodor Kondrashov, King Jordan and Aleksey Kochetov for helpful discussions. This work was supported in part by the Intramural Research Program of the National Library of Medicine at National Institutes of Health/DHHS and NIH grant number P20 RR16469 from the INBRE program of the National Center for Research Resources. Funding to pay the Open Access publication charges for this article was provided by the National Institutes of Health, USA.
PY - 2005
Y1 - 2005
N2 - By comparing sequences of human, mouse and rat orthologous genes, we show that in 5′-untranslated regions (5′-UTRs) of mammalian cDNAs but not in 3′-UTRs or coding sequences, AUG is conserved to a significantly greater extent than any of the other 63 nt triplets. This effect is likely to reflect, primarily, bona fide evolutionary conservation, rather than cDNA annotation artifacts, because the excess of conserved upstream AUGs (uAUGs) is seen in 5′-UTRs containing stop codons in-frame with the start AUG and many of the conserved AUGs are found in different frames, consistent with the location in authentic non-coding sequences. Altogether, conserved uAUGs are present in at least 20-30% of mammalian genes. Qualitatively similar results were obtained by comparison of orthologous genes from different species of the yeast genus Saccharomyces. Together with the observation that mammalian and yeast 5′-UTRs are significantly depleted in overall AUG content, these findings suggest that AUG triplets in 5′-UTRs are subject to the pressure of purifying selection in two opposite directions: the uAUGs that have no specific function tend to be deleterious and get eliminated during evolution, whereas those uAUGs that do serve a function are conserved. Most probably, the principal role of the conserved uAUGs is attenuation of translation at the initiation stage, which is often additionally regulated by alternative splicing in the mammalian 5′-UTRs. Consistent with this hypothesis, we found that open reading frames starting from conserved uAUGs are significantly shorter than those starting from non-conserved uAUGs, possibly, owing to selection for optimization of the level of attenuation.
AB - By comparing sequences of human, mouse and rat orthologous genes, we show that in 5′-untranslated regions (5′-UTRs) of mammalian cDNAs but not in 3′-UTRs or coding sequences, AUG is conserved to a significantly greater extent than any of the other 63 nt triplets. This effect is likely to reflect, primarily, bona fide evolutionary conservation, rather than cDNA annotation artifacts, because the excess of conserved upstream AUGs (uAUGs) is seen in 5′-UTRs containing stop codons in-frame with the start AUG and many of the conserved AUGs are found in different frames, consistent with the location in authentic non-coding sequences. Altogether, conserved uAUGs are present in at least 20-30% of mammalian genes. Qualitatively similar results were obtained by comparison of orthologous genes from different species of the yeast genus Saccharomyces. Together with the observation that mammalian and yeast 5′-UTRs are significantly depleted in overall AUG content, these findings suggest that AUG triplets in 5′-UTRs are subject to the pressure of purifying selection in two opposite directions: the uAUGs that have no specific function tend to be deleterious and get eliminated during evolution, whereas those uAUGs that do serve a function are conserved. Most probably, the principal role of the conserved uAUGs is attenuation of translation at the initiation stage, which is often additionally regulated by alternative splicing in the mammalian 5′-UTRs. Consistent with this hypothesis, we found that open reading frames starting from conserved uAUGs are significantly shorter than those starting from non-conserved uAUGs, possibly, owing to selection for optimization of the level of attenuation.
UR - http://www.scopus.com/inward/record.url?scp=26944478233&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=26944478233&partnerID=8YFLogxK
U2 - 10.1093/nar/gki847
DO - 10.1093/nar/gki847
M3 - Article
C2 - 16186132
AN - SCOPUS:26944478233
SN - 0305-1048
VL - 33
SP - 5512
EP - 5520
JO - Nucleic acids research
JF - Nucleic acids research
IS - 17
ER -