Quantifying position-dependent codon usage bias
Hockenberry AJ, Sirer MI, Amaral LAN, Jewett MCMolecular Biology and Evolution 31(7), 1880 - 1893 (2014)
Times cited: 2
Abstract
Although the mapping of codon to amino acid is conserved across nearly
all species, the frequency at which synonymous codons are used varies
both between organisms and between genes from the same organism. This
variation affects diverse cellular processes including protein
expression, regulation, and folding. Here, we mathematically model an
additional layer of complexity and show that individual codon usage
biases follow a position-dependent exponential decay model with unique
parameter fits for each codon. We use this methodology to perform an
in-depth analysis on codon usage bias in the model organism Escherichia
coli. Our methodology shows that lowly and highly expressed genes are
more similar in their codon usage patterns in the 5?gene regions, but
that these preferences diverge at distal sites resulting in greater
positional dependency (pD, which we mathematically define later) for
highly expressed genes. We show that position-dependent codon usage bias
is partially explained by the structural requirements of mRNAs that
results in increased usage of A/T rich codons shortly after the gene
start. However, we also show that the pD of 4 and 6-fold degenerate
codons is partially related to the gene copy number of cognate-tRNAs
supporting existing hypotheses that posit benefits to a region of slow
translation in the beginning of coding sequences. Lastly, we demonstrate
that viewing codon usage bias through a position-dependent framework has
practical utility by improving accuracy of gene expression prediction
when incorporating positional dependencies into the Codon Adaptation
Index model.