20 Amino acids, their single-letter data-base codes (SLC), and their corresponding DNA codons |
Amino Acid |
SLC |
DNA codons |
Isoleucine |
I |
ATT, ATC, ATA |
Leucine |
L |
CTT, CTC, CTA, CTG, TTA, TTG |
Valine |
V |
GTT, GTC, GTA, GTG |
Phenylalanine |
F |
TTT, TTC |
Methionine | M | ATG |
Cysteine | C |
TGT, TGC |
Alanine |
A |
GCT, GCC, GCA, GCG |
Glycine |
G |
GGT, GGC, GGA, GGG |
Proline |
P |
CCT, CCC, CCA, CCG |
Threonine |
T |
ACT, ACC, ACA, ACG |
Serine | S | TCT, TCC, TCA, TCG, AGT, AGC |
Tyrosine |
Y | TAT, TAC |
Tryptophan | W | TGG |
Glutamine | Q | CAA, CAG |
Asparagine | N | AAT, AAC |
Histidine |
H |
CAT, CAC |
Glutamic acid |
E |
GAA, GAG |
Aspartic acid |
D |
GAT, GAC |
Lysine |
K |
AAA, AAG |
Arginine |
R |
CGT, CGC, CGA, CGG, AGA, AGG |
Stop codons | Stop | TAA, TAG, TGA |
In this table, the twenty amino acids found in proteins are listed, along with the single-letter code used to represent these amino acids in protein data bases. The DNA codons representing each amino acid are also listed. All 64 possible 3-letter combinations of the DNA coding units T, C, A and G are used either to encode one of these amino acids or as one of the three stop codons that signals the end of a sequence. While DNA can be decoded unambiguously, it is not possible to predict a DNA sequence from its protein sequence. Because most amino acids have multiple codons, a number of possible DNA sequences might represent the same protein sequence. |