Skip to content

Are There 20 Naturally Occurring Amino Acids? The Surprising Truth

4 min read

There are over 300 different amino acids that occur naturally in nature, far more than the commonly cited 20 that are encoded by the standard genetic code. So, are there 20 naturally occurring amino acids? The answer lies in understanding the distinction between standard and non-standard amino acids and the roles they play in different organisms.

Quick Summary

The number of naturally occurring amino acids depends on how you define 'naturally occurring.' While 20 are standard for protein synthesis, additional genetically-encoded ones and hundreds of non-canonical varieties exist.

Key Points

  • Standard vs. Non-Standard: The perception of 20 amino acids is based on the canonical set used in protein synthesis, but hundreds of other non-standard varieties exist in nature.

  • The 21st Amino Acid: Selenocysteine is a genetically encoded amino acid that uses a modified 'stop' codon (UGA) and a special mRNA structure for its insertion into proteins.

  • The 22nd Amino Acid: Pyrrolysine is a genetically encoded amino acid found in certain methanogenic organisms, also incorporated via a recoded 'stop' codon (UAG).

  • Non-Canonical Functions: Many other amino acids not encoded by the genetic code are vital for specific metabolic pathways, signaling, or as structural components in various organisms.

  • Context is Key: Whether there are 20, 22, or hundreds of naturally occurring amino acids depends entirely on the biological context being discussed, from protein synthesis to broader metabolism.

  • Specialized Use Cases: The 21st and 22nd amino acids allow certain proteins, like enzymes with special redox functions, to perform tasks the standard 20 cannot.

In This Article

For many, the mention of amino acids conjures up the image of a fixed set of 20, the fundamental building blocks of all proteins. This is a common and understandable oversimplification taught in high school biology. However, the true picture is far more complex and fascinating, revealing the immense biochemical diversity of life. The answer to the question, "Are there 20 naturally occurring amino acids?" is both yes and no, depending on the specific context.

The 20 Canonical Amino Acids

The 20 canonical, or standard, amino acids are the ones directly encoded by the universal genetic code and used by the cellular machinery (ribosomes and tRNAs) to build proteins in all known forms of life. These are the proteinogenic amino acids, the basic letters in life's instruction manual. They all share a common basic structure: a central carbon atom (the alpha-carbon) bonded to an amino group, a carboxyl group, a hydrogen atom, and a unique side-chain (R-group) that determines its properties.

Classification of the Canonical 20

The chemical properties of the side-chain are what give each amino acid its distinct characteristics and determine how proteins fold and function. They are typically categorized by their polarity and charge.

  • Nonpolar, Aliphatic: Glycine, Alanine, Valine, Leucine, Isoleucine, Proline.
  • Aromatic: Phenylalanine, Tyrosine, Tryptophan.
  • Polar, Uncharged: Serine, Threonine, Cysteine, Methionine, Asparagine, Glutamine.
  • Positively Charged (Basic): Lysine, Arginine, Histidine.
  • Negatively Charged (Acidic): Aspartic acid, Glutamic acid.

The 21st and 22nd Amino Acids: Selenocysteine and Pyrrolysine

Beyond the standard 20, scientists have discovered two more amino acids that are also genetically encoded, though through a special translational process. These are selenocysteine (Sec) and pyrrolysine (Pyl), often referred to as the 21st and 22nd proteinogenic amino acids.

  • Selenocysteine (Sec): This amino acid is an analog of cysteine, with a selenium atom replacing the sulfur. It is found in proteins across all three domains of life (bacteria, archaea, and eukaryotes), known as selenoproteins. Its incorporation is directed by a UGA codon, which normally functions as a stop codon. A specialized mRNA hairpin structure, the SECIS element, helps the ribosome to recognize this UGA as a command to insert selenocysteine instead of terminating translation.
  • Pyrrolysine (Pyl): Discovered in certain methanogenic archaea and bacteria, pyrrolysine is a derivative of lysine. Like selenocysteine, it is incorporated during protein synthesis in response to a 'stop' codon (UAG, the amber codon). Its incorporation is dependent on a specific tRNA and synthetase encoded by genes within the organism's genome.

These discoveries show that the genetic code, while largely universal, has small but significant deviations that allow for expanded chemical diversity in proteins.

Beyond the 22: Non-Canonical Amino Acids

Even with the inclusion of selenocysteine and pyrrolysine, the total number of naturally occurring amino acids is far from complete. The term “non-canonical” or “non-proteinogenic” refers to amino acids that are found in nature but are not typically used as building blocks for proteins via the genetic code. These can be natural intermediates in metabolic pathways, modified forms of canonical amino acids, or constituents of other biologically active molecules, such as peptide antibiotics.

Examples include:

  • Hydroxyproline and Hydroxylysine: Found in collagen and elastin, these are modified canonical amino acids.
  • Ornithine and Citrulline: These are intermediates in the urea cycle and are not incorporated into proteins.
  • Beta-alanine: A precursor to the coenzyme A and carnosine, beta-alanine has a different structure than the alpha-amino acids of proteins.

This vast chemical library underscores the inventive ways nature has adapted amino acid chemistry for diverse biological functions beyond protein construction.

Comparing the Different Classes of Amino Acids

To better understand the distinctions, here is a comparison table outlining the key features of these different amino acid categories.

Feature Canonical 20 Selenocysteine & Pyrrolysine Non-Canonical (Beyond 22)
Genetic Encoding Yes, via standard codons (61) Yes, via specialized recoding of stop codons No, not encoded by the genetic code
Incorporation Standard ribosomal translation Special translational recoding mechanisms Not incorporated into proteins by ribosomes
Occurrence Universal across all known life Found in specific organisms (Archaea, Bacteria, Eukarya) Widely distributed across organisms
Number 20 2 (Sec and Pyl) Hundreds of known varieties
Function Primary building blocks of proteins Active site components of specialized enzymes (e.g., redox) Metabolic intermediates, structural components, signaling molecules

Conclusion: More Than Just the Basic 20

The statement that there are only 20 naturally occurring amino acids is a simplification that overlooks the greater complexity of biological chemistry. While the 20 canonical amino acids are universally recognized as the primary building blocks of proteins, two additional genetically encoded amino acids—selenocysteine and pyrrolysine—exist and are crucial for the functions of certain proteins. Furthermore, the natural world is filled with hundreds of other non-canonical amino acids that perform a wide array of biochemical roles outside of standard protein synthesis. For most practical purposes, particularly in human nutrition, focusing on the 20 standard essential and non-essential amino acids is sufficient. However, a deeper understanding of biochemistry reveals that nature's amino acid toolbox is much larger and more diverse than the standard list suggests. For further reading on this subject, exploring the specifics of essential versus non-essential amino acids provides more context on their dietary importance.

NCBI: Biochemistry, Essential Amino Acids

Frequently Asked Questions

Canonical amino acids are the 20 standard types that are directly encoded by the universal genetic code and are used by ribosomes to build proteins. Non-canonical amino acids are those found in nature but are not part of this standard set, often serving other metabolic or structural roles.

Selenocysteine and pyrrolysine are special because they are genetically encoded, meaning their incorporation into proteins is directed by the genetic code. However, they use a unique translational mechanism that recodes a 'stop' codon to incorporate them, making them exceptions to the standard 20.

No, their use is not universal. Selenocysteine is found across all three domains of life (bacteria, archaea, eukaryotes), but pyrrolysine is far more restricted, primarily found in certain methanogenic archaea and bacteria.

The total number is not fixed. Researchers have identified hundreds of non-canonical amino acids in nature, with new varieties potentially still being discovered, serving diverse biochemical functions.

No, the genetic code still consists of 64 codons. The unique nature of selenocysteine and pyrrolysine relies on a specialized translational recoding system that re-interprets a standard stop codon (UGA for Sec, UAG for Pyl) to insert an amino acid instead of ending translation.

Common examples include hydroxyproline and hydroxylysine (found in collagen), ornithine and citrulline (intermediates in the urea cycle), and gamma-aminobutyric acid (GABA), a neurotransmitter derived from glutamate.

Humans have selenoproteins and thus require selenium, but the mechanism for selenocysteine synthesis is internal. We do not use pyrrolysine and therefore do not require it in our diet. Our nutritional needs are met by the 20 standard amino acids, along with proper intake of essential minerals like selenium.

Medical Disclaimer

This content is for informational purposes only and should not replace professional medical advice.