Amino Acids & Peptides: Fundamentals

The essential building blocks of proteins and their properties

Contents

What Are Amino Acids?

Amino acids are the fundamental building blocks of proteins and peptides. Every amino acid shares a common core structure but differs in its side chain (R group), which determines its unique chemical and physical properties.

The General Structure

        NH₃⁺
        |
    R — Cα — COO⁻
        |
        H
                    

Four Groups Around the Central α-Carbon:

  • Amino group (–NH₃⁺): Always attached to the α-carbon (hence "α-amino acid")
  • Carboxyl group (–COO⁻): Provides the acidic character
  • Hydrogen atom (–H): The fourth substituent (except in glycine)
  • Side chain (R): Variable group that defines each amino acid's identity

Zwitterionic Nature

⚡ Amino Acids as Zwitterions

At physiological pH (~7.4), amino acids exist as zwitterions—molecules that carry both a positive charge (on the amino group) and a negative charge (on the carboxyl group) simultaneously:

At pH 7.4: R–CH(NH₃⁺)–COO⁻

This dipolar form is the predominant species in aqueous solution at neutral pH. The amino acid has zero net charge (charge balance) but is not uncharged—both the positive and negative charges are present and electrostatically active.

Why this matters: The zwitterionic character affects solubility, crystal packing, and reactivity. It also explains why amino acids have relatively high melting points (strong ionic interactions) compared to similar-sized uncharged molecules.

The 20 Standard Amino Acids

Twenty amino acids are encoded by the standard genetic code and commonly found in proteins. Each is abbreviated by both a three-letter code and a single-letter code:

G / Gly
Glycine
R = H
👆 Click for structure
Glycine structure
👆 Click to return
A / Ala
Alanine
R = CH₃
👆 Click for structure
Alanine structure
👆 Click to return
V / Val
Valine
R = CH(CH₃)₂
👆 Click for structure
Valine structure
👆 Click to return
L / Leu
Leucine
R = CH₂CH(CH₃)₂
👆 Click for structure
Leucine structure
👆 Click to return
I / Ile
Isoleucine
R = CH(CH₃)CH₂CH₃
👆 Click for structure
Isoleucine structure
👆 Click to return
M / Met
Methionine
R = CH₂CH₂SCH₃
👆 Click for structure
Methionine structure
👆 Click to return
F / Phe
Phenylalanine
R = CH₂-phenyl
👆 Click for structure
Phenylalanine structure
👆 Click to return
W / Trp
Tryptophan
R = CH₂-indole
👆 Click for structure
Tryptophan structure
👆 Click to return
P / Pro
Proline
Cyclic (imino acid)
👆 Click for structure
Proline structure
👆 Click to return
S / Ser
Serine
R = CH₂OH
👆 Click for structure
Serine structure
👆 Click to return
T / Thr
Threonine
R = CH(OH)CH₃
👆 Click for structure
Threonine structure
👆 Click to return
C / Cys
Cysteine
R = CH₂SH
👆 Click for structure
Cysteine structure
👆 Click to return
Y / Tyr
Tyrosine
R = CH₂-phenol
👆 Click for structure
Tyrosine structure
👆 Click to return
N / Asn
Asparagine
R = CH₂CONH₂
👆 Click for structure
Asparagine structure
👆 Click to return
Q / Gln
Glutamine
R = CH₂CH₂CONH₂
👆 Click for structure
Glutamine structure
👆 Click to return
D / Asp
Aspartic Acid
R = CH₂COO⁻
👆 Click for structure
Aspartic acid structure
👆 Click to return
E / Glu
Glutamic Acid
R = CH₂CH₂COO⁻
👆 Click for structure
Glutamic acid structure
👆 Click to return
K / Lys
Lysine
R = (CH₂)₄NH₃⁺
👆 Click for structure
Lysine structure
👆 Click to return
R / Arg
Arginine
R = (CH₂)₃NHC(NH₂)₂⁺
👆 Click for structure
Arginine structure
👆 Click to return
H / His
Histidine
R = CH₂-imidazole
👆 Click for structure
Histidine structure
👆 Click to return
💡 Memorization Tip

Common mnemonics for remembering amino acids:

  • Nonpolar: "GAVLIM FWP" (sounds like "gave 'em FWP")
  • Polar: "STCYNQ" (stick-en-cue)
  • Acidic: "DE" (Aspartate, Glutamate - think "D-E-acidic")
  • Basic: "KRH" (Lysine, aRginine, Histidine - think "cationic")

Amino Acid Classification

Amino acids can be classified in multiple ways based on their chemical and physical properties. The most common classification is by side chain polarity and charge.

By Side Chain Properties

🔵 Nonpolar / Hydrophobic (9 amino acids)

Members: Gly (G), Ala (A), Val (V), Leu (L), Ile (I), Met (M), Phe (F), Trp (W), Pro (P)

Characteristics:

  • Side chains contain mostly carbon and hydrogen
  • Cannot form hydrogen bonds with water (except backbone)
  • Tend to cluster together in protein cores (hydrophobic effect)
  • Critical for membrane protein structure and stability

Special cases:
Glycine (G): Smallest side chain (R = H), achiral, provides flexibility
Proline (P): Cyclic structure, restricts backbone flexibility, "helix breaker"
Aromatic (F, W, Y): Contain benzene-like rings, absorb UV light at 280 nm

🟢 Polar, Uncharged (6 amino acids)

Members: Ser (S), Thr (T), Cys (C), Tyr (Y), Asn (N), Gln (Q)

Characteristics:

  • Can form hydrogen bonds with water and other molecules
  • Often found on protein surfaces
  • Participate in enzyme active sites and binding pockets
  • Neutral at physiological pH (no net charge)

Special cases:
Cysteine (C): Thiol group (–SH) can form disulfide bonds (S–S), pKa ~8.3
Tyrosine (Y): Phenolic –OH can be ionized at high pH, pKa ~10.1
Serine & Threonine: Hydroxyl groups, sites for phosphorylation (PTM)

🔴 Negatively Charged / Acidic (2 amino acids)

Members: Asp (D), Glu (E)

Characteristics:

  • Carboxyl groups in side chains (pKa ~3.9-4.1)
  • Negatively charged (–COO⁻) at physiological pH
  • Repel other negative charges, attract positive charges
  • Critical for metal ion binding (Ca²⁺, Mg²⁺, Zn²⁺)

Nomenclature note: "Aspartate" and "glutamate" refer to the ionized forms (–COO⁻), while "aspartic acid" and "glutamic acid" refer to the protonated forms (–COOH). At pH 7.4, they exist almost entirely as aspartate and glutamate.

🟣 Positively Charged / Basic (3 amino acids)

Members: Lys (K), Arg (R), His (H)

Characteristics:

  • Nitrogen-containing groups that can accept protons
  • Positively charged at physiological pH (except His, which is ~5% protonated)
  • Attract negative charges, critical for DNA/RNA binding
  • Essential for cell-penetrating peptides and antimicrobial peptides

Special cases:
Lysine (K): pKa ~10.5, almost always protonated (+) at pH 7.4
Arginine (R): pKa ~12.5, always protonated (+) at biological pH
Histidine (H): pKa ~6.0, can switch between protonated/deprotonated near pH 7.4

🧬 Functional Importance of Classification

Understanding amino acid categories helps predict:

  • Protein structure: Hydrophobic residues buried inside, polar residues on surface
  • Enzyme mechanisms: Acidic/basic residues in active sites for catalysis
  • Protein-protein interactions: Charge complementarity (opposites attract)
  • Membrane association: Hydrophobic and positively charged peptides cross membranes
  • Post-translational modifications: Specific residues are modification sites (Ser/Thr for phosphorylation, Lys for acetylation)

The Peptide Bond

Amino acids link together through peptide bonds to form chains. Understanding peptide bond formation, structure, and properties is essential for comprehending peptide behavior.

Formation: Condensation Reaction

The Bond-Forming Reaction

Peptide bond formation via condensation

Key points:

  • The carboxyl group of one amino acid reacts with the amino group of another
  • A water molecule is released (hence "condensation" or "dehydration" reaction)
  • The resulting C–N bond is the peptide bond (also called amide bond)
  • In cells, this reaction requires energy (ATP) and ribosome machinery
  • In chemical synthesis, requires activating agents and protecting groups

Breaking: Hydrolysis Reaction

The Bond-Breaking Reaction

Peptide bond breaking via hydrolysis

Key points:

  • Water is added across the peptide bond (hence "hydrolysis")
  • The peptide bond is cleaved, regenerating free amino acids
  • In cells, proteases (enzymes) catalyze this reaction
  • Spontaneous hydrolysis is very slow at neutral pH (half-life ~500 years!)
  • Can be accelerated by acid, base, or heat

Structure and Properties of the Peptide Bond

Resonance Structure
The peptide bond has partial double bond character due to resonance between C=O and C–N forms. The nitrogen lone pair delocalizes into the carbonyl, giving the C–N bond ~40% double bond character.

Consequence: Restricted rotation around the C–N bond
Planarity
The six atoms (Cα₁–C–O–N–H–Cα₂) lie in the same plane due to resonance. This rigid, flat geometry constrains protein structure.

Rotation is allowed: Around Cα–C bonds (ψ angle) and Cα–N bonds (φ angle)
Trans Configuration
The peptide bond is almost always in the trans configuration (substituents on opposite sides) rather than cis, because trans minimizes steric clashes.

Exception: Proline can adopt cis configuration (~5-10% of X-Pro bonds)
Polarity
The C=O and N–H groups are polar and can form hydrogen bonds. In proteins, backbone hydrogen bonding drives secondary structure formation (α-helices, β-sheets).

H-bond donors: N–H groups
H-bond acceptors: C=O groups
⚠️ The Peptide Bond is Stable

The peptide bond's partial double bond character makes it remarkably stable. This stability is essential for proteins to maintain their structure, but it also means:

  • Proteins are kinetically stable even when thermodynamically unstable (metastable)
  • Protein degradation requires specific proteases (enzymes) to break bonds efficiently
  • Synthetic peptides can be stored for long periods without significant degradation
  • Special conditions (strong acid, high temperature) are needed to hydrolyze peptides in the lab

From Amino Acids to Peptides

When amino acids link together through peptide bonds, they form peptide chains with distinct structural features and conventions.

Primary Structure: The Sequence

📝 Sequence Notation

Peptide sequences are written from N-terminus to C-terminus (left to right), using either three-letter or one-letter amino acid codes:

Three-letter code:
Gly-Ile-Gly-Ala-Val-Leu-Lys-Val-Leu
One-letter code:
GIGAVLKVL
With terminal labels:
H₂N-GIGAVLKVL-COOH
(showing free N-terminus and C-terminus)

Why N→C convention? This matches the direction of ribosomal synthesis (proteins are built from N-terminus to C-terminus) and is universally used in biochemistry.

N-Terminus and C-Terminus

N-Terminus

Structure: Free amino group (–NH₃⁺ at pH 7.4)
Also called: Amino terminus, N-terminal end
Charge: Usually +1 at physiological pH (pKa ~9.0)
Can be modified: Acetylation (Ac-) removes the charge

C-Terminus

Structure: Free carboxyl group (–COO⁻ at pH 7.4)
Also called: Carboxyl terminus, C-terminal end
Charge: Usually -1 at physiological pH (pKa ~3.1)
Can be modified: Amidation (–NH₂) removes the charge

Peptides vs Proteins

Feature Peptides Proteins
Length Typically 2-50 amino acids Typically >50 amino acids
Structure Often linear, may have some secondary structure Complex 3D structure with defined tertiary/quaternary structure
Synthesis Can be chemically synthesized (solid-phase peptide synthesis) Usually produced by recombinant expression in cells
Stability Generally stable, flexible Require proper folding for stability and function
Examples Oxytocin (9 aa), Insulin B-chain (30 aa), Melittin (26 aa) Lysozyme (129 aa), Hemoglobin (574 aa), Titin (38,138 aa)

Note: The boundary between "peptide" and "protein" is arbitrary and not strictly defined. Generally, chains <50 amino acids are called peptides, while longer chains are proteins. However, small proteins (like insulin at 51 residues) may still be called proteins despite their size.

🔬 Special Peptide Types

Beyond simple linear peptides, several special categories exist:

  • Dipeptides: Two amino acids (e.g., Ala-Gly)
  • Tripeptides: Three amino acids (e.g., glutathione: Glu-Cys-Gly)
  • Oligopeptides: 3-20 amino acids
  • Polypeptides: Many amino acids (>20), often synonymous with "protein"
  • Cyclic peptides: N- and C-termini are bonded (head-to-tail cyclization)
  • Stapled peptides: Chemically crosslinked within the sequence for stability

Peptide Properties Overview

Several key properties can be calculated or predicted from a peptide's amino acid sequence. These properties are essential for understanding peptide behavior and designing experiments.

⚖️

Molecular Weight

What it is: The sum of atomic masses of all atoms in the peptide.

How to calculate: Add the mass of each amino acid residue (amino acid minus H₂O) plus terminal groups (H for N-term, OH for C-term).

Why it matters: Essential for mass spectrometry, concentration calculations, and gel electrophoresis (SDS-PAGE).

Units: Daltons (Da) or g/mol

Net Charge

What it is: The algebraic sum of all positive and negative charges at a given pH.

How to calculate: Count charged groups (N-term, C-term, K, R, H, D, E) and apply Henderson-Hasselbalch equation for each based on pH and pKa.

Why it matters: Determines electrostatic interactions, membrane binding, and purification strategy (ion exchange chromatography).

Learn more about net charge →

🎯

Isoelectric Point (pI)

What it is: The pH at which the peptide has zero net charge (zwitterion).

How to calculate: Find the pH where positive charges = negative charges using iterative numerical methods (bisection, Newton-Raphson).

Why it matters: Minimum solubility at pI, maximum aggregation at pI, critical for isoelectric focusing (IEF) and formulation.

Learn more about isoelectric point →

💧

Hydrophobicity

What it is: A measure of how "water-fearing" vs "water-loving" the peptide is.

How to calculate: Sum hydrophobicity values for each amino acid using scales like Wimley-White (based on water→membrane transfer free energy).

Why it matters: Predicts membrane interactions, retention in reverse-phase HPLC, and tendency to aggregate. Essential for cell-penetrating peptides.

Units: ΔG (kcal/mol)

🔬

Extinction Coefficient (ε₂₈₀)

What it is: How strongly the peptide absorbs UV light at 280 nm wavelength.

How to calculate: Count aromatic residues: Trp (5,500) + Tyr (1,490) + Cys-Cys disulfides (125).

Why it matters: Allows accurate concentration measurement using UV-Vis spectroscopy via Beer-Lambert law (A = εcl).

Units: M⁻¹cm⁻¹

Learn more about UV-Vis spectroscopy →

🧪

Chemical Formula

What it is: The count of each element (C, H, N, O, S) in the peptide.

How to calculate: Sum the elemental composition of each amino acid residue plus terminal groups (H and OH).

Why it matters: Needed for accurate mass calculations, isotope labeling experiments, and elemental analysis.

Example: Gly-Ala = C₅H₁₀N₂O₃

🎨 Try PepDraw!

PepDraw automatically calculates all these properties from your peptide sequence. Simply enter your sequence in the app, and instantly see:

  • Molecular weight (exact mass)
  • Net charge at any pH (interactive slider)
  • Isoelectric point (pI)
  • Hydrophobicity (Wimley-White ΔG)
  • Extinction coefficient (ε₂₈₀)
  • Chemical formula (C, H, N, O, S counts)
  • Plus: Full chemical structure visualization!
Launch PepDraw App →

Stereochemistry in Detail

With the exception of glycine (which has R = H), all amino acids have a chiral center at the α-carbon, meaning they can exist as two non-superimposable mirror images (enantiomers).

L vs D Amino Acids

L-Amino Acids (Natural)

  • All 20 standard amino acids are L-amino acids
  • The amino group is on the LEFT in Fischer projection
  • Used by ribosomes to make proteins
  • Corresponds to (S) configuration in R/S nomenclature (except Cys)
             COO⁻
             |
        H₃N⁺—C—H
             |
             R
        
    (L-amino acid)
                        

How L-amino acids are rendered in PepDraw (up and down):

L-amino acid structure

D-Amino Acids (Synthetic/Rare Natural)

  • Mirror images of L-amino acids
  • The amino group is on the RIGHT in Fischer projection
  • Found in bacterial cell walls and some natural antibiotics
  • Used in synthetic peptides for protease resistance
            COO⁻
            |
          H—C—NH₃⁺
            |
            R
        
        (D-amino acid)
                        

How D-amino acids are rendered in PepDraw (up and down):

D-amino acid structure

Why Does Stereochemistry Matter?

Biological Recognition
Enzymes, receptors, and antibodies are stereospecific—they recognize only L-amino acids. D-amino acids are not substrates for most proteases, making D-peptides resistant to degradation.
Therapeutic Applications
D-amino acids are incorporated into therapeutic peptides to increase metabolic stability. Retro-inverso peptides (reversed sequence + all D-amino acids) can mimic L-peptides but resist proteolysis.
Chemical Properties
L- and D-amino acids have identical physical properties (melting point, solubility, pKa) because they have the same chemical composition. They differ only in optical activity (rotate plane-polarized light in opposite directions).
PepDraw Visualization
PepDraw renders stereochemistry using wedge-dash notation: solid wedges indicate bonds coming out of the plane, dashed wedges indicate bonds going into the plane. This shows the 3D arrangement around chiral centers.
⚠️ R/S vs L/D Nomenclature

There are two systems for naming stereochemistry:

  • L/D system: Based on Fischer projections, compares to glyceraldehyde. Biochemists use this for amino acids.
  • R/S system (Cahn-Ingold-Prelog): Based on priority rules for substituents. Chemists use this universally.

Important: Most L-amino acids are (S), but L-cysteine is (R) because the sulfur in the side chain has higher priority than the carboxyl group! Don't assume L = S.

Side Chain Properties

The 20 standard amino acids differ only in their side chains (R groups), yet these variations create enormous functional diversity. Understanding side chain properties is key to predicting peptide behavior.

Key Side Chain Features

Size and Shape

Side chains range from the smallest (Gly, R = H) to large bulky groups (Trp, with an indole ring).

  • Small: Gly, Ala, Ser (allow tight packing)
  • Medium: Val, Thr, Cys, Pro
  • Large: Phe, Tyr, Trp, Arg (create steric constraints)

Functional impact: Small residues (Gly) provide flexibility; large residues restrict conformations and drive specific folding patterns.

Polarity and Charge

This is the most common classification scheme (covered in detail above). Briefly:

  • Nonpolar: Cannot H-bond, cluster in hydrophobic cores
  • Polar uncharged: Can H-bond, often on protein surfaces
  • Charged (acidic/basic): Drive electrostatic interactions

Hydrogen Bonding Capability

Several side chains can donate or accept hydrogen bonds:

  • H-bond donors: Ser, Thr, Cys, Tyr (–OH groups), Asn, Gln, Trp (–NH groups)
  • H-bond acceptors: Asp, Glu (–COO⁻), Asn, Gln (C=O), Ser, Thr, Tyr (–O⁻ at high pH)
  • Both: His (imidazole can donate and accept)

Functional impact: H-bonding stabilizes secondary structure, mediates protein-protein interactions, and is critical in enzyme active sites.

Special Functional Groups

Some amino acids have unique reactive groups:

Thiol (Cys)

• Can form disulfide bonds (Cys-S-S-Cys)
• Nucleophilic at high pH (thiolate, –S⁻)
• Redox active (oxidation ↔ reduction)
• pKa ~8.3, partially deprotonated at pH 7.4

Phenol (Tyr)

• Weakly acidic (pKa ~10.1)
• Can be phosphorylated (PTM)
• Absorbs UV light (280 nm)
• Neutral at physiological pH

Imidazole (His)

• pKa ~6.0 (near physiological pH!)
• Acts as both acid and base
• pH sensor in proteins
• Essential in many enzyme active sites

Indole (Trp)

• Largest standard side chain
• Strongly absorbs UV (280 nm, ε = 5,500)
• Hydrophobic but can H-bond
• Fluorescent (intrinsic fluorescence)

Guanidinium (Arg)

• Always protonated (pKa ~12.5)
• Delocalized positive charge
• Forms strong salt bridges
• Critical for DNA/RNA binding

Thioether (Met)

• Sulfur but NOT ionizable
• Susceptible to oxidation
• Hydrophobic character
• Important in protein folding

Post-Translational Modifications (PTMs)

Certain amino acids serve as sites for chemical modifications that expand protein functionality:

Phosphorylation
Sites: Ser, Thr, Tyr
Effect: Adds negative charge, regulatory switch
Acetylation
Sites: Lys, N-terminus
Effect: Removes positive charge, affects DNA binding
Methylation
Sites: Lys, Arg
Effect: Modulates charge, regulates gene expression
Glycosylation
Sites: Asn (N-glyc), Ser/Thr (O-glyc)
Effect: Adds mass/bulk, affects solubility and stability
Ubiquitination
Sites: Lys
Effect: Targets proteins for degradation
Hydroxylation
Sites: Pro, Lys
Effect: Stabilizes collagen triple helix

Interactive Tools

Use these calculators to explore amino acid and peptide properties:

⚖️ Calculator 1: Molecular Weight

Calculate the exact molecular weight from a peptide sequence:

📊 Calculator 2: Amino Acid Composition

Analyze the amino acid composition of your peptide:

References

1. Foundational Textbook
Nelson, D. L., & Cox, M. M. (2017). Lehninger Principles of Biochemistry (7th ed.). W. H. Freeman. Chapters 3-4: Amino Acids, Peptides, and Proteins.
2. Amino Acid Properties
Creighton, T. E. (1993). Proteins: Structures and Molecular Properties (2nd ed.). W. H. Freeman. Chapter 1: Amino Acids and the Primary Structures of Proteins.
3. Stereochemistry
IUPAC-IUB Joint Commission on Biochemical Nomenclature (1983). Nomenclature and Symbolism for Amino Acids and Peptides. European Journal of Biochemistry, 138(1), 9-37. doi:10.1111/j.1432-1033.1984.tb07877.x
4. Peptide Bond Properties
Pauling, L., Corey, R. B., & Branson, H. R. (1951). The structure of proteins: two hydrogen-bonded helical configurations of the polypeptide chain. Proceedings of the National Academy of Sciences, 37(4), 205-211. doi:10.1073/pnas.37.4.205
5. Post-Translational Modifications
Walsh, C. T., Garneau-Tsodikova, S., & Gatto, G. J. (2005). Protein posttranslational modifications: the chemistry of proteome diversifications. Angewandte Chemie International Edition, 44(45), 7342-7372. doi:10.1002/anie.200501023

Continue Learning

Explore more topics in peptide chemistry and analysis: