Accurate peptide description requires standardized nomenclature. Miscommunication about sequences, modifications, or structures can lead to failed experiments, wasted resources, and irreproducible results. This guide covers the conventions used in peptide science for clear, unambiguous communication.
Amino Acid Nomenclature
Standard Amino Acids
**Three-Letter Codes:**
Amino AcidThree-LetterOne-Letter
|------------|--------------|------------|
AlanineAlaA
ArginineArgR
AsparagineAsnN
Aspartic AcidAspD
CysteineCysC
Glutamic AcidGluE
GlutamineGlnQ
GlycineGlyG
HistidineHisH
IsoleucineIleI
LeucineLeuL
LysineLysK
MethionineMetM
PhenylalaninePheF
ProlineProP
SerineSerS
ThreonineThrT
TryptophanTrpW
TyrosineTyrY
ValineValV
**Ambiguous Codes:**
CodeMeaning
|------|---------|
Asx (B)Asn or Asp
Glx (Z)Gln or Glu
Xaa (X)Any amino acid
Writing Peptide Sequences
**Convention:**
N-terminus on left, C-terminus on rightBy default, N-terminus is free amine, C-terminus is free acid**Three-Letter Format:**
Hyphenated: Ala-Gly-Phe-Leu-ArgNo terminal indication = free termini**One-Letter Format:**
No separators: AGFLRCompact for longer sequences**Indicating Chain Direction:**
H-Ala-Gly-Phe-OH (explicit free termini)NH2-AGFL-COOH (alternative)Terminal Modifications
N-Terminal Modifications
**Acetylation:**
Ac-Ala-Gly-Phe or Ac-AGFCH3CO- group**Formylation:**
For-Ala-Gly-Phe or fMet (special case)CHO- group**Pyroglutamic Acid:**
pGlu- or pECyclized glutamate**Biotin:**
Biotin-Ahx-Ala-Gly (with spacer)Bio-Ala-Gly**Fluorescent Labels:**
FITC-Ala-GlyFluorescein-Ahx-Ala-Gly5-FAM-Ala-Gly**Fatty Acids:**
Myr-Ala-Gly (myristoyl)Pal-Ala-Gly (palmitoyl)C-Terminal Modifications
**Amidation:**
Ala-Gly-Phe-NH2 or AGF-NH2Very common, mimics natural peptides**Specific Amides:**
Ala-Gly-Phe-NHMe (N-methylamide)Ala-Gly-Phe-NEt (ethylamide)**Esters:**
Ala-Gly-Phe-OMe (methyl ester)Ala-Gly-Phe-OEt (ethyl ester)**Aldehyde:**
Ala-Gly-Arg-H or Ala-Gly-Arg-CHOFor protease inhibitorsSide Chain Modifications
Phosphorylation
**Phosphoserine:**
pSer or S(PO3H2) or SepSer(P)**Phosphothreonine:**
pThr or T(PO3H2) or Thr(P)**Phosphotyrosine:**
pTyr or Y(PO3H2) or Tyr(P)**In Sequence:**
Ala-pSer-Gly or Ala-S(PO3H2)-GlyMethylation
**Lysine Methylation:**
Lys(Me) = monomethylLys(Me2) = dimethylLys(Me3) = trimethyl**Arginine Methylation:**
Arg(Me) = monomethylArg(Me2-a) = asymmetric dimethylArg(Me2-s) = symmetric dimethylAcetylation
**Lysine Acetylation:**
Lys(Ac) or K(Ac)Glycosylation
**General:**
Ser(GalNAc) or Thr(GalNAc)Asn(GlcNAc) for N-linkedOther Modifications
Cys(Acm) = acetamidomethyl protectedLys(Biotin) = biotinylated lysineTyr(NO2) = nitrotyrosineMet(O) = methionine sulfoxideNon-Standard Amino Acids
D-Amino Acids
**Convention:**
Lowercase: d-Ala or D-AlaIn sequence: D-Ala-Gly-Phe or dA-G-FBeta-Amino Acids
beta-Ala or bAlaDifferent backbone connectivityUnusual Amino Acids
AbbreviationFull Name
|--------------|-----------|
NleNorleucine
OrnOrnithine
CitCitrulline
HseHomoserine
SarSarcosine (N-methylglycine)
Aib2-aminoisobutyric acid
NvaNorvaline
Abu2-aminobutyric acid
Spacers and Linkers
Ahx = 6-aminohexanoic acidAva = 5-aminovaleric acidbetaAla = beta-alaninePEGn = polyethylene glycol (n units)Cyclic Peptides
Head-to-Tail Cyclization
cyclo(Ala-Gly-Phe-Leu-Arg)cyclo(-AGFLR-)c(AGFLR)Side Chain Cyclization
**Disulfide:**
Cys(1)-Ala-Gly-Cys(1) (bridge notation)Indicates which Cys residues are paired**Lactam:**
cyclo(Glu-Ala-Lys) [side chain to side chain]Specify Glu-Lys connectivity**Thioether:**
Indicate with appropriate notationMultiple Bridges
Use superscript numbers:
Cys1-Ala-Cys2-Gly-Cys1-Phe-Cys2Numbers indicate pairingDisulfide Bond Notation
Single Disulfide
Parenthetical: C(S-S)CSuperscript: Cys1-Ala-Cys1Multiple Disulfides
**Sequential Numbering:**
Cys1-Ala-Cys2-Gly-Cys2-Phe-Cys1Same number = bonded pair**Explicit Notation:**
[Cys3-Cys15] [Cys8-Cys22]List of bonded pairsPosition Numbering
Standard Convention
Number from N-terminusFirst residue = 1 (or 0 in some systems)Include position when discussing specific residuesExamples
Ser8 = serine at position 8K12R = lysine 12 replaced with argininepS15 = phosphoserine at position 15Fragment Notation
For partial sequences from larger proteins:
Peptide(1-20) = first 20 residuesProtein(45-60) = residues 45-60Mutation Nomenclature
Point Mutations
Original-Position-New: K12RPosition can include protein name: EGFR-K12RMultiple Mutations
Separate with commas: K12R, S15A, G20VOr slashes: K12R/S15A/G20VDeletions
del or delta symboldeltaK12 or K12delInsertions
ins: 12insA (alanine inserted after position 12)Mass and Formula Conventions
Molecular Weight Reporting
Monoisotopic mass: Based on most abundant isotopesAverage mass: Weighted average of isotopesSpecify which is being reportedChemical Formula
Include all atomsAccount for terminal groupsStandard format: C, H, N, O, S, othersBest Practices
For Ordering Peptides
Include:
Complete sequence with termini specifiedAll modifications explicitly statedDisulfide connectivity if applicableDesired purity and quantityFor Publications
Define non-standard abbreviationsSpecify sequence directionIndicate modifications unambiguouslyProvide complete sequence in methodsCommon Errors to Avoid
Assuming termini (always specify)Ambiguous modification placementInconsistent D-amino acid notationUnclear disulfide connectivityConclusion
Standardized peptide nomenclature ensures clear communication throughout the research community. Proper notation of sequences, modifications, and structures prevents costly errors and enables reproducibility. When in doubt, err on the side of explicit detail rather than assuming shared understanding.