Bio.Alphabet.IUPAC module¶

Standard nucleotide and protein alphabets defined by IUPAC.

class Bio.Alphabet.IUPAC.ExtendedIUPACProtein¶

Extended uppercase IUPAC protein single letter alphabet including X etc.

In addition to the standard 20 single letter protein codes, this includes:

This alphabet is not intended to be used with X for Selenocysteine (an ad-hoc standard prior to the IUPAC adoption of U instead).

class Bio.Alphabet.IUPAC.IUPACProtein¶

Bases: Bio.Alphabet.IUPAC.ExtendedIUPACProtein

IUPAC protein alphabet of the 20 standard amino acids.

Uppercase and single letter.

class Bio.Alphabet.IUPAC.IUPACAmbiguousDNA¶

Uppercase IUPAC ambiguous DNA.

class Bio.Alphabet.IUPAC.IUPACUnambiguousDNA¶

Uppercase IUPAC unambiguous DNA (letters GATC only).

class Bio.Alphabet.IUPAC.ExtendedIUPACDNA¶

Bases: Bio.Alphabet.DNAAlphabet

Extended IUPAC DNA alphabet.

In addition to the standard letter codes GATC, this includes:

class Bio.Alphabet.IUPAC.IUPACAmbiguousRNA¶

Uppercase IUPAC ambiguous RNA.

class Bio.Alphabet.IUPAC.IUPACUnambiguousRNA¶

Uppercase IUPAC unambiguous RNA (letters GAUC only).