Bio.Data.CodonTable module

Codon tables based on those from the NCBI.

These tables are based on parsing the NCBI file ftp://ftp.ncbi.nih.gov/entrez/misc/data/gc.prt using Scripts/update_ncbi_codon_table.py

Last updated at Version 4.4 (May 2019)

exception Bio.Data.CodonTable.TranslationError

Bases: Exception

Container for translation specific exceptions.

class Bio.Data.CodonTable.CodonTable(nucleotide_alphabet: str | None = None, protein_alphabet: str | None = None, forward_table: dict[str, str] = forward_table, back_table: dict[str, str] = back_table, start_codons: list[str] = start_codons, stop_codons: list[str] = stop_codons)

Bases: object

A codon-table, or genetic code.

__init__(nucleotide_alphabet: str | None = None, protein_alphabet: str | None = None, forward_table: dict[str, str] = forward_table, back_table: dict[str, str] = back_table, start_codons: list[str] = start_codons, stop_codons: list[str] = stop_codons) → None: Initialize the class.

forward_table: dict[str, str] = {}

back_table: dict[str, str] = {}

start_codons: list[str] = []

stop_codons: list[str] = []

__str__()

Return a simple text representation of the codon table.

e.g.:

>>> import Bio.Data.CodonTable
>>> print(Bio.Data.CodonTable.standard_dna_table)
Table 1 Standard, SGC0

  |  T      |  C      |  A      |  G      |
--+---------+---------+---------+---------+--
T | TTT F   | TCT S   | TAT Y   | TGT C   | T
T | TTC F   | TCC S   | TAC Y   | TGC C   | C
...
G | GTA V   | GCA A   | GAA E   | GGA G   | A
G | GTG V   | GCG A   | GAG E   | GGG G   | G
--+---------+---------+---------+---------+--
>>> print(Bio.Data.CodonTable.generic_by_id[1])
Table 1 Standard, SGC0

  |  U      |  C      |  A      |  G      |
--+---------+---------+---------+---------+--
U | UUU F   | UCU S   | UAU Y   | UGU C   | U
U | UUC F   | UCC S   | UAC Y   | UGC C   | C
...
G | GUA V   | GCA A   | GAA E   | GGA G   | A
G | GUG V   | GCG A   | GAG E   | GGG G   | G
--+---------+---------+---------+---------+--

__annotations__ = {'back_table': dict[str, str], 'forward_table': dict[str, str], 'start_codons': list[str], 'stop_codons': list[str]}

Bio.Data.CodonTable.make_back_table(table, default_stop_codon)

Back a back-table (naive single codon mapping).

ONLY RETURNS A SINGLE CODON, chosen from the possible alternatives based on their sort order.

class Bio.Data.CodonTable.NCBICodonTable(id, names, table, start_codons, stop_codons)

Bases: CodonTable

Codon table for generic nucleotide sequences.

nucleotide_alphabet: str | None = None

protein_alphabet = 'ACDEFGHIKLMNPQRSTVWY'

__init__(id, names, table, start_codons, stop_codons): Initialize the class.

__repr__(): Represent the NCBI codon table class as a string for debugging.

__annotations__ = {'back_table': 'dict[str, str]', 'forward_table': 'dict[str, str]', 'nucleotide_alphabet': str | None, 'start_codons': 'list[str]', 'stop_codons': 'list[str]'}

class Bio.Data.CodonTable.NCBICodonTableDNA(id, names, table, start_codons, stop_codons)

Bases: NCBICodonTable

Codon table for unambiguous DNA sequences.

nucleotide_alphabet: str | None = 'GATC'

__annotations__ = {'back_table': 'dict[str, str]', 'forward_table': 'dict[str, str]', 'nucleotide_alphabet': 'str | None', 'start_codons': 'list[str]', 'stop_codons': 'list[str]'}

class Bio.Data.CodonTable.NCBICodonTableRNA(id, names, table, start_codons, stop_codons)

Bases: NCBICodonTable

Codon table for unambiguous RNA sequences.

nucleotide_alphabet: str | None = 'GAUC'

__annotations__ = {'back_table': 'dict[str, str]', 'forward_table': 'dict[str, str]', 'nucleotide_alphabet': 'str | None', 'start_codons': 'list[str]', 'stop_codons': 'list[str]'}

class Bio.Data.CodonTable.AmbiguousCodonTable(codon_table, ambiguous_nucleotide_alphabet, ambiguous_nucleotide_values, ambiguous_protein_alphabet, ambiguous_protein_values)

Bases: CodonTable

Base codon table for ambiguous sequences.

__init__(codon_table, ambiguous_nucleotide_alphabet, ambiguous_nucleotide_values, ambiguous_protein_alphabet, ambiguous_protein_values): Initialize the class.

__getattr__(name): Forward attribute lookups to the original table.

__annotations__ = {}

Bio.Data.CodonTable.list_possible_proteins(codon, forward_table, ambiguous_nucleotide_values): Return all possible encoded amino acids for ambiguous codon.

Bio.Data.CodonTable.list_ambiguous_codons(codons, ambiguous_nucleotide_values)

Extend a codon list to include all possible ambiguous codons.

e.g.:

['TAG', 'TAA'] -> ['TAG', 'TAA', 'TAR']
['UAG', 'UGA'] -> ['UAG', 'UGA', 'URA']

Note that [‘TAG’, ‘TGA’] -> [‘TAG’, ‘TGA’], this does not add ‘TRR’ (which could also mean ‘TAA’ or ‘TGG’). Thus only two more codons are added in the following:

e.g.:

['TGA', 'TAA', 'TAG'] -> ['TGA', 'TAA', 'TAG', 'TRA', 'TAR']

Returns a new (longer) list of codon strings.

class Bio.Data.CodonTable.AmbiguousForwardTable(forward_table, ambiguous_nucleotide, ambiguous_protein)

Bases: object

Forward table for translation of ambiguous nucleotide sequences.

__init__(forward_table, ambiguous_nucleotide, ambiguous_protein): Initialize the class.

__contains__(codon)

Check if codon works as key for ambiguous forward_table.

Only returns ‘True’ if forward_table[codon] returns a value.

get(codon, failobj=None): Implement get for dictionary-like behaviour.

__getitem__(codon)

Implement dictionary-like behaviour for AmbiguousForwardTable.

forward_table[codon] will either return an amino acid letter, or throws a KeyError (if codon does not encode an amino acid) or a TranslationError (if codon does encode for an amino acid, but either is also a stop codon or does encode several amino acids, for which no unique letter is available in the given alphabet.

Bio.Data.CodonTable.register_ncbi_table(name, alt_name, id, table, start_codons, stop_codons)

Turn codon table data into objects (PRIVATE).

The data is stored in the dictionaries.