Bio.Blast package
Submodules
- Bio.Blast.Applications module
NcbiblastpCommandlineNcbiblastpCommandline.__init__()NcbiblastpCommandline.best_hit_overhangNcbiblastpCommandline.best_hit_score_edgeNcbiblastpCommandline.comp_based_statsNcbiblastpCommandline.culling_limitNcbiblastpCommandline.dbNcbiblastpCommandline.db_hard_maskNcbiblastpCommandline.db_soft_maskNcbiblastpCommandline.dbsizeNcbiblastpCommandline.entrez_queryNcbiblastpCommandline.evalueNcbiblastpCommandline.export_search_strategyNcbiblastpCommandline.gapextendNcbiblastpCommandline.gapopenNcbiblastpCommandline.gilistNcbiblastpCommandline.hNcbiblastpCommandline.helpNcbiblastpCommandline.htmlNcbiblastpCommandline.import_search_strategyNcbiblastpCommandline.lcase_maskingNcbiblastpCommandline.line_lengthNcbiblastpCommandline.matrixNcbiblastpCommandline.max_hspsNcbiblastpCommandline.max_hsps_per_subjectNcbiblastpCommandline.max_target_seqsNcbiblastpCommandline.negative_gilistNcbiblastpCommandline.negative_seqidlistNcbiblastpCommandline.num_alignmentsNcbiblastpCommandline.num_descriptionsNcbiblastpCommandline.num_threadsNcbiblastpCommandline.outNcbiblastpCommandline.outfmtNcbiblastpCommandline.parse_deflinesNcbiblastpCommandline.qcov_hsp_percNcbiblastpCommandline.queryNcbiblastpCommandline.query_locNcbiblastpCommandline.remoteNcbiblastpCommandline.searchspNcbiblastpCommandline.segNcbiblastpCommandline.seqidlistNcbiblastpCommandline.show_gisNcbiblastpCommandline.soft_maskingNcbiblastpCommandline.subjectNcbiblastpCommandline.subject_locNcbiblastpCommandline.sum_statisticsNcbiblastpCommandline.sum_statsNcbiblastpCommandline.taskNcbiblastpCommandline.thresholdNcbiblastpCommandline.ungappedNcbiblastpCommandline.use_sw_tbackNcbiblastpCommandline.versionNcbiblastpCommandline.window_sizeNcbiblastpCommandline.word_sizeNcbiblastpCommandline.xdrop_gapNcbiblastpCommandline.xdrop_gap_finalNcbiblastpCommandline.xdrop_ungap
NcbiblastnCommandlineNcbiblastnCommandline.__init__()NcbiblastnCommandline.best_hit_overhangNcbiblastnCommandline.best_hit_score_edgeNcbiblastnCommandline.culling_limitNcbiblastnCommandline.dbNcbiblastnCommandline.db_hard_maskNcbiblastnCommandline.db_soft_maskNcbiblastnCommandline.dbsizeNcbiblastnCommandline.dustNcbiblastnCommandline.entrez_queryNcbiblastnCommandline.evalueNcbiblastnCommandline.export_search_strategyNcbiblastnCommandline.filtering_dbNcbiblastnCommandline.gapextendNcbiblastnCommandline.gapopenNcbiblastnCommandline.gilistNcbiblastnCommandline.hNcbiblastnCommandline.helpNcbiblastnCommandline.htmlNcbiblastnCommandline.import_search_strategyNcbiblastnCommandline.index_nameNcbiblastnCommandline.lcase_maskingNcbiblastnCommandline.line_lengthNcbiblastnCommandline.max_hspsNcbiblastnCommandline.max_hsps_per_subjectNcbiblastnCommandline.max_target_seqsNcbiblastnCommandline.min_raw_gapped_scoreNcbiblastnCommandline.negative_gilistNcbiblastnCommandline.negative_seqidlistNcbiblastnCommandline.no_greedyNcbiblastnCommandline.num_alignmentsNcbiblastnCommandline.num_descriptionsNcbiblastnCommandline.num_threadsNcbiblastnCommandline.off_diagonal_rangeNcbiblastnCommandline.outNcbiblastnCommandline.outfmtNcbiblastnCommandline.parse_deflinesNcbiblastnCommandline.penaltyNcbiblastnCommandline.perc_identityNcbiblastnCommandline.qcov_hsp_percNcbiblastnCommandline.queryNcbiblastnCommandline.query_locNcbiblastnCommandline.remoteNcbiblastnCommandline.rewardNcbiblastnCommandline.searchspNcbiblastnCommandline.seqidlistNcbiblastnCommandline.show_gisNcbiblastnCommandline.soft_maskingNcbiblastnCommandline.strandNcbiblastnCommandline.subjectNcbiblastnCommandline.subject_locNcbiblastnCommandline.sum_statisticsNcbiblastnCommandline.sum_statsNcbiblastnCommandline.taskNcbiblastnCommandline.template_lengthNcbiblastnCommandline.template_typeNcbiblastnCommandline.ungappedNcbiblastnCommandline.use_indexNcbiblastnCommandline.versionNcbiblastnCommandline.window_masker_dbNcbiblastnCommandline.window_masker_taxidNcbiblastnCommandline.window_sizeNcbiblastnCommandline.word_sizeNcbiblastnCommandline.xdrop_gapNcbiblastnCommandline.xdrop_gap_finalNcbiblastnCommandline.xdrop_ungap
NcbiblastxCommandlineNcbiblastxCommandline.__init__()NcbiblastxCommandline.best_hit_overhangNcbiblastxCommandline.best_hit_score_edgeNcbiblastxCommandline.comp_based_statsNcbiblastxCommandline.culling_limitNcbiblastxCommandline.dbNcbiblastxCommandline.db_hard_maskNcbiblastxCommandline.db_soft_maskNcbiblastxCommandline.dbsizeNcbiblastxCommandline.entrez_queryNcbiblastxCommandline.evalueNcbiblastxCommandline.export_search_strategyNcbiblastxCommandline.frame_shift_penaltyNcbiblastxCommandline.gapextendNcbiblastxCommandline.gapopenNcbiblastxCommandline.gilistNcbiblastxCommandline.hNcbiblastxCommandline.helpNcbiblastxCommandline.htmlNcbiblastxCommandline.import_search_strategyNcbiblastxCommandline.lcase_maskingNcbiblastxCommandline.line_lengthNcbiblastxCommandline.matrixNcbiblastxCommandline.max_hspsNcbiblastxCommandline.max_hsps_per_subjectNcbiblastxCommandline.max_intron_lengthNcbiblastxCommandline.max_target_seqsNcbiblastxCommandline.negative_gilistNcbiblastxCommandline.negative_seqidlistNcbiblastxCommandline.num_alignmentsNcbiblastxCommandline.num_descriptionsNcbiblastxCommandline.num_threadsNcbiblastxCommandline.outNcbiblastxCommandline.outfmtNcbiblastxCommandline.parse_deflinesNcbiblastxCommandline.qcov_hsp_percNcbiblastxCommandline.queryNcbiblastxCommandline.query_gencodeNcbiblastxCommandline.query_locNcbiblastxCommandline.remoteNcbiblastxCommandline.searchspNcbiblastxCommandline.segNcbiblastxCommandline.seqidlistNcbiblastxCommandline.show_gisNcbiblastxCommandline.soft_maskingNcbiblastxCommandline.strandNcbiblastxCommandline.subjectNcbiblastxCommandline.subject_locNcbiblastxCommandline.sum_statisticsNcbiblastxCommandline.sum_statsNcbiblastxCommandline.taskNcbiblastxCommandline.thresholdNcbiblastxCommandline.ungappedNcbiblastxCommandline.use_sw_tbackNcbiblastxCommandline.versionNcbiblastxCommandline.window_sizeNcbiblastxCommandline.word_sizeNcbiblastxCommandline.xdrop_gapNcbiblastxCommandline.xdrop_gap_finalNcbiblastxCommandline.xdrop_ungap
NcbitblastnCommandlineNcbitblastnCommandline.__init__()NcbitblastnCommandline.best_hit_overhangNcbitblastnCommandline.best_hit_score_edgeNcbitblastnCommandline.comp_based_statsNcbitblastnCommandline.culling_limitNcbitblastnCommandline.dbNcbitblastnCommandline.db_gencodeNcbitblastnCommandline.db_hard_maskNcbitblastnCommandline.db_soft_maskNcbitblastnCommandline.dbsizeNcbitblastnCommandline.entrez_queryNcbitblastnCommandline.evalueNcbitblastnCommandline.export_search_strategyNcbitblastnCommandline.frame_shift_penaltyNcbitblastnCommandline.gapextendNcbitblastnCommandline.gapopenNcbitblastnCommandline.gilistNcbitblastnCommandline.hNcbitblastnCommandline.helpNcbitblastnCommandline.htmlNcbitblastnCommandline.import_search_strategyNcbitblastnCommandline.in_pssmNcbitblastnCommandline.lcase_maskingNcbitblastnCommandline.line_lengthNcbitblastnCommandline.matrixNcbitblastnCommandline.max_hspsNcbitblastnCommandline.max_hsps_per_subjectNcbitblastnCommandline.max_intron_lengthNcbitblastnCommandline.max_target_seqsNcbitblastnCommandline.negative_gilistNcbitblastnCommandline.negative_seqidlistNcbitblastnCommandline.num_alignmentsNcbitblastnCommandline.num_descriptionsNcbitblastnCommandline.num_threadsNcbitblastnCommandline.outNcbitblastnCommandline.outfmtNcbitblastnCommandline.parse_deflinesNcbitblastnCommandline.qcov_hsp_percNcbitblastnCommandline.queryNcbitblastnCommandline.query_locNcbitblastnCommandline.remoteNcbitblastnCommandline.searchspNcbitblastnCommandline.segNcbitblastnCommandline.seqidlistNcbitblastnCommandline.show_gisNcbitblastnCommandline.soft_maskingNcbitblastnCommandline.subjectNcbitblastnCommandline.subject_locNcbitblastnCommandline.sum_statisticsNcbitblastnCommandline.sum_statsNcbitblastnCommandline.taskNcbitblastnCommandline.thresholdNcbitblastnCommandline.ungappedNcbitblastnCommandline.use_sw_tbackNcbitblastnCommandline.versionNcbitblastnCommandline.window_sizeNcbitblastnCommandline.word_sizeNcbitblastnCommandline.xdrop_gapNcbitblastnCommandline.xdrop_gap_finalNcbitblastnCommandline.xdrop_ungap
NcbitblastxCommandlineNcbitblastxCommandline.__init__()NcbitblastxCommandline.best_hit_overhangNcbitblastxCommandline.best_hit_score_edgeNcbitblastxCommandline.culling_limitNcbitblastxCommandline.dbNcbitblastxCommandline.db_gencodeNcbitblastxCommandline.db_hard_maskNcbitblastxCommandline.db_soft_maskNcbitblastxCommandline.dbsizeNcbitblastxCommandline.entrez_queryNcbitblastxCommandline.evalueNcbitblastxCommandline.export_search_strategyNcbitblastxCommandline.gapextendNcbitblastxCommandline.gapopenNcbitblastxCommandline.gilistNcbitblastxCommandline.hNcbitblastxCommandline.helpNcbitblastxCommandline.htmlNcbitblastxCommandline.import_search_strategyNcbitblastxCommandline.lcase_maskingNcbitblastxCommandline.line_lengthNcbitblastxCommandline.matrixNcbitblastxCommandline.max_hspsNcbitblastxCommandline.max_hsps_per_subjectNcbitblastxCommandline.max_intron_lengthNcbitblastxCommandline.max_target_seqsNcbitblastxCommandline.negative_gilistNcbitblastxCommandline.negative_seqidlistNcbitblastxCommandline.num_alignmentsNcbitblastxCommandline.num_descriptionsNcbitblastxCommandline.num_threadsNcbitblastxCommandline.outNcbitblastxCommandline.outfmtNcbitblastxCommandline.parse_deflinesNcbitblastxCommandline.qcov_hsp_percNcbitblastxCommandline.queryNcbitblastxCommandline.query_gencodeNcbitblastxCommandline.query_locNcbitblastxCommandline.remoteNcbitblastxCommandline.searchspNcbitblastxCommandline.segNcbitblastxCommandline.seqidlistNcbitblastxCommandline.show_gisNcbitblastxCommandline.soft_maskingNcbitblastxCommandline.strandNcbitblastxCommandline.subjectNcbitblastxCommandline.subject_locNcbitblastxCommandline.sum_statisticsNcbitblastxCommandline.sum_statsNcbitblastxCommandline.thresholdNcbitblastxCommandline.versionNcbitblastxCommandline.window_sizeNcbitblastxCommandline.word_sizeNcbitblastxCommandline.xdrop_gapNcbitblastxCommandline.xdrop_gap_finalNcbitblastxCommandline.xdrop_ungap
NcbipsiblastCommandlineNcbipsiblastCommandline.__init__()NcbipsiblastCommandline.best_hit_overhangNcbipsiblastCommandline.best_hit_score_edgeNcbipsiblastCommandline.comp_based_statsNcbipsiblastCommandline.culling_limitNcbipsiblastCommandline.dbNcbipsiblastCommandline.dbsizeNcbipsiblastCommandline.entrez_queryNcbipsiblastCommandline.evalueNcbipsiblastCommandline.export_search_strategyNcbipsiblastCommandline.gap_triggerNcbipsiblastCommandline.gapextendNcbipsiblastCommandline.gapopenNcbipsiblastCommandline.gilistNcbipsiblastCommandline.hNcbipsiblastCommandline.helpNcbipsiblastCommandline.htmlNcbipsiblastCommandline.ignore_msa_masterNcbipsiblastCommandline.import_search_strategyNcbipsiblastCommandline.in_msaNcbipsiblastCommandline.in_pssmNcbipsiblastCommandline.inclusion_ethreshNcbipsiblastCommandline.lcase_maskingNcbipsiblastCommandline.line_lengthNcbipsiblastCommandline.matrixNcbipsiblastCommandline.max_hspsNcbipsiblastCommandline.max_hsps_per_subjectNcbipsiblastCommandline.max_target_seqsNcbipsiblastCommandline.msa_master_idxNcbipsiblastCommandline.negative_gilistNcbipsiblastCommandline.negative_seqidlistNcbipsiblastCommandline.num_alignmentsNcbipsiblastCommandline.num_descriptionsNcbipsiblastCommandline.num_iterationsNcbipsiblastCommandline.num_threadsNcbipsiblastCommandline.outNcbipsiblastCommandline.out_ascii_pssmNcbipsiblastCommandline.out_pssmNcbipsiblastCommandline.outfmtNcbipsiblastCommandline.parse_deflinesNcbipsiblastCommandline.phi_patternNcbipsiblastCommandline.pseudocountNcbipsiblastCommandline.qcov_hsp_percNcbipsiblastCommandline.queryNcbipsiblastCommandline.query_locNcbipsiblastCommandline.remoteNcbipsiblastCommandline.save_each_pssmNcbipsiblastCommandline.save_pssm_after_last_roundNcbipsiblastCommandline.searchspNcbipsiblastCommandline.segNcbipsiblastCommandline.seqidlistNcbipsiblastCommandline.show_gisNcbipsiblastCommandline.soft_maskingNcbipsiblastCommandline.subjectNcbipsiblastCommandline.subject_locNcbipsiblastCommandline.sum_statisticsNcbipsiblastCommandline.sum_statsNcbipsiblastCommandline.thresholdNcbipsiblastCommandline.use_sw_tbackNcbipsiblastCommandline.versionNcbipsiblastCommandline.window_sizeNcbipsiblastCommandline.word_sizeNcbipsiblastCommandline.xdrop_gapNcbipsiblastCommandline.xdrop_gap_finalNcbipsiblastCommandline.xdrop_ungap
NcbirpsblastCommandlineNcbirpsblastCommandline.__init__()NcbirpsblastCommandline.best_hit_overhangNcbirpsblastCommandline.best_hit_score_edgeNcbirpsblastCommandline.comp_based_statsNcbirpsblastCommandline.culling_limitNcbirpsblastCommandline.dbNcbirpsblastCommandline.dbsizeNcbirpsblastCommandline.entrez_queryNcbirpsblastCommandline.evalueNcbirpsblastCommandline.export_search_strategyNcbirpsblastCommandline.gilistNcbirpsblastCommandline.hNcbirpsblastCommandline.helpNcbirpsblastCommandline.htmlNcbirpsblastCommandline.import_search_strategyNcbirpsblastCommandline.lcase_maskingNcbirpsblastCommandline.line_lengthNcbirpsblastCommandline.max_hspsNcbirpsblastCommandline.max_hsps_per_subjectNcbirpsblastCommandline.max_target_seqsNcbirpsblastCommandline.negative_gilistNcbirpsblastCommandline.negative_seqidlistNcbirpsblastCommandline.num_alignmentsNcbirpsblastCommandline.num_descriptionsNcbirpsblastCommandline.num_threadsNcbirpsblastCommandline.outNcbirpsblastCommandline.outfmtNcbirpsblastCommandline.parse_deflinesNcbirpsblastCommandline.qcov_hsp_percNcbirpsblastCommandline.queryNcbirpsblastCommandline.query_locNcbirpsblastCommandline.remoteNcbirpsblastCommandline.searchspNcbirpsblastCommandline.segNcbirpsblastCommandline.seqidlistNcbirpsblastCommandline.show_gisNcbirpsblastCommandline.soft_maskingNcbirpsblastCommandline.sum_statisticsNcbirpsblastCommandline.sum_statsNcbirpsblastCommandline.use_sw_tbackNcbirpsblastCommandline.versionNcbirpsblastCommandline.window_sizeNcbirpsblastCommandline.word_sizeNcbirpsblastCommandline.xdrop_gapNcbirpsblastCommandline.xdrop_gap_finalNcbirpsblastCommandline.xdrop_ungap
NcbirpstblastnCommandlineNcbirpstblastnCommandline.__init__()NcbirpstblastnCommandline.comp_based_statsNcbirpstblastnCommandline.dbNcbirpstblastnCommandline.dbsizeNcbirpstblastnCommandline.entrez_queryNcbirpstblastnCommandline.evalueNcbirpstblastnCommandline.export_search_strategyNcbirpstblastnCommandline.gilistNcbirpstblastnCommandline.hNcbirpstblastnCommandline.helpNcbirpstblastnCommandline.htmlNcbirpstblastnCommandline.import_search_strategyNcbirpstblastnCommandline.lcase_maskingNcbirpstblastnCommandline.line_lengthNcbirpstblastnCommandline.max_hspsNcbirpstblastnCommandline.max_hsps_per_subjectNcbirpstblastnCommandline.max_target_seqsNcbirpstblastnCommandline.negative_gilistNcbirpstblastnCommandline.negative_seqidlistNcbirpstblastnCommandline.num_alignmentsNcbirpstblastnCommandline.num_descriptionsNcbirpstblastnCommandline.num_threadsNcbirpstblastnCommandline.outNcbirpstblastnCommandline.outfmtNcbirpstblastnCommandline.parse_deflinesNcbirpstblastnCommandline.qcov_hsp_percNcbirpstblastnCommandline.queryNcbirpstblastnCommandline.query_gencodeNcbirpstblastnCommandline.query_locNcbirpstblastnCommandline.remoteNcbirpstblastnCommandline.searchspNcbirpstblastnCommandline.segNcbirpstblastnCommandline.seqidlistNcbirpstblastnCommandline.show_gisNcbirpstblastnCommandline.soft_maskingNcbirpstblastnCommandline.strandNcbirpstblastnCommandline.sum_statisticsNcbirpstblastnCommandline.sum_statsNcbirpstblastnCommandline.ungappedNcbirpstblastnCommandline.use_sw_tbackNcbirpstblastnCommandline.versionNcbirpstblastnCommandline.window_sizeNcbirpstblastnCommandline.word_sizeNcbirpstblastnCommandline.xdrop_gapNcbirpstblastnCommandline.xdrop_gap_finalNcbirpstblastnCommandline.xdrop_ungap
NcbiblastformatterCommandlineNcbiblastformatterCommandline.__init__()NcbiblastformatterCommandline.archiveNcbiblastformatterCommandline.hNcbiblastformatterCommandline.helpNcbiblastformatterCommandline.htmlNcbiblastformatterCommandline.line_lengthNcbiblastformatterCommandline.max_target_seqsNcbiblastformatterCommandline.num_alignmentsNcbiblastformatterCommandline.num_descriptionsNcbiblastformatterCommandline.outNcbiblastformatterCommandline.outfmtNcbiblastformatterCommandline.parse_deflinesNcbiblastformatterCommandline.ridNcbiblastformatterCommandline.show_gisNcbiblastformatterCommandline.version
NcbideltablastCommandlineNcbideltablastCommandline.__init__()NcbideltablastCommandline.best_hit_overhangNcbideltablastCommandline.best_hit_score_edgeNcbideltablastCommandline.comp_based_statsNcbideltablastCommandline.culling_limitNcbideltablastCommandline.dbNcbideltablastCommandline.dbsizeNcbideltablastCommandline.domain_inclusion_ethreshNcbideltablastCommandline.entrez_queryNcbideltablastCommandline.evalueNcbideltablastCommandline.export_search_strategyNcbideltablastCommandline.gap_triggerNcbideltablastCommandline.gapextendNcbideltablastCommandline.gapopenNcbideltablastCommandline.gilistNcbideltablastCommandline.hNcbideltablastCommandline.helpNcbideltablastCommandline.htmlNcbideltablastCommandline.import_search_strategyNcbideltablastCommandline.inclusion_ethreshNcbideltablastCommandline.lcase_maskingNcbideltablastCommandline.line_lengthNcbideltablastCommandline.matrixNcbideltablastCommandline.max_hspsNcbideltablastCommandline.max_hsps_per_subjectNcbideltablastCommandline.max_target_seqsNcbideltablastCommandline.negative_gilistNcbideltablastCommandline.negative_seqidlistNcbideltablastCommandline.num_alignmentsNcbideltablastCommandline.num_descriptionsNcbideltablastCommandline.num_iterationsNcbideltablastCommandline.num_threadsNcbideltablastCommandline.outNcbideltablastCommandline.out_ascii_pssmNcbideltablastCommandline.out_pssmNcbideltablastCommandline.outfmtNcbideltablastCommandline.parse_deflinesNcbideltablastCommandline.pseudocountNcbideltablastCommandline.qcov_hsp_percNcbideltablastCommandline.queryNcbideltablastCommandline.query_locNcbideltablastCommandline.remoteNcbideltablastCommandline.rpsdbNcbideltablastCommandline.save_each_pssmNcbideltablastCommandline.save_pssm_after_last_roundNcbideltablastCommandline.searchspNcbideltablastCommandline.segNcbideltablastCommandline.seqidlistNcbideltablastCommandline.show_domain_hitsNcbideltablastCommandline.show_gisNcbideltablastCommandline.soft_maskingNcbideltablastCommandline.subjectNcbideltablastCommandline.subject_locNcbideltablastCommandline.sum_statisticsNcbideltablastCommandline.sum_statsNcbideltablastCommandline.thresholdNcbideltablastCommandline.use_sw_tbackNcbideltablastCommandline.versionNcbideltablastCommandline.window_sizeNcbideltablastCommandline.word_sizeNcbideltablastCommandline.xdrop_gapNcbideltablastCommandline.xdrop_gap_finalNcbideltablastCommandline.xdrop_ungap
NcbimakeblastdbCommandlineNcbimakeblastdbCommandline.__init__()NcbimakeblastdbCommandline.blastdb_versionNcbimakeblastdbCommandline.dbtypeNcbimakeblastdbCommandline.gi_maskNcbimakeblastdbCommandline.gi_mask_nameNcbimakeblastdbCommandline.hNcbimakeblastdbCommandline.hash_indexNcbimakeblastdbCommandline.helpNcbimakeblastdbCommandline.input_fileNcbimakeblastdbCommandline.input_typeNcbimakeblastdbCommandline.logfileNcbimakeblastdbCommandline.mask_dataNcbimakeblastdbCommandline.mask_descNcbimakeblastdbCommandline.mask_idNcbimakeblastdbCommandline.max_file_szNcbimakeblastdbCommandline.outNcbimakeblastdbCommandline.parse_seqidsNcbimakeblastdbCommandline.taxidNcbimakeblastdbCommandline.taxid_mapNcbimakeblastdbCommandline.titleNcbimakeblastdbCommandline.version
- Bio.Blast.NCBIWWW module
- Bio.Blast.NCBIXML module
Module contents
Code to parse and store BLAST XML output, and to invoke the NCBI BLAST web server.
This module provides code to parse and store BLAST XML output, following its definition in the associated BLAST XML DTD file: https://www.ncbi.nlm.nih.gov/dtd/NCBI_BlastOutput.dtd
This module also provides code to invoke the BLAST web server provided by NCBI. https://blast.ncbi.nlm.nih.gov/
Variables:
email Set the Blast email parameter (default is None).
tool Set the Blast tool parameter (default is
biopython).
- exception Bio.Blast.NotXMLError(message)
Bases:
ValueErrorFailed to parse file as XML.
- __init__(message)
Initialize the class.
- __str__()
Return a string summary of the exception.
- exception Bio.Blast.CorruptedXMLError(message)
Bases:
ValueErrorCorrupted XML.
- __init__(message)
Initialize the class.
- __str__()
Return a string summary of the exception.
- class Bio.Blast.HSP(sequences, coordinates=None)
Bases:
AlignmentStores an alignment of one query sequence against a target sequence.
An HSP (High-scoring Segment Pair) stores the alignment of one query sequence segment against one target (hit) sequence segment. The
Bio.Blast.HSPclass inherits from theBio.Align.Alignmentclass.In addition to the
targetandqueryattributes of aBio.Align.Alignment, aBio.Blast.HSPobject has the following attributes:score: score of HSP;
- annotations: a dictionary that may contain the following keys:
‘bit score’: score (in bits) of HSP (float);
‘evalue’: e-value of HSP (float);
‘identity’: number of identities in HSP (integer);
‘positive’: number of positives in HSP (integer);
‘gaps’: number of gaps in HSP (integer);
‘midline’: formatting middle line.
A
Bio.Blast.HSPobject behaves the same as a Bio.Align.Alignment` object and can be used as such. However, when printing aBio.Blast.HSPobject, the BLAST e-value and bit score are included in the output (in addition to the alignment itself).See the documentation of
Bio.Blast.Recordfor a more detailed explanation of how the information in BLAST records is stored in Biopython.- __repr__()
Return a representation of the alignment, including its shape.
The representation cannot be used with eval() to recreate the object, which is usually possible with simple python objects. For example:
<Alignment object (2 rows x 14 columns) at 0x10403d850>
The hex string is the memory address of the object and can be used to distinguish different Alignment objects. See help(id) for more information.
>>> import numpy as np >>> from Bio.Align import Alignment >>> alignment = Alignment(("ACCGT", "ACGT"), ... coordinates = np.array([[0, 2, 3, 5], ... [0, 2, 2, 4], ... ])) >>> print(alignment) target 0 ACCGT 5 0 ||-|| 5 query 0 AC-GT 4 >>> alignment <Alignment object (2 rows x 5 columns) at 0x...>
- __str__()
Return a human-readable string representation of the alignment.
For sequence alignments, each line has at most 80 columns. The first 10 columns show the (possibly truncated) sequence name, which may be the id attribute of a SeqRecord, or otherwise ‘target’ or ‘query’ for pairwise alignments. The next 10 columns show the sequence coordinate, using zero-based counting as usual in Python. The remaining 60 columns shown the sequence, using dashes to represent gaps. At the end of the alignment, the end coordinates are shown on the right of the sequence, again in zero-based coordinates.
Pairwise alignments have an additional line between the two sequences showing whether the sequences match (‘|’) or mismatch (‘.’), or if there is a gap (‘-‘). The coordinates shown for this line are the column indices, which can be useful when extracting a subalignment.
For example,
>>> from Bio.Align import PairwiseAligner >>> aligner = PairwiseAligner()
>>> seqA = "TTAACCCCATTTG" >>> seqB = "AAGCCCCTTT" >>> seqC = "AAAGGGGCTT"
>>> alignments = aligner.align(seqA, seqB) >>> len(alignments) 1 >>> alignment = alignments[0] >>> print(alignment) target 0 TTAA-CCCCATTTG 13 0 --||-||||-|||- 14 query 0 --AAGCCCC-TTT- 10
Note that seqC is the reverse complement of seqB. Aligning it to the reverse strand gives the same alignment, but the query coordinates are switched:
>>> alignments = aligner.align(seqA, seqC, strand="-") >>> len(alignments) 1 >>> alignment = alignments[0] >>> print(alignment) target 0 TTAA-CCCCATTTG 13 0 --||-||||-|||- 14 query 10 --AAGCCCC-TTT- 0
- class Bio.Blast.Hit(alignments=())
Bases:
AlignmentsStores a single BLAST hit of one single query against one target.
The
Bio.Blast.Hitclass inherits from theBio.Align.Alignmentsclass, which is a subclass of a Python list. TheBio.Blast.Hitclass storesBio.Blast.HSPobjwcts, which inherit fromBio.Align.Alignment. ABio.Blast.Hitobject is therefore effectively a list ofBio.Align.Alignmentobjects. Most hits consist of only 1 or a few Alignment objects.Each
Bio.Blast.Hitobject has atargetattribute containing the following information:target.id: seqId of subject;
target.description: definition line of subject;
target.name: accession of subject;
len(target.seq): sequence length of subject.
See the documentation of
Bio.Blast.Recordfor a more detailed explanation of the information stored in the alignments contained in theBio.Blast.Hitobject.- __getitem__(key)
x.__getitem__(y) <==> x[y]
- __repr__()
Return repr(self).
- __str__()
Return a human readable summary of the Hit object.
- __abstractmethods__ = frozenset({})
- class Bio.Blast.Record
Bases:
listStores the BLAST results for a single query.
A
Bio.Blast.Recordobject is a list ofBio.Blast.Hitobjects, each corresponding to one hit for the query in the BLAST output.- The
Bio.Blast.Recordobject may have the following attributes: - query: A
SeqRecordobject which may contain some or all of the - following information:
query.id: SeqId of query;
query.description: Definition line of query;
len(query.seq): Length of the query sequence.
- query: A
- stat: A dictionary with summary statistics of the BLAST run. It may
- contain the following keys:
‘db-num’: number of sequences in BLAST db (integer);
‘db-len’: length of BLAST db (integer);
‘hsp-len’: effective HSP length (integer);
‘eff-space’: effective search space (float);
‘kappa’: Karlin-Altschul parameter K (float);
‘lambda’: Karlin-Altschul parameter Lambda (float);
‘entropy’: Karlin-Altschul parameter H (float).
message: Some (error?) information.
Each
Bio.Blast.Hitobject has atargetattribute containing the following information:target.id: seqId of subject;
target.description: definition line of subject;
target.name: accession of subject;
len(target.seq): sequence length of subject.
The
Bio.Blast.Hitclass inherits from theBio.Align.Alignmentsclass, which inherits from a Python list. In this list, theBio.Blast.Hitobject storesBio.Blast.HSPobjects, which inherit from theBio.Align.Alignmentclass. ABio.Blast.Hitobject is therefore effectively a list of alignment objects.Each HSP in a
Bio.Blast.Hitobject has the attributestargetandqueryattributes, as usual for of aBio.Align.Alignmentobject storing a pairwise alignment, pointing to aSeqRecordobject representing the target and query, respectively. For translated BLAST searches, thefeaturesattribute of the target or query may contain aSeqFeatureof type CDS that stores the amino acid sequence region. Thequalifiersattribute of such a feature is a dictionary with a single key ‘coded_by’; the corresponding value specifies the nucleotide sequence region, in a GenBank-style string with 1-based coordinates, that encodes the amino acid sequence.Each
Bio.Blast.HSPobject has the following additional attributes:score: score of HSP;
- annotations: a dictionary that may contain the following keys:
‘bit score’: score (in bits) of HSP (float);
‘evalue’: e-value of HSP (float);
‘identity’: number of identities in HSP (integer);
‘positive’: number of positives in HSP (integer);
‘gaps’: number of gaps in HSP (integer);
‘midline’: formatting middle line.
>>> from Bio import Blast >>> record = Blast.read("Blast/xml_2212L_blastx_001.xml") >>> record.query SeqRecord(seq=Seq(None, length=556), id='gi|1347369|gb|G25137.1|G25137', name='<unknown name>', description='human STS EST48004, sequence tagged site', dbxrefs=[]) >>> record.stat {'db-num': 2934173, 'db-len': 1011751523, 'hsp-len': 0, 'eff-space': 0, 'kappa': 0.041, 'lambda': 0.267, 'entropy': 0.14} >>> len(record) 78 >>> hit = record[0] >>> type(hit) <class 'Bio.Blast.Hit'> >>> from Bio.Align import Alignments >>> isinstance(hit, Alignments) True >>> hit.target SeqRecord(seq=Seq(None, length=319), id='gi|12654095|gb|AAH00859.1|', name='AAH00859', description='Unknown (protein for IMAGE:3459481) [Homo sapiens]', dbxrefs=[])
Most hits consist of only 1 or a few Alignment objects:
>>> len(hit) 1 >>> alignment = hit[0] >>> type(alignment) <class 'Bio.Blast.HSP'> >>> alignment.score 630.0 >>> alignment.annotations {'bit score': 247.284, 'evalue': 1.69599e-64, 'identity': 122, 'positive': 123, 'gaps': 0, 'midline': 'DLQLLIKAVNLFPAGTNSRWEVIANYMNIHSSSGVKRTAKDVIGKAKSLQKLDPHQKDDINKKAFDKFKKEHGVVPQADNATPSERF GPYTDFTP TTE QKL EQAL TYPVNT ERW IA AVPGR K+'}
Target and query information are stored in the respective attributes of the alignment:
>>> alignment.target SeqRecord(seq=Seq({155: 'DLQLLIKAVNLFPAGTNSRWEVIANYMNIHSSSGVKRTAKDVIGKAKSLQKLDP...TKK'}, length=319), id='gi|12654095|gb|AAH00859.1|', name='AAH00859', description='Unknown (protein for IMAGE:3459481) [Homo sapiens]', dbxrefs=[]) >>> alignment.query SeqRecord(seq=Seq('DLQLLIKAVNLFPAGTNSRWEVIANYMNIHSSSGVKRTAKDVIGKAKSLQKLDP...XKE'), id='gi|1347369|gb|G25137.1|G25137', name='<unknown name>', description='human STS EST48004, sequence tagged site', dbxrefs=[])
This was a BLASTX run, so the query sequence was translated:
>>> len(alignment.target.features) 0 >>> len(alignment.query.features) 1 >>> feature = alignment.query.features[0] >>> feature SeqFeature(SimpleLocation(ExactPosition(0), ExactPosition(133)), type='CDS', qualifiers=...) >>> feature.qualifiers {'coded_by': 'gi|1347369|gb|G25137.1|G25137:1..399'}
i.e., nucleotides 0:399 (in zero-based coordinates) encode the amino acids of the query in the alignment.
For an alignment against the reverse strand, the location in the qualifier is shown as in this example:
>>> record[72][0].query.features[0].qualifiers {'coded_by': 'complement(gi|1347369|gb|G25137.1|G25137:345..530)'}
- __init__()
Initialize the Record object.
- __repr__()
Return repr(self).
- __str__()
Return str(self).
- __getitem__(key)
x.__getitem__(y) <==> x[y]
- keys()
Return a list of the target.id of each hit.
- __contains__(key)
Return key in self.
- index(key)
Return the index of the hit for which the target.id is equal to the key.
- The
- class Bio.Blast.Records(source)
Bases:
UserListStores the BLAST results of a single BLAST run.
A
Bio.Blast.Recordsobject is an iterator. Iterating over it returns returnsBio.Blast.Recordobjects, each of which corresponds to one BLAST query.Common attributes of a
Bio.Blast.Recordsobject are- source: The input data from which the
Bio.Blast.Recordsobject was constructed.
- source: The input data from which the
program: The specific BLAST program that was used (e.g., ‘blastn’).
version: The version of the BLAST program (e.g., ‘BLASTN 2.2.27+’).
reference: The literature reference to the BLAST publication.
- db: The BLAST database against which the query was run
(e.g., ‘nr’).
- query: A
SeqRecordobject which may contain some or all of the - following information:
query.id: SeqId of the query;
query.description: Definition line of the query;
- query.seq: The query sequence. The query sequence.
The query sequence.
- query: A
- param: A dictionary with the parameters used for the BLAST run.
- You may find the following keys in this dictionary:
- ‘matrix’: the scoring matrix used in the BLAST run
(e.g., ‘BLOSUM62’) (string);
- ‘expect’: threshold on the expected number of chance
matches (float);
- ‘include’: e-value threshold for inclusion in
multipass model in psiblast (float);
‘sc-match’: score for matching nucleotides (integer);
- ‘sc-mismatch’: score for mismatched nucleotides
(integer);
‘gap-open’: gap opening cost (integer);
‘gap-extend’: gap extension cost (integer);
- ‘filter’: filtering options applied in the BLAST
run (string);
‘pattern’: PHI-BLAST pattern (string);
‘entrez-query’: Limit of request to Entrez query (string).
- mbstat: A dictionary with Mega BLAST search statistics. As this
information is stored near the end of the XML file, this attribute can only be accessed after the file has been read completely (by iterating over the records until a
StopIterationis issued. This dictionary can contain the same keys as the dictionary stored under thestatattribute of aRecordobject.
>>> from Bio import Blast >>> path = "Blast/xml_2218_blastp_002.xml"
In a script, you would use a
withblock, as in>>> with Blast.parse(path) as records: ... print(records.source) ... Blast/xml_2218_blastp_002.xml
to ensure that the file is closed at the end of the block. Here, we will simply do
>>> records = Blast.parse("Blast/xml_2218_blastp_002.xml")
so we can see the output of each command right away.
>>> type(records) <class 'Bio.Blast.Records'> >>> records.source 'Blast/xml_2218_blastp_002.xml' >>> records.program 'blastp' >>> records.version 'BLASTP 2.2.18+' >>> records.reference 'Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.' >>> records.db 'gpipe/9606/Previous/protein' >>> records.param {'matrix': 'BLOSUM62', 'expect': 0.01, 'gap-open': 11, 'gap-extend': 1, 'filter': 'm L; R -d repeat/repeat_9606;'}
Iterating over the records returns Bio.Blast.Record objects:
>>> record = next(records) >>> type(record) <class 'Bio.Blast.Record'> >>> record.query.id 'gi|585505|sp|Q08386|MOPB_RHOCA' >>> record = next(records) >>> type(record) <class 'Bio.Blast.Record'> >>> record.query.id 'gi|129628|sp|P07175.1|PARA_AGRTU' >>> record = next(records) Traceback (most recent call last): ... StopIteration
You can also use the records as a list, for example by extracting a record by index, or by calling
lenorprinton the records. The parser will then automatically iterate over the records and store them:>>> records = Blast.parse("Blast/wnts.xml") >>> record = records[3] # this causes all records to be read in and stored >>> record.query.id 'Query_4' >>> len(records) 5
After the records have been read in, you can still iterate over them:
>>> for i, record in enumerate(records): ... print(i, record.query.id) ... 0 Query_1 1 Query_2 2 Query_3 3 Query_4 4 Query_5
- __init__(source)
Initialize the Records object.
- __enter__()
- __exit__(exc_type, exc_value, exc_traceback)
- __iter__()
- __next__()
- __getitem__(index)
- property data
Overrides the data attribute of UserList.
- __repr__()
Return repr(self).
- __str__()
Return str(self).
- __abstractmethods__ = frozenset({})
- Bio.Blast.parse(source)
Parse an XML file containing BLAST output and return a Bio.Blast.Records object.
This returns an iterator object; iterating over it returns Bio.Blast.Record objects one by one.
The source can be a file stream or the path to an XML file containing the BLAST output. If a file stream, source must be in binary mode. This allows the parser to detect the encoding from the XML file,and to use it to convert any text in the XML to the correct Unicode string. The qblast function in Bio.Blast returns a file stream in binary mode. For files, please use mode “rb” when opening the file, as in
>>> from Bio import Blast >>> stream = open("Blast/wnts.xml", "rb") # opened in binary mode >>> records = Blast.parse(stream) >>> for record in records: ... print(record.query.id, record.query.description) ... Query_1 gi|195230749:301-1383 Homo sapiens wingless-type MMTV integration site family member 2 (WNT2), transcript variant 1, mRNA Query_2 gi|325053704:108-1166 Homo sapiens wingless-type MMTV integration site family, member 3A (WNT3A), mRNA Query_3 gi|156630997:105-1160 Homo sapiens wingless-type MMTV integration site family, member 4 (WNT4), mRNA Query_4 gi|371502086:108-1205 Homo sapiens wingless-type MMTV integration site family, member 5A (WNT5A), transcript variant 2, mRNA Query_5 gi|53729353:216-1313 Homo sapiens wingless-type MMTV integration site family, member 6 (WNT6), mRNA >>> stream.close()
- Bio.Blast.read(source)
Parse an XML file containing BLAST output for a single query and return it.
Internally, this function uses Bio.Blast.parse to obtain an iterator over BLAST records. The function then reads one record from the iterator, ensures that there are no further records, and returns the record it found as a Bio.Blast.Record object. An exception is raised if no records are found, or more than one record is found.
The source can be a file stream or the path to an XML file containing the BLAST output. If a file stream, source must be in binary mode. This allows the parser to detect the encoding from the XML file,and to use it to convert any text in the XML to the correct Unicode string. The qblast function in Bio.Blast returns a file stream in binary mode. For files, please use mode “rb” when opening the file, as in
>>> from Bio import Blast >>> stream = open("Blast/xml_21500_blastn_001.xml", "rb") # opened in binary mode >>> record = Blast.read(stream) >>> record.query.id 'Query_78041' >>> record.query.description 'G26684.1 human STS STS_D11570, sequence tagged site' >>> len(record) 11 >>> stream.close()
Use the Bio.Blast.parse function if you want to read a file containing BLAST output for more than one query.
- Bio.Blast.write(records, destination, fmt='XML')
Write BLAST records as an XML file, and return the number of records.
- Arguments:
records - A
Bio.Blast.Recordsobject.- destination - File or file-like object to write to, or filename as
string. The File object must have been opened for writing in binary mode, and must be closed (or flushed) by the caller after this function returns to ensure that all records are written.
- fmt - string describing the file format to write
(case-insensitive). Currently, only “XML” and “XML2” are accepted.
Returns the number of records written (as an integer).
- Bio.Blast.qblast(program, database, sequence, url_base=NCBI_BLAST_URL, auto_format=None, composition_based_statistics=None, db_genetic_code=None, endpoints=None, entrez_query='(none)', expect=10.0, filter=None, gapcosts=None, genetic_code=None, hitlist_size=50, i_thresh=None, layout=None, lcase_mask=None, matrix_name=None, nucl_penalty=None, nucl_reward=None, other_advanced=None, perc_ident=None, phi_pattern=None, query_file=None, query_believe_defline=None, query_from=None, query_to=None, searchsp_eff=None, service=None, threshold=None, ungapped_alignment=None, word_size=None, short_query=None, alignments=500, alignment_view=None, descriptions=500, entrez_links_new_window=None, expect_low=None, expect_high=None, format_entrez_query=None, format_object=None, format_type='XML', ncbi_gi=None, results_file=None, show_overview=None, megablast=None, template_type=None, template_length=None, username='blast', password=None)
BLAST search using NCBI’s QBLAST server or a cloud service provider.
Supports all parameters of the old qblast API for Put and Get.
Please note that NCBI uses the new Common URL API for BLAST searches on the internet (http://ncbi.github.io/blast-cloud/dev/api.html). Thus, some of the parameters used by this function are not (or are no longer) officially supported by NCBI. Although they are still functioning, this may change in the future.
The Common URL API (http://ncbi.github.io/blast-cloud/dev/api.html) allows doing BLAST searches on cloud servers. To use this feature, please set
url_base='http://host.my.cloud.service.provider.com/cgi-bin/blast.cgi'andformat_object='Alignment'. For more details, please see https://blast.ncbi.nlm.nih.gov/Blast.cgi?PAGE_TYPE=BlastDocs&DOC_TYPE=CloudBlastSome useful parameters:
program blastn, blastp, blastx, tblastn, or tblastx (lower case)
database Which database to search against (e.g. “nr”).
sequence The sequence to search.
ncbi_gi TRUE/FALSE whether to give ‘gi’ identifier.
descriptions Number of descriptions to show. Def 500.
alignments Number of alignments to show. Def 500.
expect An expect value cutoff. Def 10.0.
matrix_name Specify an alt. matrix (PAM30, PAM70, BLOSUM80, BLOSUM45).
filter “none” turns off filtering. Default no filtering
- format_type “XML” (default), “HTML”, “Text”, “XML2”, “JSON2”,
or “Tabular”.
entrez_query Entrez query to limit Blast search
hitlist_size Number of hits to return. Default 50
megablast TRUE/FALSE whether to use MEga BLAST algorithm (blastn only)
- short_query TRUE/FALSE whether to adjust the search parameters for a
short query sequence. Note that this will override manually set parameters like word size and e value. Turns off when sequence length is > 30 residues. Default: None.
service plain, psi, phi, rpsblast, megablast (lower case)
This function does no checking of the validity of the parameters and passes the values to the server as is. More help is available at: https://ncbi.github.io/blast-cloud/dev/api.html
The http.client.HTTPResponse object returned by this function has the additional attributes rid and rtoe with the Request ID and Request Time Of Execution for this BLAST search.