Bio.Sequencing.Applications package

Module contents

Sequencing related command line application wrappers (OBSOLETE).

We have decided to remove this module in future, and instead recommend building your command and invoking it via the subprocess module directly.

class Bio.Sequencing.Applications.BwaIndexCommandline(cmd='bwa', **kwargs)

Bases: Bio.Application.AbstractCommandline

Command line wrapper for Burrows Wheeler Aligner (BWA) index.

Index database sequences in the FASTA format, equivalent to:

$ bwa index [-p prefix] [-a algoType] [-c] <in.db.fasta>

See http://bio-bwa.sourceforge.net/bwa.shtml for details.

Examples

>>> from Bio.Sequencing.Applications import BwaIndexCommandline
>>> reference_genome = "/path/to/reference_genome.fasta"
>>> index_cmd = BwaIndexCommandline(infile=reference_genome, algorithm="bwtsw")
>>> print(index_cmd)
bwa index -a bwtsw /path/to/reference_genome.fasta

You would typically run the command using index_cmd() or via the Python subprocess module, as described in the Biopython tutorial.

__init__(cmd='bwa', **kwargs)

Initialize the class.

property algorithm

Algorithm for constructing BWT index.

Available options are:
  • is: IS linear-time algorithm for constructing suffix array. It requires 5.37N memory where N is the size of the database. IS is moderately fast, but does not work with database larger than 2GB. IS is the default algorithm due to its simplicity.

  • bwtsw: Algorithm implemented in BWT-SW. This method works with the whole human genome, but it does not work with database smaller than 10MB and it is usually slower than IS.

This controls the addition of the -a parameter and its associated value. Set this property to the argument value required.

property c

Build color-space index. The input fasta should be in nucleotide space.

This property controls the addition of the -c switch, treat this property as a boolean.

property infile

Input file name

This controls the addition of the infile parameter and its associated value. Set this property to the argument value required.

property prefix

Prefix of the output database [same as db filename]

This controls the addition of the -p parameter and its associated value. Set this property to the argument value required.

class Bio.Sequencing.Applications.BwaAlignCommandline(cmd='bwa', **kwargs)

Bases: Bio.Application.AbstractCommandline

Command line wrapper for Burrows Wheeler Aligner (BWA) aln.

Run a BWA alignment, equivalent to:

$ bwa aln [...] <in.db.fasta> <in.query.fq> > <out.sai>

See http://bio-bwa.sourceforge.net/bwa.shtml for details.

Examples

>>> from Bio.Sequencing.Applications import BwaAlignCommandline
>>> reference_genome = "/path/to/reference_genome.fasta"
>>> read_file = "/path/to/read_1.fq"
>>> output_sai_file = "/path/to/read_1.sai"
>>> align_cmd = BwaAlignCommandline(reference=reference_genome, read_file=read_file)
>>> print(align_cmd)
bwa aln /path/to/reference_genome.fasta /path/to/read_1.fq

You would typically run the command line using align_cmd(stdout=output_sai_file) or via the Python subprocess module, as described in the Biopython tutorial.

__init__(cmd='bwa', **kwargs)

Initialize the class.

property B

Length of barcode starting from the 5-end. When INT is positive, the barcode of each read will be trimmed before mapping and will be written at the BC SAM tag. For paired-end reads, the barcode from both ends are concatenated. [0]

This controls the addition of the -B parameter and its associated value. Set this property to the argument value required.

property E

Gap extension penalty [4]

This controls the addition of the -E parameter and its associated value. Set this property to the argument value required.

property I

The input is in the Illumina 1.3+ read format (quality equals ASCII-64).

This property controls the addition of the -I switch, treat this property as a boolean.

property M

Mismatch penalty. BWA will not search for suboptimal hits with a score lower than (bestScore-misMsc). [3]

This controls the addition of the -M parameter and its associated value. Set this property to the argument value required.

property N

Disable iterative search. All hits with no more than maxDiff differences will be found. This mode is much slower than the default.

This property controls the addition of the -N switch, treat this property as a boolean.

property O

Gap open penalty [11]

This controls the addition of the -O parameter and its associated value. Set this property to the argument value required.

property R

Proceed with suboptimal alignments if there are no more than INT equally best hits.

This option only affects paired-end mapping. Increasing this threshold helps to improve the pairing accuracy at the cost of speed, especially for short reads (~32bp).

This controls the addition of the -R parameter and its associated value. Set this property to the argument value required.

property b

Specify the input read sequence file is the BAM format

This property controls the addition of the -b switch, treat this property as a boolean.

property b1

When -b is specified, only use the first read in a read pair in mapping (skip single-end reads and the second reads).

This property controls the addition of the -b1 switch, treat this property as a boolean.

property b2

When -b is specified, only use the second read in a read pair in mapping.

This property controls the addition of the -b2 switch, treat this property as a boolean.

property c

Reverse query but not complement it, which is required for alignment in the color space.

This property controls the addition of the -c switch, treat this property as a boolean.

property d

Disallow a long deletion within INT bp towards the 3-end [16]

This controls the addition of the -d parameter and its associated value. Set this property to the argument value required.

property e

Maximum number of gap extensions, -1 for k-difference mode (disallowing long gaps) [-1]

This controls the addition of the -e parameter and its associated value. Set this property to the argument value required.

property i

Disallow an indel within INT bp towards the ends [5]

This controls the addition of the -i parameter and its associated value. Set this property to the argument value required.

property k

Maximum edit distance in the seed [2]

This controls the addition of the -k parameter and its associated value. Set this property to the argument value required.

property l

Take the first INT subsequence as seed.

If INT is larger than the query sequence, seeding will be disabled. For long reads, this option is typically ranged from 25 to 35 for -k 2. [inf]

This controls the addition of the -l parameter and its associated value. Set this property to the argument value required.

property n

Maximum edit distance if the value is INT, or the fraction of missing alignments given 2% uniform base error rate if FLOAT. In the latter case, the maximum edit distance is automatically chosen for different read lengths. [0.04]

This controls the addition of the -n parameter and its associated value. Set this property to the argument value required.

property o

Maximum edit distance if the value is INT, or the fraction of missing alignments given 2% uniform base error rate if FLOAT. In the latter case, the maximum edit distance is automatically chosen for different read lengths. [0.04]

This controls the addition of the -o parameter and its associated value. Set this property to the argument value required.

property q

Parameter for read trimming [0].

BWA trims a read down to argmax_x{sum_{i=x+1}^l(INT-q_i)} if q_l<INT where l is the original read length.

This controls the addition of the -q parameter and its associated value. Set this property to the argument value required.

property read_file

Read file name

This controls the addition of the read_file parameter and its associated value. Set this property to the argument value required.

property reference

Reference file name

This controls the addition of the reference parameter and its associated value. Set this property to the argument value required.

property t

Number of threads (multi-threading mode) [1]

This controls the addition of the -t parameter and its associated value. Set this property to the argument value required.

class Bio.Sequencing.Applications.BwaSamseCommandline(cmd='bwa', **kwargs)

Bases: Bio.Application.AbstractCommandline

Command line wrapper for Burrows Wheeler Aligner (BWA) samse.

Generate alignments in the SAM format given single-end reads. Equvialent to:

$ bwa samse [-n maxOcc] <in.db.fasta> <in.sai> <in.fq> > <out.sam>

See http://bio-bwa.sourceforge.net/bwa.shtml for details.

Examples

>>> from Bio.Sequencing.Applications import BwaSamseCommandline
>>> reference_genome = "/path/to/reference_genome.fasta"
>>> read_file = "/path/to/read_1.fq"
>>> sai_file = "/path/to/read_1.sai"
>>> output_sam_file = "/path/to/read_1.sam"
>>> samse_cmd = BwaSamseCommandline(reference=reference_genome,
...                                 read_file=read_file, sai_file=sai_file)
>>> print(samse_cmd)
bwa samse /path/to/reference_genome.fasta /path/to/read_1.sai /path/to/read_1.fq

You would typically run the command line using samse_cmd(stdout=output_sam_file) or via the Python subprocess module, as described in the Biopython tutorial.

__init__(cmd='bwa', **kwargs)

Initialize the class.

property n

Maximum number of alignments to output in the XA tag for reads paired properly.

If a read has more than INT hits, the XA tag will not be written. [3]

This controls the addition of the -n parameter and its associated value. Set this property to the argument value required.

property r

Specify the read group in a format like @RG ID:foo SM:bar’. [null]

This controls the addition of the -r parameter and its associated value. Set this property to the argument value required.

property read_file

Read file name

This controls the addition of the read_file parameter and its associated value. Set this property to the argument value required.

property reference

Reference file name

This controls the addition of the reference parameter and its associated value. Set this property to the argument value required.

property sai_file

Sai file name

This controls the addition of the sai_file parameter and its associated value. Set this property to the argument value required.

class Bio.Sequencing.Applications.BwaSampeCommandline(cmd='bwa', **kwargs)

Bases: Bio.Application.AbstractCommandline

Command line wrapper for Burrows Wheeler Aligner (BWA) sampe.

Generate alignments in the SAM format given paired-end reads. Equivalent to:

$ bwa sampe [...] <in.db.fasta> <in1.sai> <in2.sai> <in1.fq> <in2.fq> > <out.sam>

See http://bio-bwa.sourceforge.net/bwa.shtml for details.

Examples

>>> from Bio.Sequencing.Applications import BwaSampeCommandline
>>> reference_genome = "/path/to/reference_genome.fasta"
>>> read_file1 = "/path/to/read_1.fq"
>>> read_file2 = "/path/to/read_2.fq"
>>> sai_file1 = "/path/to/read_1.sai"
>>> sai_file2 = "/path/to/read_2.sai"
>>> output_sam_file = "/path/to/output.sam"
>>> read_group = r"@RG\tID:foo\tSM:bar"  # BWA will turn backslash-t into tab
>>> sampe_cmd = BwaSampeCommandline(reference=reference_genome,
...                                 sai_file1=sai_file1, sai_file2=sai_file2,
...                                 read_file1=read_file1, read_file2=read_file2,
...                                 r=read_group)
>>> print(sampe_cmd)
bwa sampe /path/to/reference_genome.fasta /path/to/read_1.sai /path/to/read_2.sai /path/to/read_1.fq /path/to/read_2.fq -r @RG\tID:foo\tSM:bar

You would typically run the command line using sampe_cmd(stdout=output_sam_file) or via the Python subprocess module, as described in the Biopython tutorial.

__init__(cmd='bwa', **kwargs)

Initialize the class.

property N

Maximum number of alignments to output in the XA tag for disconcordant read pairs (excluding singletons) [10].

If a read has more than INT hits, the XA tag will not be written.

This controls the addition of the -N parameter and its associated value. Set this property to the argument value required.

property a

Maximum insert size for a read pair to be considered being mapped properly [500].

Since 0.4.5, this option is only used when there are not enough good alignments to infer the distribution of insert sizes.

This controls the addition of the -a parameter and its associated value. Set this property to the argument value required.

property n

Maximum number of alignments to output in the XA tag for reads paired properly [3].

If a read has more than INT hits, the XA tag will not be written.

This controls the addition of the -n parameter and its associated value. Set this property to the argument value required.

property o

Maximum occurrences of a read for pairing [100000].

A read with more occurrences will be treated as a single-end read. Reducing this parameter helps faster pairing.

This controls the addition of the -o parameter and its associated value. Set this property to the argument value required.

property r

Specify the read group in a format like @RG ID:foo SM:bar’. [null]

This controls the addition of the -r parameter and its associated value. Set this property to the argument value required.

property read_file1

Read file 1

This controls the addition of the read_file1 parameter and its associated value. Set this property to the argument value required.

property read_file2

Read file 2

This controls the addition of the read_file2 parameter and its associated value. Set this property to the argument value required.

property reference

Reference file name

This controls the addition of the reference parameter and its associated value. Set this property to the argument value required.

property sai_file1

Sai file 1

This controls the addition of the sai_file1 parameter and its associated value. Set this property to the argument value required.

property sai_file2

Sai file 2

This controls the addition of the sai_file2 parameter and its associated value. Set this property to the argument value required.

class Bio.Sequencing.Applications.BwaBwaswCommandline(cmd='bwa', **kwargs)

Bases: Bio.Application.AbstractCommandline

Command line wrapper for Burrows Wheeler Aligner (BWA) bwasw.

Align query sequences from FASTQ files. Equivalent to:

$ bwa bwasw [...] <in.db.fasta> <in.fq>

See http://bio-bwa.sourceforge.net/bwa.shtml for details.

Examples

>>> from Bio.Sequencing.Applications import BwaBwaswCommandline
>>> reference_genome = "/path/to/reference_genome.fasta"
>>> read_file = "/path/to/read_1.fq"
>>> bwasw_cmd = BwaBwaswCommandline(reference=reference_genome, read_file=read_file)
>>> print(bwasw_cmd)
bwa bwasw /path/to/reference_genome.fasta /path/to/read_1.fq

You would typically run the command line using bwasw_cmd() or via the Python subprocess module, as described in the Biopython tutorial.

__init__(cmd='bwa', **kwargs)

Initialize the class.

property N

Minimum number of seeds supporting the resultant alignment to skip reverse alignment. [5]

This controls the addition of the -N parameter and its associated value. Set this property to the argument value required.

property T

Minimum score threshold divided by a [37]

This controls the addition of the -T parameter and its associated value. Set this property to the argument value required.

property a

Score of a match [1]

This controls the addition of the -a parameter and its associated value. Set this property to the argument value required.

property b

Mismatch penalty [3]

This controls the addition of the -b parameter and its associated value. Set this property to the argument value required.

property c

Coefficient for threshold adjustment according to query length [5.5].

Given an l-long query, the threshold for a hit to be retained is a*max{T,c*log(l)}.

This controls the addition of the -c parameter and its associated value. Set this property to the argument value required.

property mate_file

Mate file

This controls the addition of the mate_file parameter and its associated value. Set this property to the argument value required.

property q

Gap open penalty [5]

This controls the addition of the -q parameter and its associated value. Set this property to the argument value required.

property r

Gap extension penalty. The penalty for a contiguous gap of size k is q+k*r. [2]

This controls the addition of the -r parameter and its associated value. Set this property to the argument value required.

property read_file

Read file

This controls the addition of the read_file parameter and its associated value. Set this property to the argument value required.

property reference

Reference file name

This controls the addition of the reference parameter and its associated value. Set this property to the argument value required.

property s

Maximum SA interval size for initiating a seed [3].

Higher -s increases accuracy at the cost of speed.

This controls the addition of the -s parameter and its associated value. Set this property to the argument value required.

property t

Number of threads in the multi-threading mode [1]

This controls the addition of the -t parameter and its associated value. Set this property to the argument value required.

property w

Band width in the banded alignment [33]

This controls the addition of the -w parameter and its associated value. Set this property to the argument value required.

property z

Z-best heuristics. Higher -z increases accuracy at the cost of speed. [1]

This controls the addition of the -z parameter and its associated value. Set this property to the argument value required.

class Bio.Sequencing.Applications.BwaMemCommandline(cmd='bwa', **kwargs)

Bases: Bio.Application.AbstractCommandline

Command line wrapper for Burrows Wheeler Aligner (BWA) mem.

Run a BWA-MEM alignment, with single- or paired-end reads, equivalent to:

$ bwa mem [...] <in.db.fasta> <in1.fq> <in2.fq> > <out.sam>

See http://bio-bwa.sourceforge.net/bwa.shtml for details.

Examples

>>> from Bio.Sequencing.Applications import BwaMemCommandline
>>> reference_genome = "/path/to/reference_genome.fasta"
>>> read_file = "/path/to/read_1.fq"
>>> output_sam_file = "/path/to/output.sam"
>>> align_cmd = BwaMemCommandline(reference=reference_genome, read_file1=read_file)
>>> print(align_cmd)
bwa mem /path/to/reference_genome.fasta /path/to/read_1.fq

You would typically run the command line using align_cmd(stdout=output_sam_file) or via the Python subprocess module, as described in the Biopython tutorial.

__init__(cmd='bwa', **kwargs)

Initialize the class.

property A

Matching score. [1]

This controls the addition of the -A parameter and its associated value. Set this property to the argument value required.

property B

Mismatch penalty. The sequence error rate is approximately: {.75 * exp[-log(4) * B/A]}. [4]

This controls the addition of the -B parameter and its associated value. Set this property to the argument value required.

property C

Append FASTA/Q comment to SAM output. This option can be used to transfer read meta information (e.g. barcode) to the SAM output. Note that the FASTA/Q comment (the string after a space in the header line) must conform the SAM spec (e.g. BC:Z:CGTAC). Malformated comments lead to incorrect SAM output.

This property controls the addition of the -C switch, treat this property as a boolean.

property E

Gap extension penalty. A gap of length k costs O + k*E (i.e. -O is for opening a zero-length gap). [1]

This controls the addition of the -E parameter and its associated value. Set this property to the argument value required.

property H

Use hard clipping ‘H’ in the SAM output. This option may dramatically reduce the redundancy of output when mapping long contig or BAC sequences.

This property controls the addition of the -H switch, treat this property as a boolean.

property L

Clipping penalty. When performing SW extension, BWA-MEM keeps track of the best score reaching the end of query. If this score is larger than the best SW score minus the clipping penalty, clipping will not be applied. Note that in this case, the SAM AS tag reports the best SW score; clipping penalty is not deducted. [5]

This controls the addition of the -L parameter and its associated value. Set this property to the argument value required.

property M

Mark shorter split hits as secondary (for Picard compatibility).

This property controls the addition of the -M switch, treat this property as a boolean.

property O

Gap open penalty. [6]

This controls the addition of the -O parameter and its associated value. Set this property to the argument value required.

property P

In the paired-end mode, perform SW to rescue missing hits only but do not try to find hits that fit a proper pair.

This property controls the addition of the -P switch, treat this property as a boolean.

property R

Complete read group header line. ‘t’ can be used in STR and will be converted to a TAB in the output SAM. The read group ID will be attached to every read in the output. An example is @RG ID:foo SM:bar’. [null]

This controls the addition of the -R parameter and its associated value. Set this property to the argument value required.

property T

Don’t output alignment with score lower than INT. This option only affects output. [30]

This controls the addition of the -T parameter and its associated value. Set this property to the argument value required.

property U

Penalty for an unpaired read pair. BWA-MEM scores an unpaired read pair as scoreRead1+scoreRead2-INT and scores a paired as scoreRead1+scoreRead2-insertPenalty. It compares these two scores to determine whether we should force pairing. [9]

This controls the addition of the -U parameter and its associated value. Set this property to the argument value required.

property a

Output all found alignments for single-end or unpaired paired-end reads. These alignments will be flagged as secondary alignments.

This property controls the addition of the -a switch, treat this property as a boolean.

property c

Discard a MEM if it has more than INT occurrence in the genome. This is an insensitive parameter. [10000]

This controls the addition of the -c parameter and its associated value. Set this property to the argument value required.

property d

Off-diagonal X-dropoff (Z-dropoff). Stop extension when the difference between the best and the current extension score is above |i-j|*A+INT, where i and j are the current positions of the query and reference, respectively, and A is the matching score. Z-dropoff is similar to BLAST’s X-dropoff except that it doesn’t penalize gaps in one of the sequences in the alignment. Z-dropoff not only avoids unnecessary extension, but also reduces poor alignments inside a long good alignment. [100]

This controls the addition of the -d parameter and its associated value. Set this property to the argument value required.

property k

Minimum seed length. Matches shorter than INT will be missed. The alignment speed is usually insensitive to this value unless it significantly deviates 20. [19]

This controls the addition of the -k parameter and its associated value. Set this property to the argument value required.

property p

Assume the first input query file is interleaved paired-end FASTA/Q. See the command description for details.

This property controls the addition of the -p switch, treat this property as a boolean.

property r

Trigger re-seeding for a MEM longer than minSeedLen*FLOAT. This is a key heuristic parameter for tuning the performance. Larger value yields fewer seeds, which leads to faster alignment speed but lower accuracy. [1.5]

This controls the addition of the -r parameter and its associated value. Set this property to the argument value required.

property read_file1

Read 1 file name

This controls the addition of the read_file1 parameter and its associated value. Set this property to the argument value required.

property read_file2

Read 2 file name

This controls the addition of the read_file2 parameter and its associated value. Set this property to the argument value required.

property reference

Reference file name

This controls the addition of the reference parameter and its associated value. Set this property to the argument value required.

property t

Number of threads [1]

This controls the addition of the -t parameter and its associated value. Set this property to the argument value required.

property v

Control the verbose level of the output. This option has not been fully supported throughout BWA. Ideally, a value 0 for disabling all the output to stderr; 1 for outputting errors only; 2 for warnings and errors; 3 for all normal messages; 4 or higher for debugging. When this option takes value 4, the output is not SAM. [3]

This controls the addition of the -v parameter and its associated value. Set this property to the argument value required.

property w

Band width. Essentially, gaps longer than INT will not be found. Note that the maximum gap length is also affected by the scoring matrix and the hit length, not solely determined by this option. [100]

This controls the addition of the -w parameter and its associated value. Set this property to the argument value required.

class Bio.Sequencing.Applications.NovoalignCommandline(cmd='novoalign', **kwargs)

Bases: Bio.Application.AbstractCommandline

Command line wrapper for novoalign by Novocraft.

See www.novocraft.com - novoalign is a short read alignment program.

Examples

>>> from Bio.Sequencing.Applications import NovoalignCommandline
>>> novoalign_cline = NovoalignCommandline(database='some_db',
...                                        readfile='some_seq.txt')
>>> print(novoalign_cline)
novoalign -d some_db -f some_seq.txt

As with all the Biopython application wrappers, you can also add or change options after creating the object:

>>> novoalign_cline.format = 'PRBnSEQ'
>>> novoalign_cline.r_method='0.99' # limited valid values
>>> novoalign_cline.fragment = '250 20' # must be given as a string
>>> novoalign_cline.miRNA = 100
>>> print(novoalign_cline)
novoalign -d some_db -f some_seq.txt -F PRBnSEQ -r 0.99 -i 250 20 -m 100

You would typically run the command line with novoalign_cline() or via the Python subprocess module, as described in the Biopython tutorial.

Last checked against version: 2.05.04

__init__(cmd='novoalign', **kwargs)

Initialize the class.

property adapter3

Strips a 3’ adapter sequence prior to alignment.

With paired ends two adapters can be specified

This controls the addition of the -a parameter and its associated value. Set this property to the argument value required.

property adapter5

Strips a 5’ adapter sequence.

Similar to -a (adaptor3), but on the 5’ end.

This controls the addition of the -5 parameter and its associated value. Set this property to the argument value required.

property cores

Number of threads, disabled on free versions [default: number of cores]

This controls the addition of the -c parameter and its associated value. Set this property to the argument value required.

property database

database filename

This controls the addition of the -d parameter and its associated value. Set this property to the argument value required.

property format

Format of read files.

Allowed values: FA, SLXFQ, STDFQ, ILMFQ, PRB, PRBnSEQ

This controls the addition of the -F parameter and its associated value. Set this property to the argument value required.

property fragment

Fragment length (2 reads + insert) and standard deviation [default: 250 30]

This controls the addition of the -i parameter and its associated value. Set this property to the argument value required.

property gap_extend

Gap extend penalty [default: 15]

This controls the addition of the -x parameter and its associated value. Set this property to the argument value required.

property gap_open

Gap opening penalty [default: 40]

This controls the addition of the -g parameter and its associated value. Set this property to the argument value required.

property good_bases

Minimum number of good quality bases [default: log(N_g, 4) + 5]

This controls the addition of the -l parameter and its associated value. Set this property to the argument value required.

property homopolymer

Homopolymer read filter [default: 20; disable: negative value]

This controls the addition of the -h parameter and its associated value. Set this property to the argument value required.

property miRNA

Sets miRNA mode and optionally sets a value for the region scanned [default: off]

This controls the addition of the -m parameter and its associated value. Set this property to the argument value required.

property qual_digits

Decimal digits for quality scores [default: 0]

This controls the addition of the -q parameter and its associated value. Set this property to the argument value required.

property quality

Lower threshold for an alignment to be reported [default: 0]

This controls the addition of the -Q parameter and its associated value. Set this property to the argument value required.

property r_method

Methods to report reads with multiple matches.

Allowed values: None, Random, All, Exhaustive, 0.99 ‘All’ and ‘Exhaustive’ accept limits.

This controls the addition of the -r parameter and its associated value. Set this property to the argument value required.

property read_cal

Read quality calibration from file (mismatch counts)

This controls the addition of the -k parameter and its associated value. Set this property to the argument value required.

property readfile

read file

This controls the addition of the -f parameter and its associated value. Set this property to the argument value required.

property recorded

Alignments recorded with score equal to the best.

Default: 1000 in default read method, otherwise no limit.

This controls the addition of the -e parameter and its associated value. Set this property to the argument value required.

property repeats

If score difference is higher, report repeats.

Otherwise -r read method applies [default: 5]

This controls the addition of the -R parameter and its associated value. Set this property to the argument value required.

property report

Specifies the report format.

Allowed values: Native, Pairwise, SAM Default: Native

This controls the addition of the -o parameter and its associated value. Set this property to the argument value required.

property threshold

Threshold for alignment score

This controls the addition of the -t parameter and its associated value. Set this property to the argument value required.

property trimming

If fail to align, trim by s bases until they map or become shorter than l.

Ddefault: 2

This controls the addition of the -s parameter and its associated value. Set this property to the argument value required.

property truncate

Truncate to specific length before alignment

This controls the addition of the -n parameter and its associated value. Set this property to the argument value required.

property unconverted

Experimental: unconverted cytosines penalty in bisulfite mode

Default: no penalty

This controls the addition of the -u parameter and its associated value. Set this property to the argument value required.

property variation

Structural variation penalty [default: 70]

This controls the addition of the -v parameter and its associated value. Set this property to the argument value required.

property write_cal

Accumulate mismatch counts and write to file

This controls the addition of the -K parameter and its associated value. Set this property to the argument value required.

class Bio.Sequencing.Applications.SamtoolsViewCommandline(cmd='samtools', **kwargs)

Bases: Bio.Application.AbstractCommandline

Command line wrapper for samtools view.

Extract/print all or sub alignments in SAM or BAM format, equivalent to:

$ samtools view [-bchuHS] [-t in.refList] [-o output] [-f reqFlag]
                [-F skipFlag] [-q minMapQ] [-l library] [-r readGroup]
                [-R rgFile] <in.bam>|<in.sam> [region1 [...]]

See http://samtools.sourceforge.net/samtools.shtml for more details

Examples

>>> from Bio.Sequencing.Applications import SamtoolsViewCommandline
>>> input_file = "/path/to/sam_or_bam_file"
>>> samtools_view_cmd = SamtoolsViewCommandline(input_file=input_file)
>>> print(samtools_view_cmd)
samtools view /path/to/sam_or_bam_file
__init__(cmd='samtools', **kwargs)

Initialize the class.

property F

Skip alignments with bits present in INT

This controls the addition of the -F parameter and its associated value. Set this property to the argument value required.

property H

Output the header only

This property controls the addition of the -H switch, treat this property as a boolean.

property R

Output reads in read groups listed in FILE

This controls the addition of the -R parameter and its associated value. Set this property to the argument value required.

property S
Input is in SAM.

If @SQ header lines are absent, the ‘-t’ option is required.

This property controls the addition of the -S switch, treat this property as a boolean.

property b

Output in the BAM format

This property controls the addition of the -b switch, treat this property as a boolean.

property c
Instead of printing the alignments, only count them and

print the total number.

All filter options, such as ‘-f’, ‘-F’ and ‘-q’, are taken into account

This property controls the addition of the -c switch, treat this property as a boolean.

property f
Only output alignments with all bits in

INT present in the FLAG field

This controls the addition of the -f parameter and its associated value. Set this property to the argument value required.

property fast_bam

Use zlib compression level 1 to compress the output

This property controls the addition of the -1 switch, treat this property as a boolean.

property h

Include the header in the output

This property controls the addition of the -h switch, treat this property as a boolean.

property input_file

Input File Name

This controls the addition of the input parameter and its associated value. Set this property to the argument value required.

property l

Only output reads in library STR

This controls the addition of the -l parameter and its associated value. Set this property to the argument value required.

property o

Output file

This controls the addition of the -o parameter and its associated value. Set this property to the argument value required.

property q

Skip alignments with MAPQ smaller than INT

This controls the addition of the -q parameter and its associated value. Set this property to the argument value required.

property r

Only output reads in read group STR

This controls the addition of the -r parameter and its associated value. Set this property to the argument value required.

property region

Region

This controls the addition of the region parameter and its associated value. Set this property to the argument value required.

property t
This file is TAB-delimited.

Each line must contain the reference name and the length of the reference, one line for each distinct reference; additional fields are ignored.

This file also defines the order of the reference sequences in sorting. If you run ‘samtools faidx <ref.fa>’, the resultant index file <ref.fa>.fai can be used as this <in.ref_list> file.

This controls the addition of the -t parameter and its associated value. Set this property to the argument value required.

property u

Output uncompressed BAM.

This option saves time spent on compression/decompression and is thus preferred when the output is piped to another samtools command

This property controls the addition of the -u switch, treat this property as a boolean.

class Bio.Sequencing.Applications.SamtoolsCalmdCommandline(cmd='samtools', **kwargs)

Bases: Bio.Application.AbstractCommandline

Command line wrapper for samtools calmd.

Generate the MD tag, equivalent to:

$ samtools calmd [-EeubSr] [-C capQcoef] <aln.bam> <ref.fasta>

See http://samtools.sourceforge.net/samtools.shtml for more details

Examples

>>> from Bio.Sequencing.Applications import SamtoolsCalmdCommandline
>>> input_bam = "/path/to/aln.bam"
>>> reference_fasta = "/path/to/reference.fasta"
>>> calmd_cmd = SamtoolsCalmdCommandline(input_bam=input_bam,
...                                      reference=reference_fasta)
>>> print(calmd_cmd)
samtools calmd /path/to/aln.bam /path/to/reference.fasta
__init__(cmd='samtools', **kwargs)

Initialize the class.

property A
When used jointly with -r this option overwrites

the original base quality

This property controls the addition of the -A switch, treat this property as a boolean.

property C
Coefficient to cap mapping quality

of poorly mapped reads.

See the pileup command for details.

This controls the addition of the -C parameter and its associated value. Set this property to the argument value required.

property E
Extended BAQ calculation.

This option trades specificity for sensitivity, though the effect is minor.

This property controls the addition of the -E switch, treat this property as a boolean.

property S

The input is SAM with header lines

This property controls the addition of the -S switch, treat this property as a boolean.

property b

Output compressed BAM

This property controls the addition of the -b switch, treat this property as a boolean.

property e
Convert the read base to = if it is

identical to the aligned reference base.

Indel caller does not support the = bases at the moment.

This property controls the addition of the -e switch, treat this property as a boolean.

property input_bam

Input BAM

This controls the addition of the input parameter and its associated value. Set this property to the argument value required.

property r
Compute the BQ tag (without -A)

or cap base quality by BAQ (with -A).

This property controls the addition of the -r switch, treat this property as a boolean.

property ref

Reference FASTA to be indexed

This controls the addition of the reference parameter and its associated value. Set this property to the argument value required.

property u

Output uncompressed BAM

This property controls the addition of the -u switch, treat this property as a boolean.

class Bio.Sequencing.Applications.SamtoolsCatCommandline(cmd='samtools', **kwargs)

Bases: Bio.Application.AbstractCommandline

Command line wrapper for samtools cat.

Concatenate BAMs, equivalent to:

$ samtools cat [-h header.sam] [-o out.bam] <in1.bam> <in2.bam> [ ... ]

See http://samtools.sourceforge.net/samtools.shtml for more details

Examples

>>> from Bio.Sequencing.Applications import SamtoolsCatCommandline
>>> input_bam1 = "/path/to/input_bam1"
>>> input_bam2 = "/path/to/input_bam2"
>>> input_bams = [input_bam1, input_bam2]
>>> samtools_cat_cmd = SamtoolsCatCommandline(input_bam=input_bams)
>>> print(samtools_cat_cmd)
samtools cat /path/to/input_bam1 /path/to/input_bam2
__init__(cmd='samtools', **kwargs)

Initialize the class.

property bams

Input BAM files

This controls the addition of the input parameter and its associated value. Set this property to the argument value required.

property h

Header SAM file

This controls the addition of the -h parameter and its associated value. Set this property to the argument value required.

property o

Output SAM file

This controls the addition of the -o parameter and its associated value. Set this property to the argument value required.

class Bio.Sequencing.Applications.SamtoolsFaidxCommandline(cmd='samtools', **kwargs)

Bases: Bio.Application.AbstractCommandline

Command line wrapper for samtools faidx.

Retrieve and print stats in the index file, equivalent to:

$ samtools faidx <ref.fasta> [region1 [...]]

See http://samtools.sourceforge.net/samtools.shtml for more details

Examples

>>> from Bio.Sequencing.Applications import SamtoolsFaidxCommandline
>>> reference = "/path/to/reference.fasta"
>>> samtools_faidx_cmd = SamtoolsFaidxCommandline(reference=reference)
>>> print(samtools_faidx_cmd)
samtools faidx /path/to/reference.fasta
__init__(cmd='samtools', **kwargs)

Initialize the class.

property ref

Reference FASTA to be indexed

This controls the addition of the reference parameter and its associated value. Set this property to the argument value required.

class Bio.Sequencing.Applications.SamtoolsFixmateCommandline(cmd='samtools', **kwargs)

Bases: Bio.Application.AbstractCommandline

Command line wrapper for samtools fixmate.

Fill in mate coordinates, ISIZE and mate related flags from a name-sorted alignment, equivalent to:

$ samtools fixmate <in.nameSrt.bam> <out.bam>

See http://samtools.sourceforge.net/samtools.shtml for more details

Examples

>>> from Bio.Sequencing.Applications import SamtoolsFixmateCommandline
>>> in_bam = "/path/to/in.nameSrt.bam"
>>> out_bam = "/path/to/out.bam"
>>> fixmate_cmd = SamtoolsFixmateCommandline(input_bam=in_bam,
...                                          out_bam=out_bam)
>>> print(fixmate_cmd)
samtools fixmate /path/to/in.nameSrt.bam /path/to/out.bam
__init__(cmd='samtools', **kwargs)

Initialize the class.

property input_file

Name Sorted Alignment File

This controls the addition of the in_bam parameter and its associated value. Set this property to the argument value required.

property output_file

Output file

This controls the addition of the out_bam parameter and its associated value. Set this property to the argument value required.

class Bio.Sequencing.Applications.SamtoolsIdxstatsCommandline(cmd='samtools', **kwargs)

Bases: Bio.Application.AbstractCommandline

Command line wrapper for samtools idxstats.

Retrieve and print stats in the index file, equivalent to:

$ samtools idxstats <aln.bam>

See http://samtools.sourceforge.net/samtools.shtml for more details

Examples

>>> from Bio.Sequencing.Applications import SamtoolsIdxstatsCommandline
>>> input = "/path/to/aln_bam"
>>> samtools_idxstats_cmd = SamtoolsIdxstatsCommandline(input_bam=input)
>>> print(samtools_idxstats_cmd)
samtools idxstats /path/to/aln_bam
__init__(cmd='samtools', **kwargs)

Initialize the class.

property input_bam

BAM file to be indexed

This controls the addition of the input parameter and its associated value. Set this property to the argument value required.

class Bio.Sequencing.Applications.SamtoolsIndexCommandline(cmd='samtools', **kwargs)

Bases: Bio.Application.AbstractCommandline

Command line wrapper for samtools index.

Index sorted alignment for fast random access, equivalent to:

$ samtools index <aln.bam>

See http://samtools.sourceforge.net/samtools.shtml for more details

Examples

>>> from Bio.Sequencing.Applications import SamtoolsIndexCommandline
>>> input = "/path/to/aln_bam"
>>> samtools_index_cmd = SamtoolsIndexCommandline(input_bam=input)
>>> print(samtools_index_cmd)
samtools index /path/to/aln_bam
__init__(cmd='samtools', **kwargs)

Initialize the class.

property input_bam

BAM file to be indexed

This controls the addition of the input parameter and its associated value. Set this property to the argument value required.

class Bio.Sequencing.Applications.SamtoolsMergeCommandline(cmd='samtools', **kwargs)

Bases: Bio.Application.AbstractCommandline

Command line wrapper for samtools merge.

Merge multiple sorted alignments, equivalent to:

$ samtools merge [-nur1f] [-h inh.sam] [-R reg]
                 <out.bam> <in1.bam> <in2.bam> [...]

See http://samtools.sourceforge.net/samtools.shtml for more details

Examples

>>> from Bio.Sequencing.Applications import SamtoolsMergeCommandline
>>> out_bam = "/path/to/out_bam"
>>> in_bam = ["/path/to/input_bam1", "/path/to/input_bam2"]
>>> merge_cmd = SamtoolsMergeCommandline(out_bam=out_bam,
...                                      input_bam=in_bam)
>>> print(merge_cmd)
samtools merge /path/to/out_bam /path/to/input_bam1 /path/to/input_bam2
__init__(cmd='samtools', **kwargs)

Initialize the class.

property R

Merge files in the specified region indicated by STR

This controls the addition of the -R parameter and its associated value. Set this property to the argument value required.

property bam

Input BAM

This controls the addition of the input_bam parameter and its associated value. Set this property to the argument value required.

property f
Force to overwrite the

output file if present

This property controls the addition of the -f switch, treat this property as a boolean.

property fast_bam
Use zlib compression level 1

to compress the output

This property controls the addition of the -1 switch, treat this property as a boolean.

property h
Use the lines of FILE as ‘@’

headers to be copied to out.bam

This controls the addition of the -h parameter and its associated value. Set this property to the argument value required.

property n
The input alignments are sorted by read names

rather than by chromosomal coordinates

This property controls the addition of the -n switch, treat this property as a boolean.

property output

Output BAM file

This controls the addition of the output_bam parameter and its associated value. Set this property to the argument value required.

property r
Attach an RG tag to each alignment.

The tag value is inferred from file names

This property controls the addition of the -r switch, treat this property as a boolean.

property u

Uncompressed BAM output

This property controls the addition of the -u switch, treat this property as a boolean.

class Bio.Sequencing.Applications.SamtoolsMpileupCommandline(cmd='samtools', **kwargs)

Bases: Bio.Application.AbstractCommandline

Command line wrapper for samtools mpileup.

Generate BCF or pileup for one or multiple BAM files, equivalent to:

$ samtools mpileup [-EBug] [-C capQcoef] [-r reg] [-f in.fa]
                   [-l list] [-M capMapQ] [-Q minBaseQ]
                   [-q minMapQ] in.bam [in2.bam [...]]

See http://samtools.sourceforge.net/samtools.shtml for more details

Examples

>>> from Bio.Sequencing.Applications import SamtoolsMpileupCommandline
>>> input = ["/path/to/sam_or_bam_file"]
>>> samtools_mpileup_cmd = SamtoolsMpileupCommandline(input_file=input)
>>> print(samtools_mpileup_cmd)
samtools mpileup /path/to/sam_or_bam_file
__init__(cmd='samtools', **kwargs)

Initialize the class.

property A

Do not skip anomalous read pairs in variant calling.

This property controls the addition of the -A switch, treat this property as a boolean.

property B
Disable probabilistic realignment for the

computation of base alignment quality (BAQ).

BAQ is the Phred-scaled probability of a read base being misaligned. Applying this option greatly helps to reduce false SNPs caused by misalignments

This property controls the addition of the -B switch, treat this property as a boolean.

property C
Coefficient for downgrading mapping quality for

reads containing excessive mismatches.

Given a read with a phred-scaled probability q of being generated from the mapped position, the new mapping quality is about sqrt((INT-q)/INT)*INT. A zero value disables this functionality; if enabled, the recommended value for BWA is 50

This controls the addition of the -C parameter and its associated value. Set this property to the argument value required.

property D

Output per-sample read depth

This property controls the addition of the -D switch, treat this property as a boolean.

property E
Extended BAQ computation.

This option helps sensitivity especially for MNPs, but may hurt specificity a little bit

This property controls the addition of the -E switch, treat this property as a boolean.

property I

Do not perform INDEL calling

This property controls the addition of the -I switch, treat this property as a boolean.

property L
Skip INDEL calling if the average per-sample

depth is above INT

This controls the addition of the -L parameter and its associated value. Set this property to the argument value required.

property M

Cap Mapping Quality at M

This controls the addition of the -M parameter and its associated value. Set this property to the argument value required.

property Q

Minimum base quality for a base to be considered

This controls the addition of the -Q parameter and its associated value. Set this property to the argument value required.

property S
Output per-sample Phred-scaled

strand bias P-value

This property controls the addition of the -S switch, treat this property as a boolean.

property b

List of input BAM files, one file per line

This controls the addition of the -b parameter and its associated value. Set this property to the argument value required.

property d

At a position, read maximally INT reads per input BAM

This controls the addition of the -d parameter and its associated value. Set this property to the argument value required.

property e

Phred-scaled gap extension sequencing error probability.

Reducing INT leads to longer indels

This controls the addition of the -e parameter and its associated value. Set this property to the argument value required.

property f

The faidx-indexed reference file in the FASTA format.

The file can be optionally compressed by razip

This controls the addition of the -f parameter and its associated value. Set this property to the argument value required.

property g
Compute genotype likelihoods and output them in the

binary call format (BCF)

This property controls the addition of the -g switch, treat this property as a boolean.

property h

Coefficient for modeling homopolymer errors.

Given an l-long homopolymer run, the sequencing error of an indel of size s is modeled as INT*s/l

This controls the addition of the -h parameter and its associated value. Set this property to the argument value required.

property illumina_13

Assume the quality is in the Illumina 1.3+ encoding

This property controls the addition of the -6 switch, treat this property as a boolean.

property input_file

Input File for generating mpileup

This controls the addition of the input_file parameter and its associated value. Set this property to the argument value required.

property l
BED or position list file containing a list of regions

or sites where pileup or BCF should be generated

This controls the addition of the -l parameter and its associated value. Set this property to the argument value required.

property o

Phred-scaled gap open sequencing error probability.

Reducing INT leads to more indel calls.

This controls the addition of the -o parameter and its associated value. Set this property to the argument value required.

property p
Comma delimited list of platforms (determined by @RG-PL)

from which indel candidates are obtained.

It is recommended to collect indel candidates from sequencing technologies that have low indel error rate such as ILLUMINA

This controls the addition of the -p parameter and its associated value. Set this property to the argument value required.

property q

Minimum mapping quality for an alignment to be used

This controls the addition of the -q parameter and its associated value. Set this property to the argument value required.

property r

Only generate pileup in region STR

This controls the addition of the -r parameter and its associated value. Set this property to the argument value required.

property u
Similar to -g except that the output is

uncompressed BCF, which is preferred for piping

This property controls the addition of the -u switch, treat this property as a boolean.

class Bio.Sequencing.Applications.SamtoolsPhaseCommandline(cmd='samtools', **kwargs)

Bases: Bio.Application.AbstractCommandline

Command line wrapper for samtools phase.

Call and phase heterozygous SNPs, equivalent to:

$ samtools phase [-AF] [-k len] [-b prefix]
                 [-q minLOD] [-Q minBaseQ] <in.bam>

See http://samtools.sourceforge.net/samtools.shtml for more details

Examples

>>> from Bio.Sequencing.Applications import SamtoolsPhaseCommandline
>>> input_bam = "/path/to/in.bam"
>>> samtools_phase_cmd = SamtoolsPhaseCommandline(input_bam=input_bam)
>>> print(samtools_phase_cmd)
samtools phase /path/to/in.bam
__init__(cmd='samtools', **kwargs)

Initialize the class.

property A

Drop reads with ambiguous phase

This property controls the addition of the -A switch, treat this property as a boolean.

property F

Do not attempt to fix chimeric reads

This property controls the addition of the -F switch, treat this property as a boolean.

property Q
Minimum base quality to be

used in het calling

This controls the addition of the -Q parameter and its associated value. Set this property to the argument value required.

property b

Prefix of BAM output

This controls the addition of the -b parameter and its associated value. Set this property to the argument value required.

property in_bam

Input file

This controls the addition of the input parameter and its associated value. Set this property to the argument value required.

property k

Maximum length for local phasing

This controls the addition of the -k parameter and its associated value. Set this property to the argument value required.

property q
Minimum Phred-scaled LOD to

call a heterozygote

This controls the addition of the -q parameter and its associated value. Set this property to the argument value required.

class Bio.Sequencing.Applications.SamtoolsReheaderCommandline(cmd='samtools', **kwargs)

Bases: Bio.Application.AbstractCommandline

Command line wrapper for samtools reheader.

Replace the header in in.bam with the header in in.header.sam, equivalent to:

$ samtools reheader <in.header.sam> <in.bam>

See http://samtools.sourceforge.net/samtools.shtml for more details

Examples

>>> from Bio.Sequencing.Applications import SamtoolsReheaderCommandline
>>> input_header = "/path/to/header_sam_file"
>>> input_bam = "/path/to/input_bam_file"
>>> reheader_cmd = SamtoolsReheaderCommandline(input_header=input_header,
...                                            input_bam=input_bam)
>>> print(reheader_cmd)
samtools reheader /path/to/header_sam_file /path/to/input_bam_file
__init__(cmd='samtools', **kwargs)

Initialize the class.

property bam_file

BAM file for writing header to

This controls the addition of the input_bam parameter and its associated value. Set this property to the argument value required.

property sam_file

Sam file with header

This controls the addition of the input_header parameter and its associated value. Set this property to the argument value required.

class Bio.Sequencing.Applications.SamtoolsRmdupCommandline(cmd='samtools', **kwargs)

Bases: Bio.Application.AbstractCommandline

Command line wrapper for samtools rmdup.

Remove potential PCR duplicates, equivalent to:

$ samtools rmdup [-sS] <input.srt.bam> <out.bam>

See http://samtools.sourceforge.net/samtools.shtml for more details

Examples

>>> from Bio.Sequencing.Applications import SamtoolsRmdupCommandline
>>> input_sorted_bam = "/path/to/input.srt.bam"
>>> out_bam = "/path/to/out.bam"
>>> rmdup_cmd = SamtoolsRmdupCommandline(input_bam=input_sorted_bam,
...                                      out_bam=out_bam)
>>> print(rmdup_cmd)
samtools rmdup /path/to/input.srt.bam /path/to/out.bam
__init__(cmd='samtools', **kwargs)

Initialize the class.

property S
Treat paired-end reads

as single-end reads

This property controls the addition of the -S switch, treat this property as a boolean.

property input_file

Name Sorted Alignment File

This controls the addition of the in_bam parameter and its associated value. Set this property to the argument value required.

property output_file

Output file

This controls the addition of the out_bam parameter and its associated value. Set this property to the argument value required.

property s

Remove duplicates for single-end reads.

By default, the command works for paired-end reads only

This property controls the addition of the -s switch, treat this property as a boolean.

Bio.Sequencing.Applications.SamtoolsSortCommandline

alias of Bio.Sequencing.Applications._samtools.SamtoolsVersion0xSortCommandline

class Bio.Sequencing.Applications.SamtoolsVersion0xSortCommandline(cmd='samtools', **kwargs)

Bases: Bio.Application.AbstractCommandline

Command line wrapper for samtools version 0.1.x sort.

Concatenate BAMs, equivalent to:

$ samtools sort [-no] [-m maxMem] <in.bam> <out.prefix>

See http://samtools.sourceforge.net/samtools.shtml for more details

Examples

>>> from Bio.Sequencing.Applications import SamtoolsVersion0xSortCommandline
>>> input_bam = "/path/to/input_bam"
>>> out_prefix = "/path/to/out_prefix"
>>> samtools_sort_cmd = SamtoolsVersion0xSortCommandline(input=input_bam, out_prefix=out_prefix)
>>> print(samtools_sort_cmd)
samtools sort /path/to/input_bam /path/to/out_prefix
__init__(cmd='samtools', **kwargs)

Initialize the class.

property input

Input BAM file

This controls the addition of the input parameter and its associated value. Set this property to the argument value required.

property m

Approximately the maximum required memory

This controls the addition of the -m parameter and its associated value. Set this property to the argument value required.

property n
Sort by read names rather

than by chromosomal coordinates

This property controls the addition of the -n switch, treat this property as a boolean.

property o
Output the final alignment

to the standard output

This property controls the addition of the -o switch, treat this property as a boolean.

property out_prefix

Output prefix

This controls the addition of the out_prefix parameter and its associated value. Set this property to the argument value required.

class Bio.Sequencing.Applications.SamtoolsVersion1xSortCommandline(cmd='samtools', **kwargs)

Bases: Bio.Application.AbstractCommandline

Command line wrapper for samtools version 1.3.x sort.

Concatenate BAMs, equivalent to:

$ samtools sort [-n] [-T FREFIX] [-o file] [-I INT] [-m maxMem] <in.bam>

See http://samtools.sourceforge.net/samtools.shtml for more details

Examples

>>> from Bio.Sequencing.Applications import SamtoolsVersion1xSortCommandline
>>> input_bam = "/path/to/input_bam"
>>> FREFIX = "/path/to/out_prefix"
>>> file_name = "/path/to/out_file"
>>> samtools_sort_cmd = SamtoolsVersion1xSortCommandline(input=input_bam, T=FREFIX, o=file_name)
>>> print(samtools_sort_cmd)
samtools sort -o /path/to/out_file -T /path/to/out_prefix /path/to/input_bam
__init__(cmd='samtools', **kwargs)

Initialize the class.

property I
(INT) Set the desired compression level for the final output file,

ranging from 0 (uncompressed) or 1 (fastest but minimal compression) to 9 (best compression but slowest to write), similarly to gzip(1)’s compression level setting.

This controls the addition of the -I parameter and its associated value. Set this property to the argument value required.

property O

(FORMAT) Write the final output as sam, bam, or cram

This controls the addition of the -O parameter and its associated value. Set this property to the argument value required.

property T
(PREFIX) Write temporary files to PREFIX.nnnn.bam, or if the specified PREFIX

is an existing directory, to PREFIX/samtools.mmm.mmm.tmp.nnnn.bam, where mmm is unique to this invocation of the sort command

This controls the addition of the -T parameter and its associated value. Set this property to the argument value required.

property input

Input SAM/BAM/CRAM file

This controls the addition of the input parameter and its associated value. Set this property to the argument value required.

property m

Approximately the maximum required memory

This controls the addition of the -m parameter and its associated value. Set this property to the argument value required.

property n
Sort by read names rather

than by chromosomal coordinates

This property controls the addition of the -n switch, treat this property as a boolean.

property o
(file) Write the final sorted output to FILE,

rather than to standard output

This controls the addition of the -o parameter and its associated value. Set this property to the argument value required.

class Bio.Sequencing.Applications.SamtoolsTargetcutCommandline(cmd='samtools', **kwargs)

Bases: Bio.Application.AbstractCommandline

Command line wrapper for samtools targetcut.

This command identifies target regions by examining the continuity of read depth, computes haploid consensus sequences of targets and outputs a SAM with each sequence corresponding to a target, equivalent to:

$ samtools targetcut [-Q minBaseQ] [-i inPenalty] [-0 em0]
                     [-1 em1] [-2 em2] [-f ref] <in.bam>

See http://samtools.sourceforge.net/samtools.shtml for more details

Examples

>>> from Bio.Sequencing.Applications import SamtoolsTargetcutCommandline
>>> input_bam = "/path/to/aln.bam"
>>> samtools_targetcut_cmd = SamtoolsTargetcutCommandline(input_bam=input_bam)
>>> print(samtools_targetcut_cmd)
samtools targetcut /path/to/aln.bam
__init__(cmd='samtools', **kwargs)

Initialize the class.

property Q

Minimum Base Quality

This controls the addition of the -Q parameter and its associated value. Set this property to the argument value required.

property em0

This controls the addition of the -0 parameter and its associated value. Set this property to the argument value required.

property em1

This controls the addition of the -1 parameter and its associated value. Set this property to the argument value required.

property em2

This controls the addition of the -2 parameter and its associated value. Set this property to the argument value required.

property f

Reference Filename

This controls the addition of the -f parameter and its associated value. Set this property to the argument value required.

property i

Insertion Penalty

This controls the addition of the -i parameter and its associated value. Set this property to the argument value required.

property in_bam

Input file

This controls the addition of the input parameter and its associated value. Set this property to the argument value required.