BRESEQ :: Summary Statistics

breseq version 0.35.4 revision f352f80f4bc9
mutation predictions | marginal predictions | summary statistics | genome diff | command line log

Read File Information

	read file	reads	bases	passed filters	average	longest	mapped
errors	TF_KALE_lrp_A20_F88_I0_R1_S94_L001_R1_001.good.fq	348,832	45,036,279	100.0%	129.1 bases	137 bases	97.7%
errors	TF_KALE_lrp_A20_F88_I0_R1_S94_L001_R2_001.good.fq	348,832	45,037,025	100.0%	129.1 bases	137 bases	95.9%
errors	TF_KALE_lrp_A20_F88_I0_R1_S94_L002_R1_001.good.fq	347,176	44,787,545	100.0%	129.0 bases	137 bases	98.4%
errors	TF_KALE_lrp_A20_F88_I0_R1_S94_L002_R2_001.good.fq	347,176	44,789,406	100.0%	129.0 bases	137 bases	97.1%
errors	TF_KALE_lrp_A20_F88_I0_R1_S94_L003_R1_001.good.fq	348,725	44,974,687	100.0%	129.0 bases	137 bases	98.3%
errors	TF_KALE_lrp_A20_F88_I0_R1_S94_L003_R2_001.good.fq	348,725	44,976,482	100.0%	129.0 bases	137 bases	97.3%
errors	TF_KALE_lrp_A20_F88_I0_R1_S94_L004_R1_001.good.fq	355,869	45,823,680	100.0%	128.8 bases	137 bases	98.7%
errors	TF_KALE_lrp_A20_F88_I0_R1_S94_L004_R2_001.good.fq	355,869	45,825,338	100.0%	128.8 bases	137 bases	97.9%
	total	2,801,204	361,250,442	100.0%	129.0 bases	137 bases	97.7%

		seq id	length	fit mean	fit dispersion	% mapped reads	description
coverage	distribution	NC_000913	4,641,652	74.3	4.2	100.0%	Escherichia coli str. K-12 substr. MG1655, complete genome.
		total	4,641,652			100.0%

fit dispersion is the ratio of the variance to the mean for the negative binomial fit. It is =1 for Poisson and >1 for over-dispersed data.

option	limit	actual
Number of alignment pairs examined for constructing junction candidates	≤ 100000	1696
Coverage evenness (position-hash) score of junction candidates	≥ 2	≥ 2
Test this many junction candidates (n). May be smaller if not enough passed the coverage evenness threshold	100 ≤ n ≤ 5000	171
Total length of all junction candidates (factor times the reference genome length)	≤ 0.1	0.010

reference sequence	pr(no read start)
NC_000913	0.83110

pr(no read start) is the probability that there will not be an aligned read whose first base matches a given position on a given strand.

option	value
Coverage evenness (position-hash) score of predicted junctions must be	≥ 3
Minimum probablilty assigned that no mapped read will start at a given position and strand for junction prediction	0.1
Junction allow suboptimal matches	FALSE
Skew score of predicted junction (−log10 probability of unusual coverage evenness) must be	≤ 0
Number of bases that at least one read must overlap each uniquely aligned side of a predicted junction	≥ 6

option	value
Mode	Full Polymorphism
Ploidy	1 (haploid)
Consensus mutation E-value cutoff	10
Consensus frequency cutoff	0.75
Consensus minimum variant coverage each strand	OFF
Consensus minimum total coverage each strand	OFF
Consensus minimum variant coverage	OFF
Consensus minimum total coverage	OFF
Polymorphism E-value cutoff	2
Polymorphism frequency cutoff	0.05
Polymorphism minimum variant coverage each strand	2
Polymorphism minimum total coverage each strand	OFF
Polymorphism minimum variant coverage	OFF
Polymorphism minimum total coverage	OFF
Polymorphism bias cutoff	OFF
Predict indel polymorphisms	YES
Skip indel polymorphisms in homopolymers runs of	≥3 bases
Skip base substitutions when they create a homopolymer flanked on each side by	≥5 bases

program	version
bowtie2	2.2.6
R	3.4.4

step	start	end	elapsed
Read and reference sequence file input	21:22:44 22 Apr 2021	21:23:31 22 Apr 2021	47 seconds
Read alignment to reference genome	21:23:31 22 Apr 2021	21:29:07 22 Apr 2021	5 minutes 36 seconds
Preprocessing alignments for candidate junction identification	21:29:07 22 Apr 2021	21:29:40 22 Apr 2021	33 seconds
Preliminary analysis of coverage distribution	21:29:40 22 Apr 2021	21:31:15 22 Apr 2021	1 minute 35 seconds
Identifying junction candidates	21:31:15 22 Apr 2021	21:31:15 22 Apr 2021	0 seconds
Re-alignment to junction candidates	21:31:15 22 Apr 2021	21:32:38 22 Apr 2021	1 minute 23 seconds
Resolving best read alignments	21:32:38 22 Apr 2021	21:33:26 22 Apr 2021	48 seconds
Creating BAM files	21:33:26 22 Apr 2021	21:34:52 22 Apr 2021	1 minute 26 seconds
Tabulating error counts	21:34:52 22 Apr 2021	21:35:25 22 Apr 2021	33 seconds
Re-calibrating base error rates	21:35:25 22 Apr 2021	21:35:27 22 Apr 2021	2 seconds
Examining read alignment evidence	21:35:27 22 Apr 2021	21:58:44 22 Apr 2021	23 minutes 17 seconds
Polymorphism statistics	21:58:44 22 Apr 2021	21:58:45 22 Apr 2021	1 second
Output	21:58:45 22 Apr 2021	21:58:57 22 Apr 2021	12 seconds
Total			36 minutes 13 seconds