MoP4 - pipeline
- Please cite
- Cozzuto L, Liu H, Pryszcz LP, Pulido TH, Delgado-Tejedor A, Ponomarenko J, Novoa EM. MasterOfPores: A Workflow for the Analysis of Oxford Nanopore Direct RNA Sequencing Datasets. Front Genet. 2020 Mar 17;11:211. doi: 10.3389/fgene.2020.00211
- And
- Cozzuto L, Delgado-Tejedor A, Hermoso Pulido T, Novoa EM, Ponomarenko J. Nanopore Direct RNA Sequencing Data Processing and Analysis Using MasterOfPores. N. Methods Mol Biol. 2023;2624:185-205. doi: 10.1007/978-1-0716-2962-8_13.
/Users/lcozzuto/ooo/master_of_pores/mop_preprocess/work/2f/23940b8b15fa3637ffdc8db2bfc7ba
General Statistics
Sample Name | Reads | Bases | Read N50 | Longest Read | Shortest Read | Mean Length | Median Length | Mean Qual | Median Qual | Median length | Read N50 | Median Qual | Median Identity | # Reads (K) | Total Bases (Mb) | Aligned Bases (Mb) |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
mod | 0.0M | 0.4Mb | 1753bp | 3529bp | 1bp | 867.0bp | 519 | 6.5 | 6.8 | 1227bp | 1808bp | 7.8 | 83.3% | 0.3K | 0.4Mb | 0.4Mb |
wt | 0.0M | 0.4Mb | 1762bp | 6772bp | 1bp | 838.0bp | 413 | 6.2 | 6.8 | 1444bp | 1783bp | 8.2 | 84.6% | 0.2K | 0.3Mb | 0.3Mb |
Assigned counts
File name | Counts |
---|---|
mod | 256 |
wt | 233 |
Alignment QC
File name | Mapped reads | Map % | Bases | Bases % | Avg read length | Max read length | identity |
---|---|---|---|---|---|---|---|
mod_s.bam | 254 | 100.0% | 363166 | 97.9% | 1460 | 3529 | 81.20% |
wt_s.bam | 231 | 100.0% | 338752 | 98.0% | 1496 | 3456 | 82.46% |
nanoq
Reports read quality and length from nanopore sequencing data.URL: https://github.com/nerdna/nanoqDOI: 10.21105/joss.02991
Nanoq Summary
Statistics from Nanoq reports
Sample Name | Reads | Bases | Read N50 | Longest Read | Shortest Read | Mean Length | Median Length | Mean Qual | Median Qual |
---|---|---|---|---|---|---|---|---|---|
mod | 0.0M | 0.4Mb | 1753bp | 3529bp | 1bp | 867.0bp | 519 | 6.5 | 6.8 |
wt | 0.0M | 0.4Mb | 1762bp | 6772bp | 1bp | 838.0bp | 413 | 6.2 | 6.8 |
Read quality
Read counts categorised by read quality (Phred score).
Sequencing machines assign each generated read a quality score using the Phred scale. The phred score represents the liklelyhood that a given read contains errors. High quality reads have a high score.
Read lengths
Read counts categorised by read length.
NanoStat
Reports various statistics for long read dataset in FASTQ, BAM, or albacore sequencing summary format (supports NanoPack; NanoPlot, NanoComp).URL: https://github.com/wdecoster/nanostat; https://github.com/wdecoster/nanoplotDOI: 10.1093/bioinformatics/bty149
Programs are part of the NanoPack family for summarising results of sequencing on Oxford Nanopore methods (MinION, PromethION etc.)Summary Statistics (Aligned)
Sample Name | Median length | Mean length | Read N50 | Median Qual | Mean Qual | Median Identity | Mean Identity | # Reads (K) | Total Bases (Mb) | Aligned Bases (Mb) |
---|---|---|---|---|---|---|---|---|---|---|
mod | 1227bp | 1450bp | 1808bp | 7.8 | 7.8 | 83.3% | 82.2% | 0.3K | 0.4Mb | 0.4Mb |
wt | 1444bp | 1488bp | 1783bp | 8.2 | 8.1 | 84.6% | 83.3% | 0.2K | 0.3Mb | 0.3Mb |
Reads by quality
Read counts categorised by read quality (Phred score).
Sequencing machines assign each generated read a quality score using the Phred scale. The phred score represents the liklelyhood that a given read contains errors. High quality reads have a high score.