Powered by

smallRNAseq Sample Report

Loading report..

Highlight Samples

Regex mode off

    Rename Samples

    Click here for bulk input.

    Paste two columns of a tab-delimited table here (eg. from Excel).

    First column should be the old name, second column the new name.

    Regex mode off

      Show / Hide Samples

      Regex mode off

        Export Plots

        px
        px
        X

        Download the raw data used to create the plots in this report below:

        Note that additional data was saved in smallRNAseq_Sample_Report_multiqc_report_data when this report was generated.


        Choose Plots

        If you use plots from MultiQC in a publication or presentation, please cite:

        MultiQC: Summarize analysis results for multiple tools and samples in a single report
        Philip Ewels, Måns Magnusson, Sverker Lundin and Max Käller
        Bioinformatics (2016)
        doi: 10.1093/bioinformatics/btw354
        PMID: 27312411

        Save Settings

        You can save the toolbox settings for this report to the browser.


        Load Settings

        Choose a saved report profile from the dropdown box below:

        About MultiQC

        This report was generated using MultiQC, version 1.9

        You can see a YouTube video describing how to use MultiQC reports here: https://youtu.be/qPbIlO_KWN0

        For more information about MultiQC, including other videos and extensive documentation, please visit http://multiqc.info

        You can report bugs, suggest improvements and find the source code for MultiQC on GitHub: https://github.com/ewels/MultiQC

        MultiQC is published in Bioinformatics:

        MultiQC: Summarize analysis results for multiple tools and samples in a single report
        Philip Ewels, Måns Magnusson, Sverker Lundin and Max Käller
        Bioinformatics (2016)
        doi: 10.1093/bioinformatics/btw354
        PMID: 27312411

        smallRNAseq Sample Report

        This report includes summaries of data quality, data processing, and snapshots of results for your small RNAseq study. This report should assist you to get a general picture of the study, to spot any irregularities in the sample or data, and to explore the most significant results in differential gene expression. Please consult our small RNAseq report documentation on how to use this report.

        If you have any question regarding this report, please contact the Zymo representative who sent you this report.

        General Statistics

        Showing 9/9 rows and 6/12 columns.
        Sample NameM Seqs% Trimmed GC% Reads PF% miRNANo. miRNA% non-miRNA RNA types
        CD4_S1
        3.2
        45.0%
        82.2%
        59.8%
        695
        32.0%
        CD4_S2
        2.5
        47.0%
        81.1%
        33.9%
        569
        16.0%
        CD4_S4
        3.4
        46.0%
        81.8%
        54.1%
        671
        37.7%
        NK_S2
        3.2
        49.0%
        45.3%
        34.1%
        531
        7.4%
        NK_S3
        3.1
        50.0%
        81.5%
        41.7%
        627
        20.4%
        NK_S4
        6.9
        50.0%
        69.7%
        37.5%
        791
        33.3%
        monocytes_S2
        1.9
        48.0%
        77.8%
        43.8%
        648
        13.0%
        monocytes_S3
        6.1
        52.0%
        54.2%
        37.7%
        778
        28.3%
        monocytes_S4
        4.4
        50.0%
        85.4%
        30.3%
        770
        35.8%

        FastQC

        FastQC is a quality control tool for high throughput sequence data, written by Simon Andrews at the Babraham Institute in Cambridge.

        Sequence Quality Histograms

        The mean quality value across each base position in the read.

        To enable multiple samples to be plotted on the same graph, only the mean quality scores are plotted (unlike the box plots seen in FastQC reports).

        Taken from the FastQC help:

        The y-axis on the graph shows the quality scores. The higher the score, the better the base call. The background of the graph divides the y axis into very good quality calls (green), calls of reasonable quality (orange), and calls of poor quality (red). The quality of calls on most platforms will degrade as the run progresses, so it is common to see base calls falling into the orange area towards the end of a read.

        loading..

        Per Sequence Quality Scores

        The number of reads with average quality scores. Shows if a subset of reads has poor quality.

        From the FastQC help:

        The per sequence quality score report allows you to see if a subset of your sequences have universally low quality values. It is often the case that a subset of sequences will have universally poor quality, however these should represent only a small percentage of the total sequences.

        loading..

        Per Base N Content

        The percentage of base calls at each position for which an N was called.

        From the FastQC help:

        If a sequencer is unable to make a base call with sufficient confidence then it will normally substitute an N rather than a conventional base call. This graph shows the percentage of base calls at each position for which an N was called.

        It's not unusual to see a very low proportion of Ns appearing in a sequence, especially nearer the end of a sequence. However, if this proportion rises above a few percent it suggests that the analysis pipeline was unable to interpret the data well enough to make valid base calls.

        loading..

        Sequence Length Distribution

        All samples have sequences of a single length (51bp).

        Adapter Content

        The cumulative percentage count of the proportion of your library which has seen each of the adapter sequences at each position.

        Note that only samples with ≥ 0.1% adapter contamination are shown.

        There may be several lines per sample, as one is shown for each adapter detected in the file.

        From the FastQC Help:

        The plot shows a cumulative percentage count of the proportion of your library which has seen each of the adapter sequences at each position. Once a sequence has been seen in a read it is counted as being present right through to the end of the read so the percentages you see will only increase as the read length goes on.

        loading..

        Trim Galore

        Trim Galore is a wrapper around Cutadapt and FastQC to consistently apply adpater and quality trimming to FastQ files.

        Filtered Reads

        This plot shows the number of reads (SE) / pairs (PE) removed by Trim Galore.

        Read filtering criteria such as minimum and maximum read length can be found in Workflow Summary section.

        loading..

        Trimmed Sequence Lengths

        This plot shows the number of reads with certain lengths of adapter trimmed. Quality trimmed and hard trimmed sequences are not included.

        Obs/Exp shows the raw counts divided by the number expected due to sequencing errors. A defined peak may be related to adapter length. See the cutadapt documentation for more information on how these numbers are generated.

        loading..

        miRTrace

        miRTrace is a quality control software for small RNA sequencing data developed by Friedländer lab (KTH, Sweden).

        Read Length Distribution

        loading..

        Contamination Check

        loading..

        miRNA Complexity

        loading..

        Estimated RNA Type Counts

        The following bargraph provides an estimate of RNA types in the sample. Reads were mapped to RNA reference sequences with Bowtie, and subsequent transcript quantification was completed with RSEM. Current piRNA database entries may contain significant sequential overlap with miRNA entries and negatively affect miRNA results. To preserve miRNA results, piRNA were not included in this analysis. "Unmapped" reads may include RNA types that the pipeline does not test for, such as circRNA and piRNA. "miscellaneous" RNA, or miscRNA, includes unclassified RNA types, such as vault RNA and YRNA.

        loading..

        miRNA sample similarities

        The following sample similarities plots were generated from only miRNA transcript data using isomiRs.

        Similarity matrix of samples

        The similarities (Pearson correlation coefficient) between samples are visualized here in the form of heatmap. Larger values indicate higher similarity between samples. The similarities were calculated using Log2 values of normalized read counts of all mature miRNAs using isomiRs.

        loading..

        Multidimensional scaling analysis of samples

        Multidimensional scaling was conducted to visualize the distance/similarity between samples.

        loading..

        Sample similarities in non-coding genes excluding miRNAs and rRNAs

        The following sample similarities plots were generated from tRNA, mitochondrial tRNA, lncRNA, snoRNA, scaRNA, snRNA, and miscRNA gene data using DeSeq2.

        Similarity matrix of samples

        The similarities (Pearson correlation coefficient) between samples are visualized here in the form of heatmap. Larger values indicate higher similarity between samples. The similarities were calculated using Log2 values of normalized read counts of genes using DESeq2.

        loading..

        Multidimensional scaling analysis of samples

        Multidimensional scaling was conducted to visualize the distance/similarity between samples.

        loading..

        miRNA heatmap

        The following heatmap was generated from only miRNA data using isomiRs.

        Normalized read counts of top miRNAs with highest variance, calculated using isomiRs. Values plotted in Log2 scale after centering per miRNA. A static version of this figure can be download in the Download data section.

        loading..

        Gene heatmap for non-coding genes excluding miRNAs and rRNAs

        The following heatmap was generated from tRNA, mitochondrial tRNA, lncRNA, snoRNA, scaRNA, snRNA, and miscRNA gene data using DeSeq2. For simplification, this report treats tRNA gene copies with duplicate sequences as belonging to one gene.

        Normalized read counts of top genes with highest variance, calculated using DESeq2. Values plotted in Log2 scale. A static version of this figure can be download in the Download data section.

        loading..

        miRNA Differential Expression

        isomiRs The following differential expression analysis was generated using only miRNA data with isomiRs.

        Summary table of miRNAs differential expression

        General statistics of differentially expressed miRNAs in pairwise comparisons. miRNAs with adjusted p-values smaller than 0.05 were considered differentially expressed.

        Showing 3/3 rows and 4/4 columns.
        Comparison(group1_vs_group2)Higher in group 1Higher in group 2Not differentially expressedDid not pass filter
        CD4_vs_NK
        43
        27
        344
        238
        CD4_vs_monocytes
        73
        71
        270
        238
        monocytes_vs_NK
        51
        50
        339
        212

        Scatter plot

        Scatter plot is a simple and straightforward way to visualize differential gene expression results. Expression levels of miRNAs in one group are shown on X-axis while those in other are shown on Y-axis.
        Red dots represent differentially expressed miRNAs (adjusted p-values<0.05). Grey dots represent non-differentially expressed miRNAs.

        loading..

        MA plot

        MA plot is a type of visualization of differential gene expression results often used in publications. Mean expression levels are shown on X-axis while Log2 of fold changes are shown on Y-axis.
        Red dots represent differentially expressed miRNAs (adjusted p-values<0.05). Grey dots represent non-differentially expressed miRNAs. Shrinkage of effect size was carried out using the 'ashr' method in DESeq2

        loading..

        Top differentially expressed miRNAs in comparison CD4 vs. NK

        Top 50 differentially expressed miRNAs, ranked by FDR, in comparison CD4 vs. NK. Full isomiRs results can be downloaded in the Download data section.
        miRNAs with positive Log2 fold changes have higher expression in CD4. miRNAs with negative Log2 fold changes have higher expression in NK.

        Showing 50/50 rows and 4/4 columns.
        RankGene nameMean normalized countsLog2 Fold changeFalse discovery rate
        117336.851.7e-42
        22345.381.0e-20
        32718-4.082.0e-20
        4132-3.962.4e-17
        574-5.391.7e-15
        6705.631.3e-13
        71145.451.4e-12
        8277-3.665.8e-12
        94918-2.728.1e-12
        101683.569.9e-12
        1121822.651.4e-11
        1228432.942.1e-08
        13623.678.8e-08
        146922.313.5e-07
        15323.487.7e-07
        162252.261.1e-06
        17393.361.3e-06
        1836-5.261.3e-06
        1972792.411.3e-06
        20145539-2.331.7e-06
        2128-7.551.7e-06
        221002.053.7e-06
        23105782.309.9e-06
        241762-1.721.0e-05
        256563.622.2e-05
        26772.962.8e-05
        2711871.623.6e-05
        28708352.125.1e-05
        29197.029.0e-05
        30933.111.1e-04
        31114-2.301.3e-04
        324571.561.4e-04
        33883871.681.4e-04
        3421072.241.6e-04
        35263341.482.1e-04
        36125.192.1e-04
        37516342.373.0e-04
        38542551.664.9e-04
        39136.445.2e-04
        403873.335.2e-04
        413529-2.536.4e-04
        422491.549.4e-04
        4327319-1.761.3e-03
        4410721.561.7e-03
        45109-2.231.8e-03
        46342.231.8e-03
        4714-5.591.9e-03
        4864-2.114.2e-03
        493202.054.2e-03
        50928-1.456.6e-03

        Top differentially expressed miRNAs in comparison CD4 vs. monocytes

        Top 50 differentially expressed miRNAs, ranked by FDR, in comparison CD4 vs. monocytes. Full isomiRs results can be downloaded in the Download data section.
        miRNAs with positive Log2 fold changes have higher expression in CD4. miRNAs with negative Log2 fold changes have higher expression in monocytes.

        Showing 50/50 rows and 4/4 columns.
        RankGene nameMean normalized countsLog2 Fold changeFalse discovery rate
        111875.901.8e-106
        25163410.044.3e-105
        344544.821.9e-72
        41762-4.489.3e-66
        53015.911.3e-60
        617338.081.3e-60
        772795.571.9e-47
        86116.231.2e-45
        92258.527.8e-36
        104075.266.3e-35
        113529-6.501.5e-32
        12286095.034.8e-32
        132346.161.7e-29
        142154.771.4e-27
        15263342.682.2e-22
        1648704.321.1e-20
        1748874.112.3e-20
        1838710.533.6e-20
        19775.039.7e-20
        2021074.113.9e-19
        211684.298.2e-19
        22370-3.062.4e-17
        23705.543.8e-17
        242492.692.3e-16
        2588-5.301.3e-15
        26873453.184.5e-14
        273203.751.1e-12
        281144.931.3e-12
        2910722.502.8e-12
        30928-2.464.4e-12
        31397.385.7e-12
        32624.211.0e-11
        3340252.861.3e-11
        34106-3.802.8e-11
        355274.193.3e-11
        3669-5.015.7e-11
        37883872.296.5e-11
        38327-4.027.4e-11
        3965-4.171.4e-10
        40118261.601.1e-09
        41119-5.281.6e-09
        4218312.313.5e-09
        43105782.697.8e-09
        4427-8.338.3e-09
        4530-8.581.2e-08
        46286-5.054.6e-08
        47109-3.124.9e-08
        4873-5.155.5e-08
        49254.416.4e-08
        5095-5.511.5e-07

        Top differentially expressed miRNAs in comparison monocytes vs. NK

        Top 50 differentially expressed miRNAs, ranked by FDR, in comparison monocytes vs. NK. Full isomiRs results can be downloaded in the Download data section.
        miRNAs with positive Log2 fold changes have higher expression in monocytes. miRNAs with negative Log2 fold changes have higher expression in NK.

        Showing 50/50 rows and 4/4 columns.
        RankGene nameMean normalized countsLog2 Fold changeFalse discovery rate
        151634-7.671.7e-58
        21187-4.281.1e-50
        32718-5.344.6e-38
        4301-4.787.7e-37
        521824.226.9e-36
        6611-5.521.3e-34
        74454-3.471.1e-33
        84887-5.104.8e-33
        928609-4.782.2e-28
        101831-3.652.0e-27
        1117622.763.9e-22
        12225-6.264.2e-18
        13886.931.4e-16
        143703.021.4e-16
        154870-3.893.3e-16
        164918-2.974.4e-15
        17145539-3.221.1e-14
        18132-3.256.5e-14
        19407-3.546.5e-14
        207279-3.165.7e-13
        21696.366.8e-13
        22215-3.411.9e-12
        2374-9.812.0e-12
        24542552.505.6e-12
        2535293.962.8e-11
        261195.886.6e-11
        27387-7.204.5e-09
        28653.746.2e-09
        29735.541.2e-08
        3036-8.771.7e-08
        3128-8.471.8e-08
        322873.422.0e-08
        3377-3.454.1e-08
        34305.929.2e-08
        351043.315.2e-07
        364025-2.317.7e-07
        37273.807.8e-07
        38525.173.7e-06
        39225.434.0e-06
        401053.885.5e-06
        41210-11.351.3e-05
        422541.861.3e-05
        431425-1.982.1e-05
        4487345-2.142.3e-05
        45114-2.352.5e-05
        4611826-1.332.6e-05
        4728432.353.0e-05
        4864-2.554.1e-05
        49175.301.0e-04
        5021-3.531.4e-04

        Differential expression of non-coding genes except miRNAs and rRNAs

        The following differential expression analysis was generated from tRNA, mitochondrial tRNA, lncRNA, snoRNA, scaRNA, snRNA, and miscRNA gene data using DeSeq2. For simplification, this report treats tRNA gene copies with duplicate sequences as belonging to one gene.

        Summary table of differential expression in non-coding genes except miRNAs and rRNAs

        General statistics of differentially expressed genes in pairwise comparisons. Genes with adjusted p-values smaller than 0.05 were considered differentially expressed.

        Showing 3/3 rows and 4/4 columns.
        Comparison(group1_vs_group2)Higher in group 1Higher in group 2Not differentially expressedDid not pass filter
        CD4_vs_NK
        0
        0
        8650
        14496
        CD4_vs_monocytes
        13
        4
        1112
        22017
        monocytes_vs_NK
        0
        1
        8649
        14496

        Scatter plot

        Scatter plot is a simple and straightforward way to visualize differential expression results. Expression levels of genes in one group are shown on X-axis while those in other are shown on Y-axis.
        Red dots represent differentially expressed genes (adjusted p-values<0.05). Grey dots represent non-differentially expressed genes. Non-differentially expressed genes are downsampled to 1000 randomly chosen data points.

        loading..

        MA plot

        MA plot is a type of visualization of differential expression results often used in publications. Mean expression levels are shown on X-axis while Log2 of fold changes are shown on Y-axis.
        Red dots represent differentially expressed genes (adjusted p-values<0.05). Grey dots represent non-differentially expressed genes. Shrinkage of effect size was carried out using the 'ashr' method in DESeq2.
        Non-differentially expressed genes are downsampled to 1000 randomly chosen data points.

        loading..

        Top differentially expressed genes in comparison CD4 vs. NK

        No significant differential expression results were found for this comparison.


        Top differentially expressed genes in comparison CD4 vs. monocytes

        Top differentially expressed genes, ranked by FDR, in comparison CD4 vs. monocytes. Full DeSeq2 results can be downloaded in the Download data section.
        Genes with positive Log2 fold changes have higher expression in CD4. Genes with negative Log2 fold changes have higher expression in monocytes.

        Showing 17/17 rows and 5/5 columns.
        RankGene IDGene NameMean normalized countsLog2 Fold changeFalse discovery rate
        1ENSG000002745541549.961.5e-06
        2Y_RNA32-9.627.7e-05
        3SNORD641615.317.7e-05
        4SNORD107735.406.0e-04
        5ENSG000002521391714.613.9e-03
        6SNORD6824934.538.1e-03
        7SNORD8414354.688.1e-03
        8DNM3OS13-6.352.5e-02
        9ENSG0000025776412-7.042.5e-02
        10SNORD91B2103.664.8e-02
        11GAS5358993.534.8e-02
        12SNORD1053543.604.8e-02
        13SNORD8232833.824.8e-02
        14ENSG000002738851253.944.8e-02
        15ENSG00000260464375-5.244.8e-02
        16SNORD9534083.604.8e-02
        17SNORD91A8883.544.9e-02

        Top differentially expressed genes in comparison monocytes vs. NK

        Top differentially expressed genes, ranked by FDR, in comparison monocytes vs. NK. Full DeSeq2 results can be downloaded in the Download data section.
        Genes with positive Log2 fold changes have higher expression in monocytes. Genes with negative Log2 fold changes have higher expression in NK.

        Showing 1/1 rows and 5/5 columns.
        RankGene IDGene NameMean normalized countsLog2 Fold changeFalse discovery rate
        1ENSG00000274554154-7.773.1e-02

        Software Versions

        Software versions are collected at run time from the software output. This pipeline is adapted from nf-core smRNAseq pipeline.

        smrnaseq pipeline
        v2.2.0
        Nextflow
        v22.10.4
        R
        v4.0.5
        FastQC
        v0.11.9
        Trim Galore!
        v0.6.6
        Bowtie
        v1.3.0
        Samtools
        v1.16.1
        FASTX
        v0.0.14
        miRTrace
        v1.0.1
        rsem
        v1.2.28
        tximport
        v1.18.0
        mirtop
        v0.4.23
        isomiRs
        v1.18.1
        DESeq2
        v1.30.1

        Workflow Summary

        This section summarizes important parameters used in the pipeline. They were collected when the pipeline was started.

        Genome
        GRCh38
        Protocol
        zymo
        Downsampling cutoff
        None
        Min Trimmed Length
        18
        Max Trimmed Length
        None
        isomiRs FDR cutoff
        0.05
        isomiRs Log2FC cutoff
        0.585
        Trim 5' R1
        1
        Trim 3' R1
        0
        3' adapter
        TGGAATTCTCGGGTGCCAAGG

        Report generated on 2024-01-26, 22:04.