ChIPseq report for sample_report: MultiQC Report

General Statistics

Showing ⁷/₇ rows and ⁸/₁₉ columns.

Sample Name	M Reads Mapped	M Seqs	% GC	% Trimmed	% Mapped	Frag Length	NSC	RSC	% Dups	Length	% Failed	Error rate	M Reads Mapped	% Proper Pairs	% MapQ 0 Reads	M Total seqs	Number of peaks	FRiP score
BT474_1	23.1	23.9	42%	2.3%	97.3%	140	1.49	1.39	4.7%	36 bp	0%	0.12%	23.1	0.0%	0.7%	23.7	36147	0.118
BT474_2	23.1	23.8	44%	2.2%	97.5%	135	1.59	1.37	4.2%	36 bp	0%	0.12%	23.1	0.0%	0.7%	23.7	35368	0.129
BT474_Control	16.9	17.4	44%	2.1%	97.6%	105	1.02	0.88	1.2%	36 bp	0%	0.12%	16.9	0.0%	0.8%	17.3
MCF7_1	21.1	22.3	46%	2.7%	95.6%	120	2.48	1.33	8.7%	36 bp	0%	0.21%	21.1	0.0%	0.6%	22.1	78333	0.268
MCF7_2	21.5	22.7	43%	2.9%	95.9%	140	1.59	1.47	2.3%	36 bp	0%	0.15%	21.5	0.0%	0.7%	22.4	53356	0.148
MCF7_3	27.9	29.2	42%	2.7%	96.5%	140	1.81	1.40	4.3%	36 bp	0%	0.13%	27.9	0.0%	0.6%	28.9	63722	0.201
MCF7_Control	21.2	22.2	45%	2.8%	96.4%	105	1.03	0.80	0.6%	36 bp	0%	0.13%	21.2	0.0%	0.8%	22.0

Uncheck the tick box to hide columns. Click and drag the handle on the left to change order.

Sort	Group	Column	Description	ID	Scale
\|\|	Samtools Flagstat	M Reads Mapped	Reads Mapped in the bam file (millions)	`mapped_passed`	read_count
\|\|	FastQC	M Seqs	Total Sequences (millions)	`total_sequences`	read_count
\|\|	FastQC	% GC	Average % GC Content	`percent_gc`	None
\|\|	Cutadapt	% Trimmed	% Total Base Pairs trimmed	`percent_trimmed`	None
\|\|	Samtools Stats	% Mapped	% Mapped Reads	`reads_mapped_percent`	None
\|\|	phantompeakqualtools	Frag Length	Estimated fragment length (bp)	`Estimated_Fragment_Length_bp`	None
\|\|	phantompeakqualtools	NSC	Normalized strand cross-correlation	`NSC`	None
\|\|	phantompeakqualtools	RSC	Relative strand cross-correlation	`RSC`	None
\|\|	FastQC	% Dups	% Duplicate Reads	`percent_duplicates`	None
\|\|	FastQC	Length	Average Sequence Length (bp)	`avg_sequence_length`	None
\|\|	FastQC	% Failed	Percentage of modules failed in FastQC report (includes those not plotted here)	`percent_fails`	None
\|\|	Samtools Stats	Error rate	Error rate: mismatches (NM) / bases mapped (CIGAR)	`error_rate`	None
\|\|	Samtools Stats	M Non-Primary	Non-primary alignments (millions)	`non-primary_alignments`	read_count
\|\|	Samtools Stats	M Reads Mapped	Reads Mapped in the bam file (millions)	`reads_mapped`	read_count
\|\|	Samtools Stats	% Proper Pairs	% Properly Paired Reads	`reads_properly_paired_percent`	None
\|\|	Samtools Stats	% MapQ 0 Reads	% of Reads that are Ambiguously Placed (MapQ=0)	`reads_MQ0_percent`	None
\|\|	Samtools Stats	M Total seqs	Total sequences in the bam file (millions)	`raw_total_sequences`	read_count
\|\|	MACS2	Number of peaks	Number of peaks called	`num_peaks`	None
\|\|	MACS2	FRiP score	Fraction of reads in peaks	`frip_score`	None

FastQC

FastQC is a quality control tool for high throughput sequence data, written by Simon Andrews at the Babraham Institute in Cambridge.

Sequence Quality Histograms

The mean quality value across each base position in the read.

To enable multiple samples to be plotted on the same graph, only the mean quality scores are plotted (unlike the box plots seen in FastQC reports).

Taken from the FastQC help:

The y-axis on the graph shows the quality scores. The higher the score, the better the base call. The background of the graph divides the y axis into very good quality calls (green), calls of reasonable quality (orange), and calls of poor quality (red). The quality of calls on most platforms will degrade as the run progresses, so it is common to see base calls falling into the orange area towards the end of a read.

loading..

Per Sequence Quality Scores

The number of reads with average quality scores. Shows if a subset of reads has poor quality.

From the FastQC help:

The per sequence quality score report allows you to see if a subset of your sequences have universally low quality values. It is often the case that a subset of sequences will have universally poor quality, however these should represent only a small percentage of the total sequences.

loading..

Per Base Sequence Content

The proportion of each base position for which each of the four normal DNA bases has been called.

To enable multiple samples to be shown in a single plot, the base composition data is shown as a heatmap. The colours represent the balance between the four bases: an even distribution should give an even muddy brown colour. Hover over the plot to see the percentage of the four bases under the cursor.

To see the data as a line plot, as in the original FastQC graph, click on a sample track.

From the FastQC help:

Per Base Sequence Content plots out the proportion of each base position in a file for which each of the four normal DNA bases has been called.

In a random library you would expect that there would be little to no difference between the different bases of a sequence run, so the lines in this plot should run parallel with each other. The relative amount of each base should reflect the overall amount of these bases in your genome, but in any case they should not be hugely imbalanced from each other.

It's worth noting that some types of library will always produce biased sequence composition, normally at the start of the read. Libraries produced by priming using random hexamers (including nearly all RNA-Seq libraries) and those which were fragmented using transposases inherit an intrinsic bias in the positions at which reads start. This bias does not concern an absolute sequence, but instead provides enrichement of a number of different K-mers at the 5' end of the reads. Whilst this is a true technical bias, it isn't something which can be corrected by trimming and in most cases doesn't seem to adversely affect the downstream analysis.

Click a sample row to see a line plot for that dataset.

Rollover for sample name

Position: -

%T: -

%C: -

%A: -

%G: -

Per Sequence GC Content

The average GC content of reads. Normal random library typically have a roughly normal distribution of GC content.

From the FastQC help:

This module measures the GC content across the whole length of each sequence in a file and compares it to a modelled normal distribution of GC content.

In a normal random library you would expect to see a roughly normal distribution of GC content where the central peak corresponds to the overall GC content of the underlying genome. Since we don't know the the GC content of the genome the modal GC content is calculated from the observed data and used to build a reference distribution.

An unusually shaped distribution could indicate a contaminated library or some other kinds of biased subset. A normal distribution which is shifted indicates some systematic bias which is independent of base position. If there is a systematic bias which creates a shifted normal distribution then this won't be flagged as an error by the module since it doesn't know what your genome's GC content should be.

loading..

Per Base N Content

The percentage of base calls at each position for which an N was called.

From the FastQC help:

If a sequencer is unable to make a base call with sufficient confidence then it will normally substitute an N rather than a conventional base call. This graph shows the percentage of base calls at each position for which an N was called.

It's not unusual to see a very low proportion of Ns appearing in a sequence, especially nearer the end of a sequence. However, if this proportion rises above a few percent it suggests that the analysis pipeline was unable to interpret the data well enough to make valid base calls.

loading..

Sequence Length Distribution

All samples have sequences of a single length (36bp).

Overrepresented sequences

The total amount of overrepresented sequences found in each library.

FastQC calculates and lists overrepresented sequences in FastQ files. It would not be possible to show this for all samples in a MultiQC report, so instead this plot shows the number of sequences categorized as over represented.

Sometimes, a single sequence may account for a large number of reads in a dataset. To show this, the bars are split into two: the first shows the overrepresented reads that come from the single most common sequence. The second shows the total count from all remaining overrepresented sequences.

From the FastQC Help:

A normal high-throughput library will contain a diverse set of sequences, with no individual sequence making up a tiny fraction of the whole. Finding that a single sequence is very overrepresented in the set either means that it is highly biologically significant, or indicates that the library is contaminated, or not as diverse as you expected.

FastQC lists all of the sequences which make up more than 0.1% of the total. To conserve memory only sequences which appear in the first 100,000 sequences are tracked to the end of the file. It is therefore possible that a sequence which is overrepresented but doesn't appear at the start of the file for some reason could be missed by this module.

7 samples had less than 1% of reads made up of overrepresented sequences

Adapter Content

The cumulative percentage count of the proportion of your library which has seen each of the adapter sequences at each position.

Note that only samples with ≥ 0.1% adapter contamination are shown.

There may be several lines per sample, as one is shown for each adapter detected in the file.

From the FastQC Help:

The plot shows a cumulative percentage count of the proportion of your library which has seen each of the adapter sequences at each position. Once a sequence has been seen in a read it is counted as being present right through to the end of the read so the percentages you see will only increase as the read length goes on.

No samples found with any adapter contamination > 0.1%

Cutadapt

Cutadapt is a tool to find and remove adapter sequences, primers, poly-Atails and other types of unwanted sequence from your high-throughput sequencing reads.

This plot shows the number of reads with certain lengths of adapter trimmed. Obs/Exp shows the raw counts divided by the number expected due to sequencing errors. A defined peak may be related to adapter length. See the cutadapt documentation for more information on how these numbers are generated.

loading..

Alignment stats

Samtools show how each sequencing library aligned with the reference genome.

Percent Mapped

Alignment metrics from samtools stats; mapped vs. unmapped reads.

For a set of samples that have come from the same multiplexed library, similar numbers of reads for each sample are expected. Large differences in numbers might indicate issues during the library preparation process. Whilst large differences in read numbers may be controlled for in downstream processings (e.g. read count normalisation), you may wish to consider whether the read depths achieved have fallen below recommended levels depending on the applications.

Low alignment rates could indicate contamination of samples (e.g. adapter sequences), low sequencing quality or other artefacts. These can be further investigated in the sequence level QC (e.g. from FastQC).

loading..

Mapped reads per contig

The samtools idxstats tool counts the number of mapped reads per chromosome / contig. Chromosomes with < 0.1% of the total aligned reads are omitted from this plot.

loading..

Preseq (unfiltered)

Preseq (unfiltered) show library complexity of each sample before filtering.

Complexity curve

Note that the x axis is trimmed at the point where all the datasets show 80% of their maximum y-value, to avoid ridiculous scales.

loading..

Filtering of alignments

Filtering of alignments remove reads/alignments that are (1)duplicates, (2)in the blacklisted regions, (3)mapped to multiple locations, (4)reads containing >4 mismatches, (5)reads that have an insert size >2kb or (6)disconcordant paired-end reads. Unmapped reads and secondary alignments have already been removed in the alignment step.

loading..

deepTools

deepTools This section of the report shows ChIP-seq QC plots generated by deepTools.

Fingerprint plot

Signal fingerprint according to plotFingerprint

loading..

Read Distribution Profile after Annotation

Accumulated view of the distribution of sequence reads related to the closest annotated gene. All annotated genes have been normalized to the same size.

Green: -3.0Kb upstream of gene to TSS
Yellow: TSS to TES
Pink: TES to 3.0Kb downstream of gene

loading..

Strand-shift correlation plot

Strand-shift correlation plot is a useful ChIP-seq quality evaluation tool. One should expect to see a peak corresponding to the fragment length and secondary peak corresponding to the read length. The ChIP-seq samples should have higher strand cross-correlation than control samples. Please see ENCODE ChIP-Seq paper for more details. The plot data was generated using run_spp.R script from phantompeakqualtools.

loading..

NSC and RSC coefficients

NSC and RSC coefficients are ChIP-seq metrics derived from strand cross-correlation. Higher NSC scores indicate better peak enrichment compared to background. Higher RSC scores indicate better peak enrichment in ChIP fragments compared to phantom peaks derived from read length artifacts. ENCODE standards regard NSC over 1.05 and RSC over 0.8 to be indications of a good ChIP-seq experiment. Please see ENCODE ChIP-Seq paper for more details. The coefficients were generated using run_spp.R script from phantompeakqualtools.

Showing ⁷/₇ rows and ²/₂ columns.

Sample Name	NSC	RSC
BT474_1	1.49	1.39
BT474_2	1.59	1.37
BT474_Control	1.02	0.88
MCF7_1	2.48	1.33
MCF7_2	1.59	1.47
MCF7_3	1.81	1.40
MCF7_Control	1.03	0.80

Uncheck the tick box to hide columns. Click and drag the handle on the left to change order.

Sort	Visible	Group	Column	Description	ID	Scale
\|\|			NSC	Normalized Strand Cross-correlation	`NSC`	None
\|\|			RSC	Relative Strand Cross-correlation	`RSC`	None

MACS2

MACS2 identifies transcription factor binding sites. It is widely used in many ChIP-Seq and similar studies and pipelines.

Summary table of MACS2 peak calling results

General statistics of MACS2 peak calling results. FRiP score is generated by calculating the fraction of all mapped reads that fall into the MACS2 called peak regions. A read must overlap a peak by at least 20% to be counted.

Showing ⁵/₅ rows and ³/₃ columns.

Sample Name	Numbers of peaks	Fragment lengths	FRiP score
BT474_1	36147	134	0.118
BT474_2	35368	136	0.129
MCF7_1	78333	118	0.268
MCF7_2	53356	142	0.148
MCF7_3	63722	138	0.201

Uncheck the tick box to hide columns. Click and drag the handle on the left to change order.

Sort	Column	Description	ID	Scale
\|\|	Numbers of peaks	Numbers of peaks called by MACS2	`num_peaks`	None
\|\|	Fragment lengths	Fragment lengths estimated by MACS2	`fragment_len`	None
\|\|	FRiP score	Fraction of reads in peaks	`frip_score`	None

View genomic tracks in UCSC Genome Browser

This section gives you a quick and easy way to view the genomic tracks. However, we suggest that you download the results in the Download data section, so you can view/analyze the tracks at your convenience. The links in this section expire after 60 days. If you still need to view tracks through the links, please contact us.

How to view the genomic tracks:

Click on any of the links below to view the genomic track of interest.
To view multiple tracks together, simply click their links one by one.
You can hide tracks in UCSC genome browser if you don't want to view them any more.

Showing ⁷/₇ rows and ⁴/₄ columns.

Sample Name	Called peaks	Normalized fragment pileup	Fold enrichment	Summit locations
BT474_1	View track	View track	View track	View track
BT474_2	View track	View track	View track	View track
MCF7_1	View track	View track	View track	View track
MCF7_2	View track	View track	View track	View track
MCF7_3	View track	View track	View track	View track
BT474_Control		View track
MCF7_Control		View track

Uncheck the tick box to hide columns. Click and drag the handle on the left to change order.

Sort	Column	Description	ID	Scale
\|\|	Called peaks	Called peaks in narrowPeak or broadPeak format	`called peaks`	None
\|\|	Normalized fragment pileup	Fragment pileup normalized by numbers of aligned reads	`normalized fragment pileup`	None
\|\|	Fold enrichment	Fold enrichment over control sample	`fold enrichment`	None
\|\|	Summit locations	Summit locations in BED format	`summit location`	None

HOMER: Peak annotation

HOMER: Peak annotation is generated by calculating the proportion of peaks assigned to genomic features by HOMER annotatePeaks.pl.

loading..

Differential binding

DiffBind compute differentially bound sites from multiple ChIP-Seq experiments.

Similarity matrix of samples

The similarity of samples in terms of read counts in peaks. A static version of this figure with dendrogram and a PCA plot can be downloaded in the Download data section.

loading..

Venn diagram of peaks among conditions

Numbers of peaks shared by or unique to conditions. Overlapping peaks are considered shared peaks by DiffBind.

Summary table of differential binding analysis

General statistics of differentially bound regions in pairwise comparisons. Regions with q-values less than 0.05 were considered differentially bound.

Showing ¹/₁ rows and ³/₃ columns.

Comparison(cond.1_vs_cond.2)	Higher in condition 1	Higher in condition 2	Not differentially bound
BT474_vs_MCF7	9974	32763	24456

Uncheck the tick box to hide columns. Click and drag the handle on the left to change order.

Sort	Column	Description	ID	Scale
\|\|	Higher in condition 1	Numbers of peaks with higher read counts in condition 1	`up`	None
\|\|	Higher in condition 2	Numbers of peaks with higher read counts in condition 2	`down`	None
\|\|	Not differentially bound	Numbers of peaks with simliar read counts in both conditions	`not_diff`	None

Top differentially bound regions in comparison BT474_vs_MCF7

Top 50 differentially bound regions, ranked by FDR, in comparison BT474_vs_MCF7. Full DiffBind results can be downloaded in the Download data section.
Regions with positive Log2 fold changes have higher binding in BT474. Regions with negative Log2 fold changes have higher binding in MCF7.

We have generated a bigBed track for each comparison. Colors of regions reflect fold changes while score of regions reflect -log10(FDR)

Showing ⁵⁰/₅₀ rows and ⁶/₆ columns.

Rank	Chromosome	Start	End	Log2 fold change	False discovery rate	Annotation
1	chr9	111240772	111241791	-6.04	7.8e-61	Intergenic
2	chr10	9079630	9080802	-6.36	1.3e-59	Intergenic
3	chr20	46704366	46705313	-5.93	2.1e-59	Intergenic
4	chr21	41692561	41693579	-5.74	1.0e-57	intron (DSCAM intron 8 of 32)
5	chr20	49343658	49344645	-4.16	2.6e-57	Intergenic
6	chr10	8767839	8768949	-4.99	5.5e-50	Intergenic
7	chr8	91899639	91900338	-4.98	2.4e-46	intron (NECAB1 intron 5 of 12)
8	chr1	203058158	203059169	-4.08	2.0e-43	Intergenic
9	chr17	59939113	59940006	-4.88	2.3e-43	intron (BRIP1 intron 1 of 19)
10	chr17	57833385	57834197	-3.74	3.5e-43	intron (VMP1 intron 5 of 11)
11	chr12	76099828	76100618	-5.36	2.0e-42	Intergenic
12	chr1	109782456	109783690	-5.11	6.3e-42	Intergenic
13	chr13	69260074	69261207	-4.16	2.9e-41	Intergenic
14	chr17	54249881	54250440	5.85	1.0e-40	intron (ANKFN1 intron 1 of 16)
15	chr6	120288705	120289595	-4.61	1.4e-40	Intergenic
16	chr6	15221864	15222700	-4.51	1.5e-40	Intergenic
17	chr1	114576558	114577189	-5.52	6.9e-40	Intergenic
18	chr1	107962887	107963548	-4.72	5.9e-39	intron (NTNG1 intron 6 of 7)
19	chr8	91703968	91704684	-3.77	1.7e-38	Intergenic
20	chr16	77637649	77638489	-3.92	7.1e-38	Intergenic
21	chr15	24313258	24314117	4.78	8.4e-38	intron (PWRN4 intron 4 of 5)
22	chr3	15677474	15678258	-4.54	9.1e-38	intron (BTD intron 2 of 2)
23	chr17	59877639	59878235	-5.48	2.6e-37	intron (BRIP1 intron 8 of 19)
24	chr8	91924710	91925615	-4.29	2.6e-37	intron (NECAB1 intron 5 of 12)
25	chr16	17067434	17068780	-3.08	3.0e-37	Intergenic
26	chr2	237678298	237678905	-5.76	4.4e-37	Intergenic
27	chr5	82682695	82683742	-4.35	5.9e-37	Intergenic
28	chr10	172357	173271	-5.74	7.0e-37	Intergenic
29	chr11	100773024	100773999	4.17	8.8e-37	intron (ARHGAP42 intron 4 of 23)
30	chr20	45832956	45833655	-4.58	1.1e-36	Intergenic
31	chr1	191122047	191123169	-4.13	1.3e-36	Intergenic
32	chr8	91926552	91927572	-5.76	1.3e-36	intron (NECAB1 intron 5 of 12)
33	chr7	84165346	84166224	-3.85	1.5e-36	intron (LOC101927378 intron 1 of 3)
34	chr3	64249881	64250808	-4.39	2.1e-36	Intergenic
35	chr7	156518413	156519118	-4.78	4.0e-36	intron (LMBR1 intron 12 of 16)
36	chr17	36748022	36748493	4.75	5.7e-36	intron (SRCIN1 intron 1 of 18)
37	chr12	97949278	97950006	-4.65	2.7e-35	Intergenic
38	chr7	110036421	110037322	6.09	6.5e-35	Intergenic
39	chr7	112921167	112922248	-3.74	7.6e-35	Intergenic
40	chr20	55849904	55850665	-4.25	2.2e-34	Intergenic
41	chr11	35387354	35388000	-5.12	5.3e-34	intron (SLC1A2 intron 1 of 11)
42	chr3	61793415	61793878	-4.31	5.3e-34	intron (PTPRG intron 2 of 29)
43	chr5	172881207	172883268	-3.78	5.3e-34	Intergenic
44	chr9	82676809	82677372	-4.93	8.1e-34	Intergenic
45	chr4	106047356	106048266	-4.02	1.6e-33	Intergenic
46	chr10	3331910	3332581	-5.43	2.7e-33	Intergenic
47	chr9	77715827	77716500	-4.96	2.9e-33	intron (OSTF1 intron 1 of 9)
48	chr20	50003880	50004488	-4.28	3.3e-33	exon (NFATC2 exon 11 of 11)
49	chr17	58645190	58645834	-3.88	3.9e-33	intron (LOC388406 intron 2 of 3)
50	chr2	142259413	142260253	-5.64	3.9e-33	intron (LRP1B intron 2 of 90)

Uncheck the tick box to hide columns. Click and drag the handle on the left to change order.

Sort	Column	Description	ID	Scale
\|\|	Chromosome	Chromosome	`seqnames`	None
\|\|	Start	Start coordinates	`start`	None
\|\|	End	End coordinates	`end`	None
\|\|	Log2 fold change	Log2 fold change	`Fold`	None
\|\|	False discovery rate	False discovery rate	`FDR`	None
\|\|	Annotation	Annotation	`Annotation`	None

Download data

This section contains links to download your original data, data generated by various bioinformatics tools, and some static images. To download individual files, click on the corresponding links. There are also instructions at the bottom of the this section if you want to download everything in batch.

Links in this section expire after 60 days. If you want to download files after that, please contact us.

Data files of samples

Showing ⁷/₇ rows and ⁷/₇ columns.

Sample Name	Read #1	Alignment	Normalized fragment pileup	Fold enrichment	Peaks(narrowPeak/broadPeak)	Summit locations	Peaks with annotation
BT474_1	FASTQ	BAM	BigWig	BigWig	narrowPeak	BED	TSV
BT474_2	FASTQ	BAM	BigWig	BigWig	narrowPeak	BED	TSV
BT474_Control	FASTQ	BAM	BigWig
MCF7_1	FASTQ	BAM	BigWig	BigWig	narrowPeak	BED	TSV
MCF7_2	FASTQ	BAM	BigWig	BigWig	narrowPeak	BED	TSV
MCF7_3	FASTQ	BAM	BigWig	BigWig	narrowPeak	BED	TSV
MCF7_Control	FASTQ	BAM	BigWig

Uncheck the tick box to hide columns. Click and drag the handle on the left to change order.

Sort	Group	Column	Description	ID	Scale
\|\|		Read #1	Read #1 FASTQ files	`Read 1`	None
\|\|	BWA	Alignment	Filtered & sorted alignment BAM files	`Alignment`	None
\|\|	deepTools	Normalized fragment pileup	Fragment pileup normalized by number of aligned reads	`Fragment pileup`	None
\|\|	deepTools	Fold enrichment	Fold enrichment over control sample	`Fold enrichment`	None
\|\|	MACS2	Peaks(narrowPeak/broadPeak)	Peak calling results in narrowPeak/broadPeak format	`MACS2_peaks`	None
\|\|	MACS2	Summit locations	Summit locations in BED format	`MACS2_summits_bed`	None
\|\|	MACS2+HOMER	Peaks with annotation	Peak information along with annotation	`Peaks with annoation`	None

Result files of comparisons

This section contains results and figures of differential binding analyses. They include merged peaks and read counts, correlation matrix of samples, PCA plot of samples, and Venn diagram of shared peaks. If your study had replicates, the results also include statistical analyses of differential binding. If not, the results inclue peak overlap analyses.

Showing ¹/₁ rows and ³/₃ columns.

Comparison(cond.1_vs_cond.2)	DiffBind results	Scatter plots	MA plots
BT474_vs_MCF7	TSV	JPG	JPG

Uncheck the tick box to hide columns. Click and drag the handle on the left to change order.

Sort	Group	Column	Description	ID	Scale
\|\|	DiffBind	DiffBind results	Differential binding analysis results	`DiffBind_results`	None
\|\|	DiffBind	Scatter plots	Scatter plots showing normalized counts of two conditions	`DiffBind_XY_plot`	None
\|\|	DiffBind	MA plots	MA plots showing distribution of differential bindings	`DiffBind_MA_plot`	None

Instructions to download all files

Download a script to download all files. We assume it is in your Downloads folder.
Find and open Terminal(Mac/Linux) or Windows Powershell(Windows).
Type cd ~/Downloads and Enter. (If your download folder is different, please change accordingly)
Copy and Paste bash download_links.ps1 (Mac/Linux) or Powershell.exe -ExecutionPolicy Bypass -File .\download_links.ps1 (Windows) and Enter.

Software Versions

Software Versions are collected at run time from the software output. This pipeline is adapted from nf-core ChIPseq pipeline.

nf-core/chipseq: v1.1.0
Nextflow: v22.10.4
FastQC: v0.11.8
Trim Galore!: v0.5.0
BWA: v0.7.17-r1188
Samtools: v1.9
BEDTools: v2.27.1
BamTools: v2.5.1
deepTools: v3.2.1
Picard: v2.19.0
R: v3.4.1
Pysam: v0.15.2
MACS2: v2.1.2
HOMER: v4.9.1
featureCounts: v1.6.4
Preseq: v2.0.3
DiffBind: v2.14.0

Workflow Summary

Workflow Summary - this information is collected when the pipeline is started.

Data Type: Single-End
Genome: hg19
MACS2 Narrow Peaks: Yes
DiffBind FDR: 0.05

Toggle navigation Powered by v1.8

ChIPseq report for sample_report

MultiQC Toolbox

Apply Highlight Samples

Apply Rename Samples

Apply Show / Hide Samples

Export Plots

Choose Plots

Save Settings

Load Settings

About MultiQC

ChIPseq report for sample_report

General Statistics

General Statistics: Columns

FastQC

Sequence Quality Histograms Help

Per Sequence Quality Scores Help

Per Base Sequence Content Help

Rollover for sample name

Per Sequence GC Content Help

Per Base N Content Help

Sequence Length Distribution

Overrepresented sequences Help

Adapter Content Help

Cutadapt

Alignment stats

Percent Mapped Help

Mapped reads per contig

Preseq (unfiltered)

Complexity curve

Filtering of alignments

deepTools

Fingerprint plot

Read Distribution Profile after Annotation

Strand-shift correlation plot

NSC and RSC coefficients

Table Bguk: Columns

MACS2

Summary table of MACS2 peak calling results

Macs2 Results And Statistics: Columns

View genomic tracks in UCSC Genome Browser

How to view the genomic tracks:

View Genomic Tracks: Columns

HOMER: Peak annotation

Differential binding

Similarity matrix of samples

Venn diagram of peaks among conditions

Summary table of differential binding analysis

Differential Binding Table: Columns

Top differentially bound regions in comparison BT474_vs_MCF7 Help

Top Diffbind Bt474 Vs Mcf7: Columns

Download data

Data files of samples

Data Files Table: Columns

Result files of comparisons

Result Files Table: Columns

Instructions to download all files

Software Versions

Workflow Summary

Powered by

v1.8

Highlight Samples

Rename Samples

Show / Hide Samples

Sequence Quality Histograms

Per Sequence Quality Scores

Per Base Sequence Content

Per Sequence GC Content

Per Base N Content

Overrepresented sequences

Adapter Content

Percent Mapped

Top differentially bound regions in comparison BT474_vs_MCF7