Execute ATAC-seq data processing pipelines
pipeline-atacseqskillsetup L4★35
ammawla/encode-toolkit ↗What it does
Execute ENCODE ATAC-seq processing pipeline from FASTQ
Best for
Processing ATAC-seq FASTQ to peaks with proper Tn5 transposase offset correction.
Inputs
- · FASTQ files (paired-end or single-end)
- · Aligned BAM files
- · Reference genome (indexed)
- · Sample metadata sheet (CSV)
Outputs
- · Peak calls (narrowPeak/broadPeak format)
- · Signal tracks (bigWig)
- · Quality control report
Requires
- · Nextflow
- · Docker
- · BWA/Bowtie2 aligner
- · MACS2 peak caller
- · IDR (irreproducible discovery rate)
Preconditions
- · Reference genome indexed
- · FASTQ files available
- · Nextflow/Docker installed
Failure modes
- · Poor FASTQ quality (low pass rate)
- · Reference genome mismatch
- · Memory exhaustion on large datasets
- · Adapter contamination
Trust signals
- · ENCODE consortium standards documented
- · Maintained in ammawla/encode-toolkit
- · Peer-reviewed publications cited