cyberneticlibrary

Execute ATAC-seq data processing pipelines

pipeline-atacseqskillsetup L435
ammawla/encode-toolkit
What it does

Execute ENCODE ATAC-seq processing pipeline from FASTQ

Best for

Processing ATAC-seq FASTQ to peaks with proper Tn5 transposase offset correction.

Inputs
  • · FASTQ files (paired-end or single-end)
  • · Aligned BAM files
  • · Reference genome (indexed)
  • · Sample metadata sheet (CSV)
Outputs
  • · Peak calls (narrowPeak/broadPeak format)
  • · Signal tracks (bigWig)
  • · Quality control report
Requires
  • · Nextflow
  • · Docker
  • · BWA/Bowtie2 aligner
  • · MACS2 peak caller
  • · IDR (irreproducible discovery rate)
Preconditions
  • · Reference genome indexed
  • · FASTQ files available
  • · Nextflow/Docker installed
Failure modes
  • · Poor FASTQ quality (low pass rate)
  • · Reference genome mismatch
  • · Memory exhaustion on large datasets
  • · Adapter contamination
Trust signals
  • · ENCODE consortium standards documented
  • · Maintained in ammawla/encode-toolkit
  • · Peer-reviewed publications cited