cyberneticlibrary

Analyze ENCODE functional genomics screens

functional-screen-analysisskillsetup L235
ammawla/encode-toolkit
What it does

Analyze genome-wide CRISPR/RNAi screens for hits and pathway enrichment

Best for

Functional genomics when you have CRISPR/RNAi data and need to rank candidates by statistical rigor; MAGeCK gold standard.

Inputs
  • · screen data (gene IDs + log2 fold-change + p-value)
  • · cell type/condition
  • · background control replicates
Outputs
  • · hit gene list (FDR-corrected)
  • · pathway enrichment (KEGG/Reactome)
  • · candidate targets ranked by effect size + consistency
Requires
  • · MAGeCK (CRISPR screen statistics)
  • · clusterProfiler (enrichment)
  • · ggplot2 (visualization)
Preconditions

Screen replicate structure known (control vs treatment); gene IDs matched to reference (Ensembl/NCBI)

Failure modes
  • · replicates pooled without batch correction → inflates false positives
  • · pathway database outdated (KEGG 2015) → misses novel pathways
  • · effect size small vs noise → hits not reproducible
Trust signals
  • · MAGeCK (widely cited in CRISPR papers)
  • · FDR correction enforced
  • · Batch effect handling documented