cyberneticlibrary

Cross-study single-cell RNA analysis

scrna-meta-analysisskillsetup L435
ammawla/encode-toolkit
What it does

Integrate multiple scRNA-seq datasets across studies

Best for

Building cell atlases and assessing reproducibility of single-cell findings across studies with detection-aware quality filtering.

Inputs
  • · ENCODE scRNA-seq datasets from multiple studies/donors
  • · Sample metadata (tissue, cell type, platform)
Outputs
  • · Batch-corrected integrated matrix (genes × cells)
  • · Reproducibility-scored cell type assignments
  • · Cross-study quality metrics (TIN scores, detection limits)
Requires
  • · Seurat 3 / Harmony / LIGER
  • · Python (scanpy) or R (Seurat)
  • · ENCODE database API
Preconditions

Multiple scRNA-seq count matrices in standard format, knowledge of cell type biology

Failure modes

Severe batch effects persist after correction, detection-limit artifacts mask true heterogeneity, cross-contamination from ambient RNA, dropout prevents marker detection

Trust signals
  • · Mawla et al. 2019 meta-analysis framework
  • · Cross-study reproducibility focus
  • · TIN-based quality assessment
  • · Stuart et al. 2019 (8400+ citations), Luecken & Theis 2019 (1631 citations)