CloudATAC: a cloud-based framework for ATAC-Seq data analysis

Avinash M. Veerappa, M. Jordan Rowley, Angela Maggio, Laura Beaudry, Dale Hawkins, Allen Kim, Sahil Sethi, Paul L. Sorgen, Chittibabu Guda

Research output: Contribution to journalArticlepeer-review

1 Scopus citations

Abstract

Assay for transposase-accessible chromatin with high-throughput sequencing (ATAC-seq) generates genome-wide chromatin accessibility profiles, providing valuable insights into epigenetic gene regulation at both pooled-cell and single-cell population levels. Comprehensive analysis of ATAC-seq data involves the use of various interdependent programs. Learning the correct sequence of steps needed to process the data can represent a major hurdle. Selecting appropriate parameters at each stage, including pre-analysis, core analysis, and advanced downstream analysis, is important to ensure accurate analysis and interpretation of ATAC-seq data. Additionally, obtaining and working within a limited computational environment presents a significant challenge to non-bioinformatic researchers. Therefore, we present Cloud ATAC, an open-source, cloud-based interactive framework with a scalable, flexible, and streamlined analysis framework based on the best practices approach for pooled-cell and single-cell ATAC-seq data. These frameworks use on-demand computational power and memory, scalability, and a secure and compliant environment provided by the Google Cloud. Additionally, we leverage Jupyter Notebook’s interactive computing platform that combines live code, tutorials, narrative text, flashcards, quizzes, and custom visualizations to enhance learning and analysis. Further, leveraging GPU instances has significantly improved the run-time of the single-cell framework. The source codes and data are publicly available through NIH Cloud lab https://github.com/NIGMS/ATAC-Seq-and-Single-Cell-ATAC-Seq-Analysis.

Original languageEnglish (US)
Article numberbbae090
JournalBriefings in bioinformatics
Volume25
DOIs
StatePublished - Jul 1 2024

Keywords

  • ATACseq
  • Google Cloud
  • NIH Cloud Lab
  • NIH strides
  • chromatin
  • single-cell ATAC

ASJC Scopus subject areas

  • Information Systems
  • Molecular Biology

Fingerprint

Dive into the research topics of 'CloudATAC: a cloud-based framework for ATAC-Seq data analysis'. Together they form a unique fingerprint.

Cite this