TY - JOUR
T1 - CloudATAC
T2 - a cloud-based framework for ATAC-Seq data analysis
AU - Veerappa, Avinash M.
AU - Rowley, M. Jordan
AU - Maggio, Angela
AU - Beaudry, Laura
AU - Hawkins, Dale
AU - Kim, Allen
AU - Sethi, Sahil
AU - Sorgen, Paul L.
AU - Guda, Chittibabu
N1 - Publisher Copyright:
© The Author(s) 2024.
PY - 2024/7/1
Y1 - 2024/7/1
N2 - Assay for transposase-accessible chromatin with high-throughput sequencing (ATAC-seq) generates genome-wide chromatin accessibility profiles, providing valuable insights into epigenetic gene regulation at both pooled-cell and single-cell population levels. Comprehensive analysis of ATAC-seq data involves the use of various interdependent programs. Learning the correct sequence of steps needed to process the data can represent a major hurdle. Selecting appropriate parameters at each stage, including pre-analysis, core analysis, and advanced downstream analysis, is important to ensure accurate analysis and interpretation of ATAC-seq data. Additionally, obtaining and working within a limited computational environment presents a significant challenge to non-bioinformatic researchers. Therefore, we present Cloud ATAC, an open-source, cloud-based interactive framework with a scalable, flexible, and streamlined analysis framework based on the best practices approach for pooled-cell and single-cell ATAC-seq data. These frameworks use on-demand computational power and memory, scalability, and a secure and compliant environment provided by the Google Cloud. Additionally, we leverage Jupyter Notebook’s interactive computing platform that combines live code, tutorials, narrative text, flashcards, quizzes, and custom visualizations to enhance learning and analysis. Further, leveraging GPU instances has significantly improved the run-time of the single-cell framework. The source codes and data are publicly available through NIH Cloud lab https://github.com/NIGMS/ATAC-Seq-and-Single-Cell-ATAC-Seq-Analysis.
AB - Assay for transposase-accessible chromatin with high-throughput sequencing (ATAC-seq) generates genome-wide chromatin accessibility profiles, providing valuable insights into epigenetic gene regulation at both pooled-cell and single-cell population levels. Comprehensive analysis of ATAC-seq data involves the use of various interdependent programs. Learning the correct sequence of steps needed to process the data can represent a major hurdle. Selecting appropriate parameters at each stage, including pre-analysis, core analysis, and advanced downstream analysis, is important to ensure accurate analysis and interpretation of ATAC-seq data. Additionally, obtaining and working within a limited computational environment presents a significant challenge to non-bioinformatic researchers. Therefore, we present Cloud ATAC, an open-source, cloud-based interactive framework with a scalable, flexible, and streamlined analysis framework based on the best practices approach for pooled-cell and single-cell ATAC-seq data. These frameworks use on-demand computational power and memory, scalability, and a secure and compliant environment provided by the Google Cloud. Additionally, we leverage Jupyter Notebook’s interactive computing platform that combines live code, tutorials, narrative text, flashcards, quizzes, and custom visualizations to enhance learning and analysis. Further, leveraging GPU instances has significantly improved the run-time of the single-cell framework. The source codes and data are publicly available through NIH Cloud lab https://github.com/NIGMS/ATAC-Seq-and-Single-Cell-ATAC-Seq-Analysis.
KW - ATACseq
KW - Google Cloud
KW - NIH Cloud Lab
KW - NIH strides
KW - chromatin
KW - single-cell ATAC
UR - http://www.scopus.com/inward/record.url?scp=85199393757&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85199393757&partnerID=8YFLogxK
U2 - 10.1093/bib/bbae090
DO - 10.1093/bib/bbae090
M3 - Article
C2 - 39041910
AN - SCOPUS:85199393757
SN - 1467-5463
VL - 25
JO - Briefings in bioinformatics
JF - Briefings in bioinformatics
M1 - bbae090
ER -