Leveraging STARE for Co-aligned data locality with netCDF and python MPI

Kwo Sen Kuo, Hongfeng Yu, Yu Pan, Michael L. Rilee

Research output: Contribution to conferencePaperpeer-review

4 Scopus citations

Abstract

We have leveraged STARE indexing to package partitioned data chunks from diverse datasets into netCDF files, distributed them on a cluster of 16 lightweight nodes with their placements spatiotemporally co-aligned, and demonstrated a few integrative analyses using netCDF parallel I/O and Python MPI, with single-user performance and scalability comparable to, or even better than, that of a parallel array database management system (ADBMS) such as SciDB. However, records of the node location and STARE index ranges for each data chunk, similar to the chunk maps of SciDB, must be maintained and consulted by the I/O and analysis code for coordinating the analytic operations in parallel, in order to achieve the good performance and scalability.

Original languageEnglish (US)
Pages10063-10066
Number of pages4
DOIs
StatePublished - 2019
Event39th IEEE International Geoscience and Remote Sensing Symposium, IGARSS 2019 - Yokohama, Japan
Duration: Jul 28 2019Aug 2 2019

Conference

Conference39th IEEE International Geoscience and Remote Sensing Symposium, IGARSS 2019
Country/TerritoryJapan
CityYokohama
Period7/28/198/2/19

Keywords

  • Big Data
  • Data-intensive analysis
  • Interoperability
  • Parallel processing
  • Scalability

ASJC Scopus subject areas

  • Computer Science Applications
  • General Earth and Planetary Sciences

Fingerprint

Dive into the research topics of 'Leveraging STARE for Co-aligned data locality with netCDF and python MPI'. Together they form a unique fingerprint.

Cite this