TY - GEN
T1 - Adaptive Deep Learning based Time-Varying Volume Compression
AU - Pan, Yu
AU - Zhu, Feiyu
AU - Gao, Tian
AU - Yu, Hongfeng
N1 - Publisher Copyright:
© 2019 IEEE.
PY - 2019/12
Y1 - 2019/12
N2 - Nowadays, floating-point temporal-spatial datasets are routinely generated from scientific observational apparatuses or computer simulations at an unprecedented pace. The sheer amount of these large volumetric datasets on the order of terabytes or petabytes consume massive resources in terms of bandwidth, storage and computational power. On the other hand, scientists, equipped with low-end post-analysis machines, often find it impossible to visualize and analyze these massive datasets with such limited resources in hand, not to mention their ultimate goal of real time analysis and visualization. To solve this discrepancy, a compact data representation has to be generated and a trade-off between resource consumption and analytical precision has to be found. There are many existing volumetric representation generating methods, almost all of which adopts some kind of hand-engineered heuristics to extract the effective portion of the datasets. However, the trade-off between resource consumption and analytical quality could not be well established due to the introduction of hand-engineered heuristics. In this paper, we present a deep learning based method that can adaptively capture the inherently complicated dynamics of temporal-spatial volumetric datasets without introducing any hand engineered features. We train an autoencoder based neural network with quantization and adaptation. Compared with existing methods, our method could learn data representation at a much lower compressed/uncompressed rate while preserving the details of original datasets. Also, our method could adapt with different data distribution and conduct compression and decompression in real time. Through extensive experiments, we show the effectiveness and efficiency of our approach over existing methods.
AB - Nowadays, floating-point temporal-spatial datasets are routinely generated from scientific observational apparatuses or computer simulations at an unprecedented pace. The sheer amount of these large volumetric datasets on the order of terabytes or petabytes consume massive resources in terms of bandwidth, storage and computational power. On the other hand, scientists, equipped with low-end post-analysis machines, often find it impossible to visualize and analyze these massive datasets with such limited resources in hand, not to mention their ultimate goal of real time analysis and visualization. To solve this discrepancy, a compact data representation has to be generated and a trade-off between resource consumption and analytical precision has to be found. There are many existing volumetric representation generating methods, almost all of which adopts some kind of hand-engineered heuristics to extract the effective portion of the datasets. However, the trade-off between resource consumption and analytical quality could not be well established due to the introduction of hand-engineered heuristics. In this paper, we present a deep learning based method that can adaptively capture the inherently complicated dynamics of temporal-spatial volumetric datasets without introducing any hand engineered features. We train an autoencoder based neural network with quantization and adaptation. Compared with existing methods, our method could learn data representation at a much lower compressed/uncompressed rate while preserving the details of original datasets. Also, our method could adapt with different data distribution and conduct compression and decompression in real time. Through extensive experiments, we show the effectiveness and efficiency of our approach over existing methods.
KW - autoencoder
KW - deep learning
KW - scientific data
KW - volume compression
UR - http://www.scopus.com/inward/record.url?scp=85081362277&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85081362277&partnerID=8YFLogxK
U2 - 10.1109/BigData47090.2019.9006146
DO - 10.1109/BigData47090.2019.9006146
M3 - Conference contribution
AN - SCOPUS:85081362277
T3 - Proceedings - 2019 IEEE International Conference on Big Data, Big Data 2019
SP - 1187
EP - 1194
BT - Proceedings - 2019 IEEE International Conference on Big Data, Big Data 2019
A2 - Baru, Chaitanya
A2 - Huan, Jun
A2 - Khan, Latifur
A2 - Hu, Xiaohua Tony
A2 - Ak, Ronay
A2 - Tian, Yuanyuan
A2 - Barga, Roger
A2 - Zaniolo, Carlo
A2 - Lee, Kisung
A2 - Ye, Yanfang Fanny
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2019 IEEE International Conference on Big Data, Big Data 2019
Y2 - 9 December 2019 through 12 December 2019
ER -