Skip to main content
eScholarship
Open Access Publications from the University of California

Improving Statistical Similarity Based Data Reduction for Non-Stationary Data

Published Web Location

https://sdm.lbl.gov/oapapers/ssdbm17-lee-upd.pdf
No data is associated with this publication.
Abstract

We propose a new class of lossy compression based on locally exchangeable measure that captures the distribution of repeating data blocks while preserving unique patterns. The technique has been demonstrated to reduce data volume by more than 100-fold on power grid monitoring data where a large number of data blocks can be characterized as following stationary probability distributions. To capture data with more diverse patterns, we propose two techniques to transform non-stationary time series into locally stationary blocks. We also propose a strategy to work with values in bounded ranges such as phase angles of alternating current. These new ideas are incorporated into a software package named IDEALEM. In experiments, IDEALEM reduces non-stationary data volume up to 100-fold. Compared with the state-of-the-art lossy compression methods such as SZ, IDEALEM can produce more compact output overall.

Item not freely available? Link broken?
Report a problem accessing this item