A Deep Learning Approach for Time Series Data Correction:‎Integrating Autoencoder-Based GANs andCorrelation Analysis

  • Authors

    • Sohyeon YUN Department of Electrical and Computer Engineering University of Seoul, Seoul, Korea
    • Han-joon Kim Department of Electrical and Computer Engineering University of Seoul, Seoul, Korea https://orcid.org/0000-0003-4510-5685
    https://doi.org/10.14419/ezf67m88

    Received date: December 27, 2025

    Accepted date: February 28, 2026

    Published date: April 26, 2026

  • Multivariate Time Series Data; Deep Learning; GAN; Anomaly Detection; Data Correc‎tion; Data Quality
  • Abstract

    This paper presents a novel correction framework for multivariate time series data ‎that enhances data quality by integrating Autoencoder-based Generative Adversar‎ial Networks (GANs) with correlation-aware analysis. With the widespread adop‎tion of deep learning models for sensor-driven applications such as quality control ‎and demand forecasting, data anomalies caused by sensor faults and network errors ‎have emerged as a critical challenge, often degrading model performance. In mul‎tivariate time series, anomaly correction is particularly difficult because erroneous ‎values must be distinguished from legitimate temporal variations while preserving ‎inter-variable dependencies and temporal consistency. To address this challenge, ‎we propose a data correction method that combines deep learning–based anomaly ‎detection with correlation-driven correction. An Autoencoder-based GAN is em‎ployed to identify anomalous patterns in multivariate time series, while a window ‎relevance matrix is introduced to guide precise correction. This matrix captures ‎complex relationships among variables by jointly incorporating dynamic time ‎warping and Pearson correlation coefficients within sliding windows. By leverag‎ing both temporal alignment and statistical dependency, the proposed approach ‎performs anomaly correction that maintains structural coherence across variables. ‎Extensive experiments conducted on diverse multivariate time series datasets ‎demonstrate that the corrected data consistently improves predictive performance ‎compared to raw data. These results indicate that the proposed method effectively ‎enhances data quality and model reliability, offering a robust solution for anomaly ‎correction in multivariate time series applications‎.

  • References

    1. Ge, M., Bangui, H., & Buhnova, B. (2018). Big data for internet of things: a survey. Future generation computer systems, 87, 601-614. https://doi.org/10.1016/j.future.2018.04.053.
    2. Choi, J., & Shin, Y. (2021). Design of Efficient Big Data Collection Method based on Mass IoT devices. The Journal of Korea Institute of Infor-mation, Electronics, and Communication Technology, 14(4), 300-306.
    3. Ren, S., Kim, J. S., Cho, W. S., Soeng, S., Kong, S., & Lee, K. H. (2021, April). Big data platform for intelligence industrial IoT sensor monitoring system based on edge computing and AI. In 2021 International conference on artificial intelligence in information and communication (ICAIIC) (pp. 480-482). IEEE. https://doi.org/10.1109/ICAIIC51459.2021.9415189.
    4. Farahani, M. A., El Kalach, F., Harper, A., Mccormick, M. R., Harik, R., & Wuest, T. (2025). Time-series forecasting in smart manufacturing sys-tems: An experimental evaluation of the state-of-the-art algorithms. Robotics and Computer-Integrated Manufacturing, 95, 103010. https://doi.org/10.1016/j.rcim.2025.103010.
    5. Su, N., Huang, S., & Su, C. (2024). Elevating smart manufacturing with a unified predictive maintenance platform: The synergy between data ware-housing, Apache spark, and machine learning. Sensors, 24(13), 4237. https://doi.org/10.3390/s24134237.
    6. Zamanzadeh Darban, Z., Webb, G. I., Pan, S., Aggarwal, C., & Salehi, M. (2024). Deep learning for time series anomaly detection: A survey. ACM Computing Surveys, 57(1), 1-42. https://dl.acm.org/doi/10.1145/3691338.
    7. Chandola, V., Banerjee, A., & Kumar, V. (2009). Anomaly detection: A survey. ACM computing surveys (CSUR), 41(3), 1-58. https://doi.org/10.1145/1541880.1541882.
    8. Wang, F., Jiang, Y., Zhang, R., Wei, A., Xie, J., & Pang, X. (2025). A survey of deep anomaly detection in multivariate time series: taxonomy, appli-cations, and directions. Sensors, 25(1), 190. https://doi.org/10.3390/s25010190.
    9. Seba, A. M., Gemeda, K. A., & Ramulu, P. J. (2024). Prediction and classification of IoT sensor faults using hybrid deep learning model. Discover Applied Sciences, 6(1), 9. https://doi.org/10.1007/s42452-024-053633-7.
    10. Garg, A., Zhang, W., Samaran, J., Savitha, R., & Foo, C. S. (2021). An evaluation of anomaly detection and diagnosis in multivariate time se-ries. IEEE Transactions on Neural Networks and Learning Systems, 33(6), 2508-2517. https://doi.org/10.1109/TNNLS.2021.3105827.
    11. Niu, Z., Yu, K., & Wu, X. (2020). LSTM-based VAE-GAN for time-series anomaly detection. Sensors, 20(13), 3738. https://doi.org/10.3390/s20133738.
    12. Mathur, A. P., & Tippenhauer, N. O. (2016, April). SWaT: A water treatment testbed for research and training on ICS security. In 2016 international workshop on cyber-physical systems for smart water networks (CySWater) (pp. 31-36). IEEE. https://doi.org/10.1109/CySWater.2016.7469060.
    13. Ahmed, C. M., Palleti, V. R., & Mathur, A. P. (2017, April). WADI: a water distribution testbed for research in the design of secure cyber physical systems. In Proceedings of the 3rd international workshop on cyber-physical systems for smart water networks (pp. 25-28). https://dl.acm.org/doi/10.1145/3055366.3055375.
    14. Shin, H. K., Lee, W., Yun, J. H., & Min, B. G. (2021, August). Two ICS security datasets and anomaly detection contest on the HIL-based augment-ed ICS testbed. In Proceedings of the 14th Cyber Security Experimentation and Test Workshop (pp. 36-40). https://dl.acm.org/doi/10.1145/3474718.3474719.
    15. Fisch, A. T., Eckley, I. A., & Fearnhead, P. (2022). Subset multivariate collective and point anomaly detection. Journal of Computational and Graph-ical Statistics, 31(2), 574-585. https://doi.org/10.1080/10618600.2021.1987257.
    16. Zhanwei, S., & Zenghui, L. (2019). Abnormal detection method of industrial control system based on behavior model. Computers & Security, 84, 166-178. https://doi.org/10.1016/j.cose.2019.03.009
    17. Wang, H., Bah, M. J., & Hammad, M. (2019). Progress in outlier detection techniques: A survey. IEEE Access, 7, 107964-108000. https://doi.org/10.1109/ACCESS.2019.2932762.
    18. Chalapathy, R., & Chawla, S. (2019). Deep learning for anomaly detection: A survey. arXiv preprint arXiv:1901.03407.
    19. Boukerche, A., Zheng, L., & Alfandi, O. (2020). Outlier detection: Methods, models, and classification. ACM Computing Surveys (CSUR), 53(3), 1-37. https://dl.acm.org/doi/10.1145/3381028.
    20. Garg, A., Zhang, W., Samaran, J., Savitha, R., & Foo, C. S. (2021). An evaluation of anomaly detection and diagnosis in multivariate time se-ries. IEEE Transactions on Neural Networks and Learning Systems, 33(6), 2508-2517. https://doi.org/10.1109/TNNLS.2021.3105827.
    21. Ergen, T., & Kozat, S. S. (2019). Unsupervised anomaly detection with LSTM neural networks. IEEE transactions on neural networks and learning systems, 31(8), 3127-3141. https://doi.org/10.1109/TNNLS.2019.2935975.
    22. Chen, Z., Yeo, C. K., Lee, B. S., & Lau, C. T. (2018, April). Autoencoder-based network anomaly detection. In 2018 Wireless telecommunications symposium (WTS) (pp. 1-5). IEEE. https://doi.org/10.1109/WTS.2018.8363930.
    23. Xia, X., Pan, X., Li, N., He, X., Ma, L., Zhang, X., & Ding, N. (2022). GAN-based anomaly detection: A review. Neurocomputing, 493, 497-535. https://doi.org/10.1016/j.neucom.2021.12.093
    24. Wang, X., & Wang, C. (2019). Time series data cleaning: A survey. IEEE Access, 8, 1866-1881. https://doi.org/10.1109/ACCESS.2019.2962152.
    25. Lee, M. K., Moon, S. H., Kim, Y. H., & Moon, B. R. (2014, October). Correcting abnormalities in meteorological data by machine learning. In 2014 IEEE International Conference on Systems, Man, and Cybernetics (SMC) (pp. 888-893). IEEE. https://doi.org/ 10.1109/SMC.2014.6974024.
    26. Baghoussi, Y., Soares, C., & Mendes-Moreira, J. (2024). Corrector LSTM: Built-in training data correction for improved time-series forecast-ing. Neural Computing and Applications, 36(26), 16213-16231. https://doi.org/10.1007/s00521-024-09962-x.
    27. Zhang, J., & Yin, P. (2019, November). Multivariate time series missing data imputation using recurrent denoising autoencoder. In 2019 IEEE Inter-national Conference on Bioinformatics and Biomedicine (BIBM) (pp. 760-764). IEEE. https://doi.org/10.1109/BIBM47256.2019.8982996.
    28. Goodfellow, I., Pouget-Abadie, J., Mirza, M., Xu, B., Warde-Farley, D., Ozair, S., ... & Bengio, Y. (2020). Generative adversarial net-works. Communications of the ACM, 63(11), 139-144. https://dl.acm.org/doi/10.1145/3422622.
  • Downloads

  • How to Cite

    YUN , S. ., & Kim, H.- joon. (2026). A Deep Learning Approach for Time Series Data Correction:‎Integrating Autoencoder-Based GANs andCorrelation Analysis. International Journal of Basic and Applied Sciences, 15(2), 79-93. https://doi.org/10.14419/ezf67m88