A Novel Privacy Preserving Data mining using improved decision tree and KP-ABE on High Dimensional Data

  • Abstract
  • Keywords
  • References
  • PDF
  • Abstract

    In distributed data mining environment maintaining individual data or patterns is a major issue due to high dimensionality and data size. Distributed Data mining framework can help to find the essential decision making patterns from distributed data. Privacy preserving data mining (PPDM) has emerged as a main research area for data confidentiality and knowledge sharing in between the communicating parties. As the distributed data of the individuals are stored by the third party, it leads to the misuse of distributed information in digital networks. Most of the decision patterns generated using the machine learning models for business organizations, industries and individuals has to be encoded before it is publicly shared or published. As the amount of data collected from different sources are increasing exponentially, the time taken to preserve the patterns using the  traditional privacy preserving data mining models also increasing due to high computational attribute selection measures and noise in the distributed data. Also, filling sparse values using the conventional models are inefficient and infeasible for privacy preserving models. In this paper, a novel privacy preserving based classification model was designed and implemented on large datasets. In this model, a filter-based privacy preserving model using improved decision tree classifier is implemented to preserve the decision patterns using IPPDM-KPABE model. Experimental results proved that the proposed model has high computational efficiency compared to the traditional privacy preserving model on high dimensional datasets.


  • Keywords

    Privacy Preserving Data Mining, Decision Trees, ABE.

  • References

      [1] P. K. Fong and J. H. Weber-Jahnke, "Privacy Preserving Decision Tree Learning Using Unrealized Data Sets," in IEEE Transactions on Knowledge and Data Engineering, vol. 24, no. 2, pp. 353-364, Feb. 2012.

      [2] L. Liu M. Kantarcioglu B. Thuraisingham "Privacy Preserving Decision Tree Mining from Perturbed Data" Proc. 42nd Hawaii Int'l Conf. System Sciences (HICSS '09) 2009.

      [3] M. Shaneck Y. Kim "Efficient Cryptographic Primitives for Private Data Mining" Proc. 43rd Hawaii Int'l Conf. System Sciences (HICSS) pp. 1-9 2010.

      [4] L. Xu, C. Jiang, Y. Qian, J. Li, Y. Zhao and Y. Ren, "Privacy-Accuracy Trade-off in Differentially-Private Distributed Classification: A Game Theoretical Approach," in IEEE Transactions on Big Data, vol. PP, no. 99, pp. 1-1.

      [5] P.K. Fong Privacy Preservation for Training Data Sets in Database: Application to Decision Tree Learning 2008.

      [6] J. Vaidya C. Clifton "Privacy Preserving Association Rule Mining in Vertically Partitioned Data" Proc Eighth ACM SIGKDD Int'l Conf. Knowledge Discovery and Data Mining (KDD '02) pp. 23-26 2002-July.

      [7] L. Xu, C. Jiang, Y. Qian, J. Li, Y. Zhao and Y. Ren, "Privacy-Accuracy Trade-off in Differentially-Private Distributed Classification: A Game Theoretical Approach," in IEEE Transactions on Big Data, vol. PP, no. 99, pp. 1-1.

      [8] M. Bewong, J. Liu, L. Liu, J. Li and K. K. R. Choo, "A Relative Privacy Model for Effective Privacy Preservation in Transactional Data," 2017 IEEE Trustcom/BigDataSE/ICESS, Sydney, NSW, 2017, pp. 394-401.

      [9] A. Gkoulalas-Divanis G. Loukides J. Sun "Publishing data from electronic health records while preserving privacy: A survey of algorithms" <em>JBI</em> vol. 50 pp. 4-19 2014.

      [10] H. Zakerzadeh C.C Aggarwal K. Barker "Managing dimensionality in data privacy anonymization" <em>KIS</em> vol. 49 no. 1 pp. 341-373 2016.

      [11] S. Qiu, B. Wang, M. Li, J. Liu and Y. Shi, "Toward Practical Privacy-Preserving Frequent Itemset Mining on Encrypted Cloud Data," in IEEE Transactions on Cloud Computing, vol. PP, no. 99, pp. 1-1.

      [12] Sastry, J.K.R., Ganesh, J.V., Bhanu, J.S., I2C based networking for implementing heterogeneous microcontroller based distributed embedded systems, Indian Journal of Science and Technology, Volume 8, Issue 15, 2015

      [13] Sastry, J.K.R., Naga Sai Tejasvi, T., Aparna, J., Dynamic scheduling of message flow within a distributed embedded system connected through a RS485 network, ARPN Journal of Engineering and Applied Sciences, Volume 12, Issue 9, 1 May 2017, Pages 2809-2817

      [14] Sastry, J.K.R., Suresh, A., Bhanu, S.J., Building heterogeneous distributed embedded systems through rs485 communication protocol, ARPN Journal of Engineering and Applied Sciences, 2015, 10(16), pp. 6793-6803




Article ID: 10874
DOI: 10.14419/ijet.v7i2.7.10874

Copyright © 2012-2015 Science Publishing Corporation Inc. All rights reserved.