Frequent item set mining using normalized FP-growth algorithm

  • Authors

    • N K. Manikandan
    • D Manivannan
  • Item Setmining, FP Growth Algorithm, Association Rule Mining.
  • As the volume of data and its storage schemes are increasing drastically, the knowledge discovery from these huge volume of heterogeneous and high dimension data emerges as an essential process. Number of algorithms for data association analysis has been introduced considering time and main memory requirements. However this algorithms get completed when the items and records grows extremely high. In this paper we have created a data structure that can be reused without modifying the schema. So the aim of this work is to make an efficient association rule mining independent of the algorithm being selected.

    Our data structure make data access much faster by simplifying and reorganizing them by implementing shuffling strategy using hamming distance and inverted index mapping. In this work we explore our algorithm’s efficiency by using many datasets containing millions of records in it. We increased the runtime orders of the magnitude and reduced the auxiliary and main memory requirements.

  • References

    1. [1] R. Agrawal and R. Srikant, “Fast Algorithms for Mining Association Rules in Large Databases,†Proc 20th Int’l Conf. Very Large Data Bases (VLDB), pp. 487-499, 1994.

      [2] R. Agrawal and R. Srikant, “Privacy-Preserving Data Mining,†Proc. ACM SIGMOD Conf., pp. 439-450, 2000.

      [3] D. Beaver, S. Micali, and P. Rogaway, “The Round Complexity of Secure Protocols,†Proc. 22nd Ann. ACM Symp. Theory of Computing (STOC), pp. 503-513, 1990.

      [4] M. Bellare, R. Canetti, and H. Krawczyk, “Keying Hash Functions for Message Authentication,†Proc. 16th Ann. Int’l Cryptology Conf. Advances in Cryptology (Crypto), pp. 1-15, 1996.

      [5] J.C. Benaloh, “Secret Sharing Homomorphisms: Keeping Shares of a Secret Secret,†Proc. Advances in Cryptology (Crypto), pp. 251-260, 1986.

      [6] J. Brickell and V. Shmatikov, “Privacy-Preserving Graph Algorithms in the Semi-Honest Model,†Proc. 11th Int’l Conf. Theory and Application of Cryptology and Information Security (ASIACRYPT), pp. 236-252, 2005.

      [7] D.W.L. Cheung, J. Han, V.T.Y. Ng, A.W.C. Fu, and Y. Fu, “A Fast Distributed Algorithm for Mining Association Rules,†Proc. Fourth Int’l Conf. Parallel and Distributed Information Systems (PDIS), pp. 31-42, 1996.

      [8] D.W.L Cheung, V.T.Y. Ng, A.W.C. Fu, and Y. Fu, “Efficient Mining of Association Rules in Distributed Databases,†IEEE Trans. Knowledge and Data Eng., vol. 8, no. 6, Dec. 1996.

      [9] T. ElGamal, “A Public Key Cryptosystem and a Signature Scheme Based on Discrete Logarithms,†IEEE Trans. Information Theory, vol. IT-31, no. 4, July 1985.

      [10] A.V. Evfimievski, R. Srikant, R. Agrawal, and J. Gehrke, “Privacy Preserving Mining of Association Rules,†Proc. Eighth ACM SIGKDD Int’l Conf. Knowledge Discovery and Data Mining (KDD), pp. 217-228, 2002.

      [11] R. Fagin, M. Naor, and P. Winkler, “Comparing Information without Leaking It,†Comm. ACM, vol. 39, pp. 77-85, 1996.

      [12] M. Freedman, Y. Ishai, B. Pinkas, and O. Reingold, “Keyword Search and Oblivious Pseudorandom Functions,†Proc. Second Int’l Conf. Theory of Cryptography (TCC), pp. 303-324, 2005.

      [13] M.J. Freedman, K. Nissim, and B. Pinkas, “Efficient Private Matching and Set Intersection,†Proc. Int’l Conf. Theory and Applications of Cryptographic Techniques (EUROCRYPT), pp. 1-19, 2004.

      [14] O. Goldreich, S. Micali, and A. Wigderson, “How to Play Any Mental Game or a Completeness Theorem for Protocols with Honest Majority,†Proc. 19th Ann. ACM Symp. Theory of Computing (STOC), pp. 218-229, 1987.

  • Downloads

  • How to Cite

    K. Manikandan, N., & Manivannan, D. (2018). Frequent item set mining using normalized FP-growth algorithm. International Journal of Engineering & Technology, 7(1.8), 59-61.