Analysis of Improved Agglomerative Hierarchical Clustering Algorithm for Distributed Data Mining In Version Control Systems (VCS)


  • S. G.Raja
  • Dr. K.Nirmala



Clustering process, classification, Convergence process, centroid.


The recent year researches are active areas and techniques most essentially and become commercial that have databases in knowledge in discovery and Data mining. In many cases with commodities and commonplace that have Business applications of data mining software. Although the business applications of data mining compared to disorganized discipline that are still relative in technical data of data mining. In this paper, the clustering algorithm on basis of newton-raphson methods has been utilized asymptotically for the attainment of the conduction of the good feasible linear data for most rapid accuracy correlated to the initial state algorithms with improved Agglomerative Hierarchical Clustering convergence. This algorithm provides the merits on following state algorithm during providence a calculation speed well than previous clustering methods.




[1] H. Cherait and N. Bounour. “Toward a Version Control System for Aspect Oriented Softwareâ€. In Proceeding of Model and Data Engineering (MEDI’11). Obidos, Portugal. LNCS 6918, pp. 110–121, September 28-30th 2011

[2] Lile Hattori, Marco D’Ambros, Michele Lanza and Mircea Lungu. “Software Evolution Comprehension: Replay to the Rescueâ€. In Proceedings of IEEE 19th International Conference on Program Comprehension (ICPC), pp. 161 – 170. 2011

[3] L. Yu and S. Ramaswamy, “Mining CVS repositories to understand open-source project developer roles,†in Proc. - ICSE 2007 Work. Fourth Int. Work. Min. Softw. Repos. MSR 2007, 2007, pp. 7–10

[4] Manish Verma, Mauly Srivastava, Neha Chack, Atul Kumar Diswar, Nidhi Gupta,†A Comparative Study of Various Clustering Algorithms in Data Mining,†International Journal of Engineering Reserch and Applications (IJERA), Vol. 2, Issue 3, pp.1379-1384, 2012.

[5] Sunny Wong, Yuanfang Cai, and Michael Dalton. “Change Impact Analysis with Stochastic Dependenciesâ€. Department of Computer Science, Drexel University, Technical Report DUCS-10-07, October. 2010

[6] Lanza, M. and Robbes, R. “A Change-based Approach to Software Evolutionâ€. In Proceedings of ENTCS’07, Volume 166, ISSN: 1571-0661, pp. 93-109. 2007

[7] Lanza, M. and Robbes, R. “A Change-based Approach to Software Evolutionâ€. In Proceedings of ENTCS’07, Volume 166, ISSN: 1571-0661, pp. 93-109. 2007

[8] L. Hattori and M. Lanza. “Syde: A tool for collaborative software developmentâ€. In Proceedings of ICSE 2010 (32nd ACM/IEEE Intl. Conf. on Software Engineering), pp.235– 238. 2010

[9] Kargupta H., Huang W., Sivakumar K., and Johnson E. Distributed clustering using collective principal component analysis. Knowledge and Information Systems Journal, 3:422–448, 2001.

[10] Eisenhardt M., Muller W., and Henrich A. Classifying Documents by Distributed P2P Clustering. In Proceedings of Informatik 2003, GI Lecture Notes in Informatics, Frankfort, Germany, 2003.

[11] Victor Y. Pan and Ai-Long Zheng. Real and complex polynomial root-finding with eigen-solving and preprocessing. In International Symposium on Symbolic and Algebraic Computation, pages 219–226, 2010.

[12] W. Maalej and H.-J. Happel, “Can Development Work Describe Itself?†7th IEEE Work. Conf. Min. Softw. Repos. (MSR 2010), pp. 191–200, 2010.

[13] T. Eisenbarth, R. Koschke, and D. Simon. Aiding Program Comprehension by Static and Dynamic Feature Analysis. In Proceedings of the IEEE International Conference of Software Maintenance (ICSM 2001), November 2001.

[14] MuhammedShafi. P,Selvakumar.S*, Mohamed Shakeel.P, “An Efficient Optimal Fuzzy C Means (OFCM) Algorithm with Particle Swarm Optimization (PSO) To Analyze and Predict Crime Dataâ€, Journal of Advanced Research in Dynamic and Control Systems, Issue: 06,2018, Pages: 699-707

[15] Selvakumar, S & Inbarani, Hannah & Mohamed Shakeel, P. (2016). A hybrid personalized tag recommendations for social E-Learning system. 9. 1187-1199.

[16] Shakeel, P.M., Baskar, S., Dhulipala, V.R.S. et al., “Cloud based framework for diagnosis of diabetes mellitus using K-means clusteringâ€, Health Inf Sci Syst (2018) 6: 16.

View Full Article: