Correlation-based clustering and the modified naïve-Bayesian-classification for gene-sequence data analysis

  • Authors

    • Vijay Arputharaj, Scholar, Karpagam University; Lecturer, Jigjiga University
    • Dr. S. Sheeja, Associate Professor, Dept. of Computer Applications, Karpagam Academy of Higher Education
    • Dr. K. Anuradha
    2019-03-22
    https://doi.org/10.14419/ijet.v7i4.25557
  • Clustering, Classification, Gene Sequence, Data Analysis.
  • Correlation-based clustering partitions statistical data into a suitable number of clusters according to the correlations among the analysed data points. Data mining is the process of discovering patterns in large datasets, drawing on techniques from machine learning, statistics, and database systems. The proposed technique classifies gene sequences using a novel classification method (a modified naïve Bayesian classifier), which improves classification accuracy under the curse of dimensionality. Grouping the gene data with correlation-based clustering also reduces execution time.

     

     

  • How to Cite

    Arputharaj, V., Sheeja, S., & Anuradha, K. (2019). Correlation-based clustering and the modified naïve-Bayesian-classification for gene-sequence data analysis. International Journal of Engineering & Technology, 7(4), 5292-52996. https://doi.org/10.14419/ijet.v7i4.25557
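
The abstract describes a two-stage pipeline: group gene data by correlation, then classify it with a naïve Bayesian classifier. The following is a minimal sketch of that general idea, not the authors' method. The abstract does not specify the feature representation, the clustering rule, or the modification to naïve Bayes, so everything here is an assumption for illustration: 2-mer count features, a greedy seed-based grouping with a hypothetical Pearson threshold of 0.8, a standard multinomial naïve Bayes with Laplace smoothing, and made-up toy sequences and labels.

```python
import math
from collections import Counter
from itertools import product

ALPHABET = "ACGT"

def kmer_counts(seq, k=2):
    """Fixed-length vector of overlapping k-mer counts (assumed feature encoding)."""
    counts = Counter(seq[i:i + k] for i in range(len(seq) - k + 1))
    return [counts.get("".join(p), 0) for p in product(ALPHABET, repeat=k)]

def pearson(x, y):
    """Pearson correlation of two equal-length vectors; 0.0 if either is constant."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy) if sx and sy else 0.0

def correlation_clusters(vectors, threshold=0.8):
    """Greedy grouping (an assumption): a vector joins the first cluster whose
    seed it correlates with at >= threshold, otherwise it seeds a new cluster."""
    seeds, labels = [], []
    for v in vectors:
        for idx, seed in enumerate(seeds):
            if pearson(v, seed) >= threshold:
                labels.append(idx)
                break
        else:
            seeds.append(v)
            labels.append(len(seeds) - 1)
    return labels

class MultinomialNB:
    """Standard multinomial naive Bayes with Laplace smoothing (not the paper's modified variant)."""

    def fit(self, X, y, alpha=1.0):
        self.classes = sorted(set(y))
        self.log_prior, self.log_like = {}, {}
        for c in self.classes:
            rows = [x for x, label in zip(X, y) if label == c]
            self.log_prior[c] = math.log(len(rows) / len(X))
            # Smoothed per-feature totals for this class
            totals = [sum(col) + alpha for col in zip(*rows)]
            denom = sum(totals)
            self.log_like[c] = [math.log(t / denom) for t in totals]
        return self

    def predict(self, x):
        def score(c):
            return self.log_prior[c] + sum(n * lp for n, lp in zip(x, self.log_like[c]))
        return max(self.classes, key=score)

# Toy data: sequences and labels are invented for illustration only.
train_seqs = ["ATATATATAT", "TATATATTAT", "GCGCGCGCGC", "CGCGCGGCGC"]
train_labels = ["AT-rich", "AT-rich", "GC-rich", "GC-rich"]

train_vecs = [kmer_counts(s) for s in train_seqs]
cluster_ids = correlation_clusters(train_vecs)
model = MultinomialNB().fit(train_vecs, train_labels)
prediction = model.predict(kmer_counts("ATTATATA"))
```

In a fuller pipeline, the cluster assignments could be used to train or query one classifier per group, which is one plausible reading of how grouping reduces execution time; the abstract does not say how the two stages are combined.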