A Normalized Least Mean Square and Dynamic Time Warping (DTW) Algorithm for an Intelligent Quran Tutoring System

  • Abstract
  • Keywords
  • References
  • PDF
  • Abstract

    Al-Quran is the most recited holy book in the Arabic language. Over 1.3-billion Muslim all over the world have an obligation to recite and learn Al-Quran. Learners from non-Arabic as well as from Arabic speaking communities face difficulties with Al-Quran recitation in the absence of a teacher (ustad) around. Advancement in speech recognition technology creates possible solutions to develop a system that has a capability to auricularly discern and validate the recitation. This paper investigates the speech recognition accuracy of template-based acoustic models and propose enhancement methods to improve the accuracy. A new scheme consists of enhancement of Normalized Least Mean Square (NLMS) and Dynamic Time Warping (DTW) algorithms have been proposed. The performance of the speech recognition accuracy was further improved by incorporating an adaptive optimal filtering with modified humming window for MFCC (Mel-frequency cepstral coefficients) using matching technique dynamic programming (DP), DTW (Dynamic Time Wrapping). The proposed scheme increases 5.5% of relative improvement in recognition accuracy achieved over conventional speech recognition process.



  • Keywords

    Dynamic Time Wrapping (DTW); Mel Frequency Cepstral Coefficient (MFCC); Normalize Least Mean Square (NLMS).

  • References

      [1] Dhiman, J., Ahmad, S., & Gulia, K. (2013). Comparison between Adaptive Filter Algorithms (LMS, NLMS and RLS). International Journal of Science, Engineering and Technology Research, 2(5), 1100-1103.

      [2] Mangamma, V., & Saravanan, V. (2014). Noise cancellation of speech signal by using adaptive filtering with averaging algorithm. international Conference on Innovations in Engineering and Technology, 3(3), 1917–1920.

      [3] Hadei, S. & M. lotfizad (2011). A family of Adaptive Filter Algorithms in noise cancellation for speech enhancement. International Journal of Computer and Electrical Engineering, 2(2), 1793–8163.

      [4] Anusuya, M. A., & Katti, S. K. (2010). Speech recognition by machine: A review. International Journal of Computer Science and Information Security, 6(3), 181-205.

      [5] Ibrahim, N. J., Razak, Z., Yusoff, Z. M., Idris, M. Y. I., Tamil, E. M., Noor, N. M., ... & Naemah, N. (2008). Quranic verse recitation recognition module for support in j-QAF learning: A review. International Journal of Computer Science and Network Security, 8(8), 207-216.

      [6] Ghule, K. R., & Deshmukh, R. R. (2015). Feature extraction techniques for speech recognition: A review. International Journal of Scientific and Engineering Research, 6(5), 2229-5518.

      [7] Liu, Y., Xiao, M., & Tie, Y. (2013). A noise reduction method based on LMS adaptive filter of audio signals. Proceedings of the 3rd International Conference on Multimedia Technology, pp. 1001-1008.

      [8] Darabian, D., Marvi, H., & Sharif Noughabi, M. (2015). Improving the performance of MFCC for Persian robust speech recognition. Journal of AI and Data Mining, 3(2), 149-156.

      [9] Kim, C., & Seo, K. D. (2005). Robust DTW-based recognition algorithm for hand-held consumer devices. IEEE Transactions on Consumer Electronics, 51(2), 699-709.

      [10] Afroz, F., Huq, A., & Sandrasegaran, K. (2015). Performance analysis of adaptive noise canceller employing NLMS algorithm. International Journal of Wireless and Mobile Networks, 7(2), 45-58.

      [11] Ahmed, A. H., & Abdo, S. M. (2017). Verification system for Quran recitation recordings. International Journal of Computer Applications, 163(4), 6-11.

      [12] Arora, S. J., & Singh, R. P. (2012). Automatic speech recognition: A review. International Journal of Computer Applications, 60(9), 34-44.

      [13] Karpagavalli, S., & Chandra, E. (2016). A review on automatic speech recognition architecture and approaches. International Journal of Signal Processing, Image Processing and Pattern Recognition, 9(4), 393-404.

      [14] Mansour, A. H., Salh, G. Z. A., & Mohammed, K. A. (2015). Voice recognition using dynamic time warping and mel-frequency cepstral coefficients algorithms. International Journal of Computer Applications, 116(2), 34-41.

      [15] Mohammed, J. R., Shafi, M. S., Imtiaz, S., Ansari, R. I., & Khan, M. (2012). An efficient adaptive noise cancellation scheme using ALE and NLMS filters. International Journal of Electrical and Computer Engineering, 2(3), 325-332.




Article ID: 25761
DOI: 10.14419/ijet.v7i4.15.25761

Copyright © 2012-2015 Science Publishing Corporation Inc. All rights reserved.