A Holistic Framework for Automated Answer Scoring: Unifying ‎Syntactic and Semantic Analysis

Deepender; Tarandeep Singh   Walia; Vinay   Kumar; Pavitra  Kumari; Sanju; Narender  Kumar

doi:10.14419/f18ev204

Authors and Affiliations

Deepender School of Computer Applications, Lovely Professional University, Punjab, India
Tarandeep Singh Walia School of Computer Applications, Lovely Professional University, Punjab, India
Vinay Kumar Department of Statistics, Central University of Haryana, Mahendergarh, India
Pavitra Kumari Department of Statistics, Central University of Haryana, Mahendergarh, India
Sanju Faculty of Agriculture, Guru Kashi University, Bathinda, Punjab, India
Narender Kumar Associate Professor, Dayanand College, Hisar

About this article

DOI:

https://doi.org/10.14419/f18ev204

Received:

04-06-2025

Accepted:

27-06-2025

Published:

02-07-2025

Views:

1300

Downloads:

559

Download PDF

Keywords:

Automated Answer Scoring; RNN; LSTM; Natural Language Processing; RoBERTa; Syntactic ‎Feature; Semantic Feature.

Abstract

In educational and assessment contexts, automated response scoring is crucial, especially for ‎effectively managing extensive assessments. This research integrates syntactic and semantic ‎data to present a novel hybrid model to handle issues in this domain. Recognizing the value ‎of integrating these complementary elements for improved comprehension and assessment ‎accuracy, the model has modular components that extract syntactic and semantic information ‎from responses. The hybrid model ensures a more reliable and flexible scoring system by ‎combining rule-based approaches with machine learning and deep learning techniques in a ‎unique way. The model gains more accuracy and adaptability by leveraging the advantages of ‎each method, which enables it to be used in a variety of educational settings. Experiments ‎were carried out utilizing a Hindi dataset to assess the hybrid model's efficacy. The model's ‎performance was evaluated using key performance indicators, including as accuracy, ‎precision, recall, and F1 score. The findings showed that the hybrid model works noticeably ‎better than conventional scoring techniques, offering superior outcomes in terms of accuracy ‎and flexibility. This suggests that the approach has the potential to completely transform ‎automated answer scoring in educational settings that are bilingual and culturally diverse. All ‎things considered, this hybrid strategy presents a viable way to enhance automated answer ‎scoring systems. This model can improve the fairness and scalability of assessments across ‎many languages and educational settings by combining syntactic and semantic elements and ‎utilizing cutting-edge technologies, which will help create evaluation procedures that are more ‎inclusive and dependable‎.

References

Sharma, R., et al. (2020). Leveraging Automated Scoring in Telecaller Recruitment: A Case Study of Startup Apna. International Conference on Information Systems, 128-140.

Kumar, S., & Kumar, A. (2021). Challenges in Hindi Natural Language Processing: A Review. International Journal of Computational Linguistics, 38(3), 412-426.

Singh, V., & Gupta, R. (2022). Advances in Hindi Natural Language Processing for Automated Scoring. ACM Transactions on Asian Language Information Processing, 21(1), 56-68.

Smith, A. (2018). Advancements in Rule-based Natural Language Processing Systems. Journal of Computational Linguistics, 35(2), 215-228.

Jones, B., et al. (2019). Machine Learning Models for Automated Answer Scoring: A Review. Educational Technology Research and Development, 67(4), 923-939.

View more references (18)

Johnson, C., & Brown, D. (2020). Rule-based Approaches to Automated Answer Scoring: Challenges and Opportunities. Journal of Educational Technology, 45(3), 421-435.

Garcia, M., et al. (2021). Enhancing Automated Answer Scoring with Machine Learning Techniques. International Journal of Artificial Intelligence in Education, 31(1), 87-102

Choi, H., et al. (2022). Integrating Syntactic and Semantic Features in Automated Scoring: A Hybrid Approach. International Conference on Arti-ficial Intelligence, 156-168.

Goldberg, Y. (2017). Neural Network Methods for Natural Language Processing. Synthesis Lectures on Human Language Technologies. https://doi.org/10.1007/978-3-031-02165-7.

Bishop, C. M. (2006). Pattern Recognition and Machine Learning. Springer. Breiman, L. (2001). Random Forests. Machine Learning, 45(1), 5-32. https://doi.org/10.1023/A:1010933404324.

Rajkomar, A., et al. (2018). Scalable and Accurate Deep Learning with Electronic Health Records. npj Digital Medicine, 1(1), 1-10.

Lipton, Z. C. (2016). The Mythos of Model Interpretability. arXiv preprint arXiv:1606.03490.

Kotsiantis, S. B., et al. (2007). Supervised Machine Learning: A Review of Classification Techniques. Emerging Artificial Intelligence Applications in Computer Engineering, 3(1), 3-24.

Hochreiter, S., & Schmidhuber, J. (1997). Long Short-Term Memory. Neural Computation, 9(8), 1735-1780. https://doi.org/10.1162/neco.1997.9.8.1735.

Devlin, J., et al. (2018). BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. arXiv preprint arXiv:1810.04805.

Walia, T. S. Investigating the scope of semantic analysis in natural language processing considering accuracy and performance. In Recent Advances in Computing Sciences (pp. 323-328). CRC Press.

Walia, T. S. (2024). Hybrid Approach for Automated Answer Scoring Using Semantic Analysis in Long Hindi Text. Revue d'Intelligence Artificiel-le, 38(1). https://doi.org/10.18280/ria.380122.

Deepender, & Walia, T. S. (2022, November). Investigating the Role of Semantic Analysis in Automated Answer Scoring. In International Confer-ence on Innovations in Computational Intelligence and Computer Vision (pp. 559-571). Singapore: Springer Nature Singapore. https://doi.org/10.1007/978-981-99-2602-2_42.

Sanju, Kumar, V., & Kumari, P. (2024). Evaluating the Performance of Bayesian Approach for Imputing Missing Data under different Missingness Mechanism. Sankhya B, 1-11. https://doi.org/10.1007/s13571-024-00344-w.

Sanju, Kumar, V., & Deepender. (2023). Evaluation of imputation techniques for genotypic data of soybean crop under missing completely at ran-dom mechanism.

Singh, D. (2023). Non-linear growth models for acreage, production and productivity of food-grains in Haryana.

Shrivastava, U., & Verma, J. K. (2021, December). A Study on 5G Technology and Its Applications in Telecommunications. In 2021 International Conference on Computational Performance Evaluation (ComPE) (pp. 365-371). IEEE. https://doi.org/10.1109/ComPE53109.2021.9752402.

KUMAR, V., & Kumari, P. (2023). Analysis of Incomplete Data Under Different Missingness Mechanism using Imputation Methods for Wheat Genotypes. Current Agriculture Research Journal, 11(3). https://doi.org/10.12944/CARJ.11.3.33

How to Cite

Deepender, Walia, T. S. . ., Kumar, V. . ., Kumari , P. ., Sanju, & Kumar , N. . (2025). A Holistic Framework for Automated Answer Scoring: Unifying ‎Syntactic and Semantic Analysis. International Journal of Basic and Applied Sciences, 14(2), 572-576. https://doi.org/10.14419/f18ev204

Download Citation