Optimized Random Forest Classifier for Predicting Ideal Candidate for The General Election
-
https://doi.org/10.14419/zxxek146
Received date: July 22, 2025
Accepted date: September 10, 2025
Published date: September 17, 2025
-
Bayesian Optimization Technique; Decision Trees; Election Candidate; Election Party; Random Forest Classifiers -
Abstract
Electoral systems and candidate selections are the two important pillars of modern democratic elections. The integrity and competence of candidates are significant contributing factors to government enactment. Hence, the selection of competent candidates is an indispensable process in the electoral system. This research proposes a new model, named RANWIN (Random forest classifier-based model for selecting Winning candidates), for predicting the winning probability of the candidate and the party. Our model integrates Random Forest Classifier (RFC) and Bayesian Optimization Technique (BOT) to achieve outstanding performance. It considers several factors for predicting the winning probability of candidates, including the popularity of the candidate, past election history, party change or personalization, vote bank, etc. For predicting the winning probability of the party, this method considers the manifestoes, vote bank, leaders, electoral history, intraparty struggles, major rallies hosted in the constituency, and the strength of alliance parties etc. After calculating the impact of all these parameters, RANWIN uses RFC to predict the winning possibility of the candidates as well as parties. To enhance the efficiency of the intended framework, we apply BOT in the prevailing RFC to find out the optimal hyperparameters of the classification process. To prove the better performance of our framework, its enactment is related to other existing approaches regarding accuracy, precision, the Area Under the receiver operating Characteristic (AUC) curve, recall, F-measure, and Root Mean Square Error (RMSE). The empirical results demonstrate that RANWIN can be considered as a more effective method with higher predictive accuracy, precision, AUC, recall, F-measure, and lower RMSE of 94.32%, 95.5%, 90.6%, 97.6%, 98.0%, and 16.23%, correspondingly. As a result, we can prove that RANWIN is an improved framework for candidate selection and has a positive impact on the current political landscape as related to approaches based on other machine learning techniques. The key goal of this research is to help political parties select the right candidates to win public office.
-
References
- Austin Ranney, ‘Candidate Selection’, in David Butler, Howard R. Penniman and Austin Ranney (eds), Democracy at the Polls, Washington, DC, American Enterprise Institute, 1981, pp. 75–106, quoted on p. 75.
- Gideon Rahat and Reuven Y. Hazan, ‘Candidate Selection Methods: An Analytical Framework’, Party Politics, 7:3 2001, pp. 297–322. https://doi.org/10.1177/1354068801007003003.
- Söderlund, P., Schoultz, A., Papageorgiou, A., Coping with complexity: Ballot position effects in the Finnish open-list proportional representation system, Electoral Studies, vol. 71, 2021, 102330, https://doi.org/10.1016/j.electstud.2021.102330.
- Jean-François Daoust & Gabrielle Péloquin-Skulski What Are the Consequences of Snap Elections on Citizens’ Voting Behav-ior?, Representation, 57:1, 95-108, 2021, https://doi.org/10.1080/00344893.2020.1804440.
- S. Issenberg, How President Obama's campaign used big data to rally individual voters, MIT Technol Rev 116(1), 2012, 38–49
- L. Breiman, “Random forests,” Machine learning, vol. 45, no. 1, 2001, pp. 5–32. https://doi.org/10.1023/A:1010933404324.
- Biau, G., Scornet, E. A random forest guided tour. TEST 25, 197–227 (2016). https://doi.org/10.1007/s11749-016-0481-7.
- Chen, T., & Guestrin, C. (2016). XGBoost: A scalable tree boosting system. In R. B. L. et al. (Eds.), Proceedings of the 22nd ACM SIGKDD In-ternational Conference on Knowledge Discovery and Data Mining (pp. 785–794). https://doi.org/10.1145/2939672.2939785.
- Alao, J. O., & Karami, A. (2025). Harnessing social media sentiment for predictive insights into the 2023 Nigerian presidential election. IEEE Ac-cess, 13, 12345–12358.
- A. Liaw and M. Wiener, "Classification and regression by random forest," R News , vol. 2, no. 3, 2002, pp. 18- 22.
- S. Xia, G. Wang, Z. Chen, Y. Duan, and Q. Jiu, "Complete random forest based class noise filtering learning for improving the generalizability of classifiers," IEEE Trans. Know/. Data Eng., vol. 31, no. 11, November 2019, pp. 2063- 2078. https://doi.org/10.1109/TKDE.2018.2873791.
- M. A. Hannan, J. A. Ali, A. Mohamed, and M. N. Uddin, "A ran dom forest regression based space vector PWM inverter controller for the induction motor d1ive," IEEE Trans. Ind. Electron., vol. 64, no. 4, April 2017, pp. 2689- 2699. https://doi.org/10.1109/TIE.2016.2631121.
- Z. Ge, Z. Song, S. X. Ding, and B. Huang, "Data mining and analytics in the process industry: The role of machine learning," IEEE Access, vol. 5, 2017, pp. 20590- 20616. https://doi.org/10.1109/ACCESS.2017.2756872.
- Lewis-Beck, M.S., Rice, T.W., Forecasting presidential elections: a comparison of naive models. Polit Behav, vol. 6, no. 1, 1984, pp. 9–21, 1984. https://doi.org/10.1007/BF00988226.
- Dwi Prasetyo N, Hauf C, Twitter-based election prediction in the developing world. In: Proceedings of the 26th ACM Conference on Hypertext & Social Media, ACM, 2015, pp 149–158. https://doi.org/10.1145/2700171.2791033.
- Kagan V, Stevens A, Subrahmanian V Using twitter sentiment to forecast the 2013 pakistani election and the 2014 indian election. IEEE Intell Syst 1:2–5, 2015. https://doi.org/10.1109/MIS.2015.16.
- Andreas Graefe AC (2014) State-by-state political economy model. https://pollyvote.com/en/components/econometric-models/jerom e-jerome/
- Campbell JE, Norpoth H et al, A recap of the 2016 election forecasts. PS: Polit Sci Polit 50(2):331–338, 2017. https://doi.org/10.1017/S1049096516002766.
- Tien C, Lewis-Beck MS, In forecasting the 2016 election result, modelers had a good year. pollsters did not. USApp–American Politics and Policy Blog, 2016.
- Dassonneville R, Lewis-Beck MS, Comparative election forecasting. synthetic models for europe. In: Conference on methodological innovations in the study of elections in Europe and beyond, College Station, 2014.
- Lewis-Beck MS, Tien CP, House forecasts: structure-x models for 2018. PS: Polit Sci Polit 51(S1):17–20. https://doi.org/10.1017/S1049096518001257.
- Dassonneville R, Lewis-Beck MS, Mongrain P, Forecasting Dutch elections: an initial model from the March 2017 legislative contests. Res Polit 4(3):2053168017720023, 2017. https://doi.org/10.1177/2053168017720023.
- Holbrook TM, Incumbency, national conditions, and the 2012 presidential election. PS: Polit Sci Polit 45(4):640–643, 2012. https://doi.org/10.1017/S1049096512000923.
- Traugott MW, Public opinion polls and election forecasting. PS: Polit Sci Polit 47(2):342–344, 2014. https://doi.org/10.1017/S1049096514000171.
- Blumenthal M, Polls, forecasts, and aggregators. PS: Polit Sci Polit 47(2):297–300, 2014. https://doi.org/10.1017/S1049096514000055.
- Silver N (2018) How fvethirtyeights house, senate and governor models work. https://fvethirtyeight.com/methodology/how-fvethirty eights-house-and-senate-models-work/. Accessed 08 October 2021.
- Feldman R, Techniques and applications for sentiment analysis. Communication ACM 56(4):82–89, 2013. https://doi.org/10.1145/2436256.2436274.
- Tumasjan A, Sprenger TO, Sandner PG, Welpe IM, Predicting elections with twitter: what 140 characters reveal about political sentiment. ICWSM 10(1):178–185, 2010. https://doi.org/10.1609/icwsm.v4i1.14009.
- Mislove A, Lehmann S, Ahn Y-Y, Onnela J-P, Rosenquist JN, Understanding the demographics of twitter users. ICWSM 11(5th):25, 2011.
- Mustafaraj E, Finn S, Whitlock C, Metaxas PT, Vocal minority versus silent majority: Discovering the opinions of the long tail. In: Privacy, Security, Risk and Trust (PASSAT) and 2011 IEEE Third Inernational Conference on Social Computing (SocialCom), 2011 IEEE Third International Con-ference on, IEEE, 2011, pp 103–110. https://doi.org/10.1109/PASSAT/SocialCom.2011.188.
- Skoric M, Poor N, Achananuparp P, Lim E-P, Jiang J , Tweets and votes: A study of the 2011 Singapore general election. In: System Science (HICSS), 2012 45th Hawaii International Conference on, IEEE, 2012, pp 2583–2591. https://doi.org/10.1109/HICSS.2012.607.
- Wan V, Campbell W. Support vector machines for speaker verifcation and identifcation. In: IEEE proceeding. 2000
- Yeung KY, Bumgarner RE, Raftery AE. Bayesian model averaging: development of an improved multi-class, gene selection and classifcation tool for microarray data. Bioinformatics. 2005;21:2394–402. https://doi.org/10.1093/bioinformatics/bti319.
- Quinlan JR, Rivest RL. Inferring decision trees using the minimum description length principle. Inform Comput. 1989;80(3):227–48. https://doi.org/10.1016/0890-5401(89)90010-2.
-
Downloads
-
How to Cite
Raju , K., Lavanya, R. ., Lalitha , R. ., & R, S. S. (2025). Optimized Random Forest Classifier for Predicting Ideal Candidate for The General Election. International Journal of Basic and Applied Sciences, 14(5), 649-658. https://doi.org/10.14419/zxxek146
