A novel approach based on sequence prediction for webpage access


  • Nguyen Thon Da Faculty of Information Systems, University of Economics and Law, VNU-HCM
  • Tan Hanh Faculty of Information Technology, Posts and Telecommunications Institute of Technology






CPT, CPT , Sequence Prediction, Web Mining.


Predicting the next item of a sequence over a finite alphabet is highly important in Web Mining. This paper presents a solution to improve the performance of sequence prediction; first and foremost, predicting what is the next Web page that will be visited by that user for prefetching the Web page. The proposed approach is how to decrease the complexity of the prediction space. Experimental results on a few real-life datasets show that the time execution of this novel approach is better than that of traditional approaches.




[1] T. C. Bell, J. G. Cleary, and I. H. Witten, Text compression: Prentice-Hall, Inc., 1990.

[2] C. Draa, J. Tayeb, S. Niar, and E. Grislin, "Application sequence prediction for energy consumption reduction in mobile systems." pp. 23-30.

[3] A. J. Majda, I. Timofeyev, and E. V. Eijnden, “Models for stochastic climate prediction,†Proceedings of the National Academy of Sciences, vol. 96, no. 26, pp. 14687-14691, 1999. https://doi.org/10.1073/pnas.96.26.14687.

[4] G. Suchacka, and S. Stemplewski, "Application of Neural Network to Predict Purchases in Online Store." pp. 221-231.

[5] S. Bahram, D. Sen, and R. S. Amant, "Prediction of web page accessibility based on structural and textual features." p. 31.

[6] M. T. Hassan, K. N. Junejo, and A. Karim, "Learning and predicting key Web navigation patterns using Bayesian models." pp. 877-887.

[7] D. Bonino, F. Corno, and G. Squillero, "A real-time evolutionary algorithm for web prediction." pp. 139-145.

[8] M. Li, X. Yu, and K. H. Ryu, “MapReduce-based web mining for prediction of web-user navigation,†Journal of Information Science, vol. 40, no. 5, pp. 557-567, 2014. https://doi.org/10.1177/0165551514544096.

[9] N. Labroche, N. Monmarché, and G. Venturini, "A new clustering algorithm based on the chemical recognition system of ants." pp. 345-349.

[10] L. Jianhui, and Z. Bingjie, "A Web Prediction Pattern Recommendation Algorithm." pp. 263-266.

[11] Q. Yang, T. Li, and K. Wang, “Building association-rule based sequential classifiers for web-document prediction,†Data mining and knowledge discovery, vol. 8, no. 3, pp. 253-273, 2004. https://doi.org/10.1023/B:DAMI.0000023675.04946.f1.

[12] R. Geetharamani, P. Revathy, and S. G. Jacob, “Prediction of users webpage access behaviour using association rule mining,†Sadhana, vol. 40, no. 8, pp. 2353-2365, 2015. https://doi.org/10.1007/s12046-015-0424-0.

[13] P. Fournier-Viger, T. Gueniche, and V. S. Tseng, "Using Partially-Ordered Sequential Rules to Generate More Accurate Sequence Prediction." pp. 431-442.

[14] E. Frias-Martinez, and V. Karamcheti, "A prediction model for user access sequences."

[15] M. Géry, and H. Haddad, "Evaluation of web usage mining approaches for user's next request prediction." pp. 74-81.

[16] D. Dhyani, S. Bhowmick, and W.-K. Ng, "Modelling and predicting a Web page accesses using Markov processes." pp. 332-336.

[17] V. M. Rao, and V. V. Kumari, “An efficient hybrid successive Markov model for predicting web user usage behavior using web usage mining,†International Journal of Data Engineering (IJDE), vol. 1, no. 5, pp. 43-62, 2010.

[18] X. Dongshan, and S. Junyi, “A new markov model for web access prediction,†Computing in Science & Engineering, vol. 4, no. 6, pp. 34-39, 2002. https://doi.org/10.1109/MCISE.2002.1046594.

[19] C. S. Iliopoulos, C. Makris, Y. Panagis, K. Perdikuri, E. Theodoridis, and A. Tsakalidis, “The weighted suffix tree: an efficient data structure for handling molecular weighted sequences and its applications,†Fundamenta Informaticae, vol. 71, no. 2, 3, pp. 259-277, 2006.

[20] V. S. Tseng, K. W. Lin, and J.-C. Chang, “Prediction of user navigation patterns by mining the temporal web usage evolution,†Soft Computing-A Fusion of Foundations, Methodologies and Applications, vol. 12, no. 2, pp. 157-163, 2008.

[21] M. Narvekar, and S. S. Banu, “Predicting user's Web navigation behavior using hybrid approach,†Procedia Computer Science, vol. 45, pp. 3-12, 2015. https://doi.org/10.1016/j.procs.2015.03.073.

[22] B. Nigam, S. Tokekar, and S. Jain, “Evaluation of models for predicting user's next request in web usage mining,†international Journal on Cybernetics & informatics (UCi), vol. 4, pp. 1-13, 2015.

[23] M. A. Awad, and I. Khalil, “Prediction of user's web-browsing behavior: Application of markov model,†IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics), vol. 42, no. 4, pp. 1131-1142, 2012. https://doi.org/10.1109/TSMCB.2012.2187441.

[24] M. Awad, L. Khan, and B. Thuraisingham, “Predicting WWW surfing using multiple evidence combination,†The VLDB Journal—The International Journal on Very Large Data Bases, vol. 17, no. 3, pp. 401-417, 2008.

[25] P. Sampath, A. Wahi, and D. Ramya, “A COMPARATIVE ANALYSIS OF MARKOV MODEL WITH CLUSTERING AND ASSOCIATION RULE MINING FOR BETTER WEB PAGE PREDICTION,†Journal of Theoretical & Applied Information Technology, vol. 63, no. 3, 2014.

[26] J. Zhu, J. Hong, and J. G. Hughes, "Using markov chains for link prediction in adaptive web sites," Soft-Ware 2002: Computing in an Imperfect World, pp. 60-73: Springer, 2002. https://doi.org/10.1007/3-540-46019-5_5.

[27] P. Thwe, "Using Markov Model and Popularity and Similarity Based PageRank Algorithm for Web Page Access Prediction."

[28] S. Dubey, and N. Mishra, “Web page prediction using hybrid model,†International Journal on Computer Science and Engineering, vol. 3, no. 5, pp. 2170-2176, 2011.

[29] F. Khalil, J. Li, and H. Wang, "Integrating recommendation models for improved web page prediction accuracy." pp. 91-100.

[30] T. Gueniche, P. Fournier-Viger, and V. S. Tseng, "Compact Prediction Tree: A Lossless Model for Accurate Sequence Prediction." pp. 177-188.

[31] T. Gueniche, P. Fournier-Viger, R. Raman, and V. S. Tseng, "CPT+: Decreasing the time/space complexity of the Compact Prediction Tree." pp. 625-636.

[32] V. Padmanabhan, and J. Mogul, “Using Prefetching to Improve World Wide Web Latency,†Computer Communications, vol. 16, pp. 358-368, 1998.

[33] J. Pitkow, and P. Pirolli, "Mininglongestrepeatin g subsequencestopredict worldwidewebsurfing." p. 1.

P. Laird, and R. Saul, “Discrete sequence prediction and its applications,†Machine learning, vol. 15, no. 1, pp. 43-68, 1994. https://doi.org/10.1007/BF01000408

View Full Article: