Designing a Data Algorithm Prediction Model based on R
DOI:
https://doi.org/10.14419/ijet.v7i3.33.21184Keywords:
Splitting data, utility functions, Cross validation, Analysis, Accuracy, Visualization, Prediction.Abstract
This study is about data analysis and prediction model using R open source language, R is a language and environment for statistical computing and graphics so it’s a full function programming language. In this study through using the Titanic datasets, we created a model that predict the survival rates of the test dataset, we used the Train dataset that has the survived variable with levels of "0"(perished) and "1"(survived) data and the test dataset with no survived levels to predict the survival rates on the test. In this study, we used the R functions, packages, and machine learning algorithms that provided in R, to combine, analysis and splitting the data we created utility functions to make features and predictive potential values for our new variables. We Analyzed the Training data set by cross-validation with 82% accuracy, visualization, and decision tree and then leveraged it to test data to predict the survival rate on test data.
References
[1] Practical data Processing and Analysis Using R
[3] https://www.kaggle.com/c/titanic/data
[4] Practical data Processing and Analysis Using R
[5] Patent Big Data Analysis by R Data Language for Technology Management. International Journal of Software Engineering and Its Applications Vol. 10, No. 1(2016), pp, 69-78
[6] http://www.dodomira.com/tag/ggplot2/
[7] https://thebook.io/006723/ch10/03/04/03/
[9] http://blog.heartcount.io/random-forest-ver-10
[10] https://cran.r-project.org/web/packages/rpart.plot/rpart.plot.pdf
[11] http://www.dodomira.com/2016/05/29/564/
[12] A Review Study on Big Data. Anshul Jatain et al, International Journal of Computer Science and Mobile Computing, Vol.6 Issue.6, June- 2017, pg. 8-13
[14] http://dataaspirant.com/2017/02/03/decision-tree-classifier-implementation-in-r/
How to Cite
License
Authors who publish with this journal agree to the following terms:- Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution Licensethat allows others to share the work with an acknowledgement of the work''s authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal''s published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (See The Effect of Open Access).
Accepted 2018-10-07