Least Square Regression for Prediction Problems in Machine Learning using R
Keywords:Independent variable, Dependent variable, Least square regression.
The most commonly used prediction technique is Ordinary Least Squares Regression (OLS Regression). It has been applied in many fields like statistics, finance, medicine, psychology and economics. Many people, specially Data Scientists using this technique know that it has not gone with enough training to apply it and should be checked why & when it can or canâ€™t be applied.
Itâ€™s not easy task to find or explain about why least square regression  is faced much criticism when trained and tried to apply it. In this paper, we mention firstly about fundamentals of linear regression and OLS regression along with that popularity of LS method, we present our analysis of difficulties & pitfalls that arise while OLS method is applied, finally some techniques for overcoming these problems.
 Breiman, L. (1984). Classification and Regression Trees. New York: Routledge.
 Book: â€œRegression Analysis and Linear Models: Concepts, Applications, and Implementationâ€ by Richard B. Darlington, Andrew F. Hayes
 A. M. Bagirov, C. Clausen, M. Kohler, "Estimation of a regression function by maxima of minima of linear functions", IEEE Trans. Inf. Theory, vol. 55, no. 2, pp. 833-845, Febâ€™2009.
 L. GyÃ¶rfi, M. Kohler, A. KrzyÅ¼ak, H. Walk, A Distribution-Free Theory of Nonparametric Regression, New York, NY, USA: Springer-Verlag, 2002.
 D. W. Hosmer, S. Lemeshow, R. X. Sturdivant, Applied Logistic Regression, New York, NY, USA: Wiley, 2013.
 Widely Linear Complex-Valued Kernel Methods for Regression by Rafael Boloix-Tortosa ; Juan JosÃ© Murillo-Fuentes ; Irene Santos ; Fernando PÃ©rez-Cruz, published in IEEE Transactions on Signal Processing (Volume: 65, Issue: 19, Oct.1, 1 2017).
 Basics of R: https://www.udemy.com/r-basics/
 D. Buchczik, Least Median of Squares in Multivariate Calibration, 2005.
 Least product relative error estimation - Chen, Kani - Lin, YuanyuanWang, Zhanfeng Ying, Zhiliang - Journal of Multivariate Analysis, VL - 144, 2016, DA - 2016/02/01/, 0047-259X
 C. C. Aggarwal, Outlier Analysis, New York, NY, USA: Springer, 2013.
 P. Chen, L. Jiao, F. Liu, J. Zhao, Z. Zhao, S. Liu, "Semi-supervised double sparse graphs-based discriminant analysis for dimensionality reduction", Pattern Recognit., vol. 61, pp. 361-378, Jan. 2017.
View Full Article:
How to Cite
LicenseAuthors who publish with this journal agree to the following terms:
- Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under aÂ Creative Commons Attribution Licensethat allows others to share the work with an acknowledgement of the work''s authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal''s published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (SeeÂ The Effect of Open Access).