Detection of outliers in the linear regression model with application to well water pollution data on the outskirts of the city of Mosul
Abstract
This research examines the effect of outliers on the parameter estimates of multiple linear regression models. Outlying values in the data are detected and diagnosed according to whether they occur in the independent variables or in the dependent variable, since either kind affects the estimation of the parameters of the studied model. The types of outliers and the methods for treating them are identified, with the aim of obtaining a better, more efficient model or of reducing the influence of these values on the model. The mean squared error (MSE) criterion was adopted to compare the treatment methods, which were applied to real data obtained from the Dams and Water Resources Research Center, University of Mosul. The method suggested by (2009) was the best at detection among the methods used.
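The distinction drawn above between outliers in the independent variables and outliers in the dependent variable can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the study: the data are synthetic, the helper name `ols_diagnostics` is hypothetical, outliers in the independent variables are flagged by leverage above the common rule of thumb 2p/n, outliers in the dependent variable by an absolute internally studentized residual above 2, and the "treatment" is simple deletion, compared by MSE before and after.

```python
import numpy as np

def ols_diagnostics(X, y):
    """Fit OLS with an intercept; return coefficients, MSE, leverages and
    internally studentized residuals (hypothetical helper for illustration)."""
    n = X.shape[0]
    A = np.column_stack([np.ones(n), X])          # design matrix with intercept
    p = A.shape[1]
    beta, *_ = np.linalg.lstsq(A, y, rcond=None)  # least squares estimates
    resid = y - A @ beta
    mse = resid @ resid / (n - p)                 # residual mean squared error
    Q, _ = np.linalg.qr(A)                        # reduced QR of the design matrix
    lev = np.sum(Q**2, axis=1)                    # leverages: diagonal of the hat matrix
    stud = resid / np.sqrt(mse * (1.0 - lev))     # internally studentized residuals
    return beta, mse, lev, stud

# Synthetic data (assumed, not the well water data from the study).
rng = np.random.default_rng(0)
n = 40
X = rng.normal(size=(n, 2))
X[10] = [6.0, -6.0]                               # outlier in the independent variables
y = 1.0 + 2.0 * X[:, 0] - 1.0 * X[:, 1] + rng.normal(scale=0.3, size=n)
y[5] += 8.0                                       # outlier in the dependent variable

beta_all, mse_all, lev, stud = ols_diagnostics(X, y)
p = X.shape[1] + 1

# Rule-of-thumb cutoffs (assumptions, not the thresholds used in the study).
flagged_x = set(np.flatnonzero(lev > 2.0 * p / n).tolist())
flagged_y = set(np.flatnonzero(np.abs(stud) > 2.0).tolist())

# One simple treatment: delete the flagged observations and refit.
keep = np.ones(n, dtype=bool)
keep[sorted(flagged_x | flagged_y)] = False
beta_clean, mse_clean, _, _ = ols_diagnostics(X[keep], y[keep])

print("high-leverage observations:", sorted(flagged_x))
print("large-residual observations:", sorted(flagged_y))
print("MSE with all data:", mse_all)
print("MSE after deletion:", mse_clean)
```

Deletion is only one possible treatment; robust estimators such as those in the works cited below downweight suspect observations instead of removing them, and the same MSE comparison can be applied to them.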
References
- Shaker, Saleh Muayad (2009). "Improving the hippocampal M method in estimating multiple linear regression model parameters". Iraqi Journal of Statistical Science, No. 16, pp. 219-242.
- Neter, J. et al. (1990). "Applied Linear Statistical Models: Regression, Analysis of Variance, and Experimental Designs" (3rd edition). Irwin, Homewood, IL and Boston, MA.
- Rousseeuw, P.J. and Leroy, A.M. (1987). "Robust Regression and Outlier Detection". Wiley-Interscience, New York.
- Tukey, J.W. (1977). "Exploratory Data Analysis". Addison-Wesley, Reading, MA.
- Yohai, V.J. (1987). "High breakdown-point and high efficiency robust estimates for regression". The Annals of Statistics, 15, 642-656.
- Shaker, Saleh Muayad (2017). "Proposed robust methods for the median analysis of the linear regression model and their comparison with the ordinary least squares estimators using simulation". PhD thesis, University of Mosul, Mosul, Iraq.
- Al-Mutery, Abed Al-Aziz Mnahe (2010). "Methods for discovering anomalous and affecting observations on linear regression". King Saud University, Al-Reyad.
- Al-Saeg, Mumen Amer Hsan (2013). "The effect of outliers on the results of some statistical hypotheses". BSc thesis, University of Mosul, Mosul, Iraq.
- Yusef, Isaam Al-deen Yusef Abd Alla (2020). "The effect of outliers on the parameters of the multiple linear regression analysis model". PhD thesis, Al-Sudan University.
- Al-Youzbakey, K.T. and Sulaiman, A.M. (2020). "Hydrochemical Evaluation for Al-Sada Area Wells and their Suitability for Agricultural Usages". Journal of Umm Al-Qura University for Applied Science, Dams and Water Resources Researches Center, University of Mosul, Mosul, Iraq.
- Belsley, D. et al. (1980). "Regression Diagnostics: Identifying Influential Data and Sources of Collinearity". Wiley, New York, p. 105.
- Chatterjee, S. and Hadi, A.S. (1988). "Sensitivity Analysis in Linear Regression". John Wiley, New York.
- Fox, John (1997). "Applied Regression Analysis, Linear Models, and Related Methods". Sage Publications.