Journals Information
Mathematics and Statistics Vol. 11(4), pp. 738 - 745
DOI: 10.13189/ms.2023.110416
Reprint (PDF) (723Kb)
A New Procedure for Multiple Outliers Detection in Linear Regression
Ugah Tobias Ejiofor 1, Arum Kingsley Chinedu 1,*, Charity Uchenna Onwuamaeze 1, Everestus Okafor Ossai 1, Henrrietta Ebele Oranye 1, Nnaemeka Martin Eze 1, Mba Emmanuel Ikechukwu 1, Ifeoma Christy Mba 2, Comfort Njideka Ekene-Okafor 3, Asogwa Oluchukwu Chukwuemeka 4, Nkechi Grace Okoacha 5
1 Department of Statistics, Faculty of Physical Sciences, University of Nigeria, Nigeria
2 Department of Economics, Faculty of Social Sciences, University of Nigeria, Nigeria
3 Department of Computer Sciences/Mathematics, Faculty of Natural Sciences and Environmental Studies, Godfrey Okoye University, Nigeria
4 Department of Mathematics and Statistics, Alex Ekwueme Federal University Ndufu Alike, Nigeria
5 Basic Science Unit, School of Science and Technology, Pan-Atlantic University, Nigeria
ABSTRACT
In this paper, a simple asymptotic test statistic for identifying multiple outliers in linear regression is proposed. Sequential methods of multiple outliers detection test for the presence of a single outlier each time the procedure is applied. That is, the most severe or extreme outlying observation (the observation with the largest absolute internally studentized residual from the original fit of the mode to the entire observations) is tested first. If the test detects this observation as an outlier, then this observation is deleted, and the model is refitted to the remaining (reduced) observations. Then the observation with the next largest absolute internally studentized residual from the reduced sample is tested, and so on. This procedure of deleting observations and recomputing studentized residuals is continued until the null hypothesis of no outliers fails to be rejected. However, in this work our method or procedure entails calculating and uses only one set of internally studentized residuals obtained from fitting the model to the original data throughout the test exercise, and hence the procedure of deleting an observation, refitting the data to the remaining observations (reduced values) and recomputing the absolute internally studentized residuals at each stage of the test is avoided. The test statistic is incorporated into a technique (procedure) that entails a sequential application of a function of the internally studentized residuals. The procedure is a straightforward multistage method and is based on a result giving large sample properties of the internally studentized residuals. Approximate critical values of this test statistic are obtained based on approximations that depend on the application of the Bonferroni inequality since their exact values are not available. The new test statistic is very simple to compute, efficient and effective in large data sets, where more complex methods are difficult to apply because of their enormous computational demands or requirements. The results of the simulation study and numerical examples clearly show that the proposed test statistic is very successful in the identification of outlying observations.
KEYWORDS
Asymptotic, Bonferroni Upper Bound, Critical Values, Internally Studentized Residuals, Outlier, Robust, Regression Diagnostics, Test Statistic
Cite This Paper in IEEE or APA Citation Styles
(a). IEEE Format:
[1] Ugah Tobias Ejiofor , Arum Kingsley Chinedu , Charity Uchenna Onwuamaeze , Everestus Okafor Ossai , Henrrietta Ebele Oranye , Nnaemeka Martin Eze , Mba Emmanuel Ikechukwu , Ifeoma Christy Mba , Comfort Njideka Ekene-Okafor , Asogwa Oluchukwu Chukwuemeka , Nkechi Grace Okoacha , "A New Procedure for Multiple Outliers Detection in Linear Regression," Mathematics and Statistics, Vol. 11, No. 4, pp. 738 - 745, 2023. DOI: 10.13189/ms.2023.110416.
(b). APA Format:
Ugah Tobias Ejiofor , Arum Kingsley Chinedu , Charity Uchenna Onwuamaeze , Everestus Okafor Ossai , Henrrietta Ebele Oranye , Nnaemeka Martin Eze , Mba Emmanuel Ikechukwu , Ifeoma Christy Mba , Comfort Njideka Ekene-Okafor , Asogwa Oluchukwu Chukwuemeka , Nkechi Grace Okoacha (2023). A New Procedure for Multiple Outliers Detection in Linear Regression. Mathematics and Statistics, 11(4), 738 - 745. DOI: 10.13189/ms.2023.110416.