Mathematics and Statistics Vol. 8(2), pp. 201 - 210
DOI: 10.13189/ms.2020.080216
Reprint (PDF) (329Kb)


Improved Frequency Table with Application to Environmental Data


Mohammed M. B. 1,2,*, Adam M. B. 2,3, Zulkafli H. S. 2, Ali N. 2
1 Mathematics and Computer Science, Federal University of Kashere, Nigeria
2 Department of Mathematics, Universiti Putra Malaysia, Malaysia
3 Institute for Mathematical Research, Universiti Putra Malaysia, Malaysia

ABSTRACT

This paper proposes three different statistics to be used to represent the magnitude observations in each class when estimating the statistical measures from the frequency table for continuous data. The existing frequency tables use the midpoint as the magnitude of observations in each class, which results in an error called grouping error. Using the midpoint is due to the assumption that the observations in each class are uniformly distributed and concentrated around their midpoint, which is not always valid. In this research, construction of the frequency tables using the three proposed statistics, the arithmetic mean, median, and midrange and midpoint are respectively named, Method 1, Method 2, Method 3, and the Existing method. The four methods are compared using root-mean-squared error (RMSE) by performing simulation studies using three distributions, normal, uniform, exponential distributions. The simulation results are validated using real data, Glasgow weather data. The findings indicated that using the arithmetic mean to represent the magnitude of observations in each class of the frequency table leads to minimal error relative to other statistics. It is followed by using the median, for data simulated from normal and exponential distributions, and using midrange for data simulated from uniform distribution. Meanwhile, in choosing the appropriate number of classes used in constructing the frequency tables, among seven different rules used, the freedman and Diaconis rule is the recommended rule.

KEYWORDS
Frequency Table, Statistical Measures, Midpoint, Number of Classes

Cite This Paper in IEEE or APA Citation Styles
(a). IEEE Format:
[1] Mohammed M. B. , Adam M. B. , Zulkafli H. S. , Ali N. , "Improved Frequency Table with Application to Environmental Data," Mathematics and Statistics, Vol. 8, No. 2, pp. 201 - 210, 2020. DOI: 10.13189/ms.2020.080216.

(b). APA Format:
Mohammed M. B. , Adam M. B. , Zulkafli H. S. , Ali N. (2020). Improved Frequency Table with Application to Environmental Data. Mathematics and Statistics, 8(2), 201 - 210. DOI: 10.13189/ms.2020.080216.