

POSTGRADUATE CORNER 

Year : 2011  Volume : 2  Issue : 2  Page : 140-142


Measures of central tendency: The mean
S Manikandan
Assistant Editor, Journal of Pharmacology and Pharmacotherapeutics, India
Date of Web Publication: 6-Jun-2011
Correspondence Address: S Manikandan, Department of Pharmacology, Indira Gandhi Medical College and Research Institute, Kadirkamam, Puducherry, India
Source of Support: None, Conflict of Interest: None
DOI: 10.4103/0976-500X.81920
How to cite this article: Manikandan S. Measures of central tendency: The mean. J Pharmacol Pharmacother 2011;2:140-2
Research typically generates large amounts of data, which must be summarised before they can be described meaningfully. The bulk of the data can be reduced by organising it into a frequency table or histogram. ^{[1]} A frequency distribution organises the mass of data into a few meaningful categories. Collected data can also be summarised as a single index or value that represents the entire data set. Such measures also help in comparing data sets.
Central Tendency   
Central tendency is defined as "the statistical measure that identifies a single value as representative of an entire distribution." ^{[2]} It aims to provide an accurate description of the entire data. It is the single value that is most typical/representative of the collected data. The term "number crunching" is used to illustrate this aspect of data description. The mean, median and mode are the three commonly used measures of central tendency.
Mean   
Mean is the most commonly used measure of central tendency. There are different types of mean, viz. arithmetic mean, weighted mean, geometric mean (GM) and harmonic mean (HM). If mentioned without an adjective (as mean), it generally refers to the arithmetic mean.
Arithmetic mean
Arithmetic mean (or, simply, "mean") is the familiar average. It is computed by adding all the values in the data set and dividing the sum by the number of observations. If we have the raw data, the mean is given by the formula

\bar{X} = \frac{\sum X}{n}

where Σ (the uppercase Greek letter sigma) refers to summation, X refers to the individual value and n is the number of observations in the sample (sample size). Research articles published in journals usually do not provide raw data; in such a situation, readers can compute the mean from the frequency distribution (if provided):

\bar{X} = \frac{\sum fX}{n}

where f is the frequency, X is the midpoint of the class interval and n is the number of observations. ^{[3]} The standard statistical notations (in relation to measures of central tendency) are mentioned in [Table 1]. Readers are cautioned that the mean calculated from the frequency distribution is not exactly the same as that calculated from the raw data. It approaches the mean calculated from the raw data as the number of intervals increases. ^{[4]}
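As a minimal sketch of the two calculations described above, the Python snippet below computes the mean of a small set of patient ages, first from the raw values and then from the same values grouped into class intervals. The data, the class limits and the variable names are illustrative assumptions and are not taken from any study.

```python
# Hypothetical raw data: ages (years) of 10 patients.
ages = [22, 25, 27, 31, 34, 36, 41, 44, 48, 52]

# Mean from raw data: sum of the values divided by the number of observations.
raw_mean = sum(ages) / len(ages)

# The same data grouped into class intervals of width 10.
# Each entry is (lower limit, upper limit, frequency); lower limit is inclusive.
classes = [(20, 30, 3), (30, 40, 3), (40, 50, 3), (50, 60, 1)]

# n is the total number of observations (sum of the frequencies).
n = sum(f for _, _, f in classes)

# Mean from the frequency distribution: sum of (frequency x midpoint) / n.
grouped_mean = sum(f * (lo + hi) / 2 for lo, hi, f in classes) / n

print(raw_mean)      # 36.0
print(grouped_mean)  # 37.0
```

The two results differ (36.0 versus 37.0) because grouping replaces every observation by the midpoint of its class; narrower classes would bring the grouped mean closer to the raw mean.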
Advantages   
The mean uses every value in the data and hence is a good representative of the data. Ironically, the mean itself often does not appear as a value in the raw data.
Repeated samples drawn from the same population tend to have similar means. The mean is therefore the measure of central tendency that best resists the fluctuation between different samples. ^{[6]}
It is closely related to standard deviation, the most common measure of dispersion.
Disadvantages   
The important disadvantage of the mean is that it is sensitive to extreme values/outliers, especially when the sample size is small. ^{[7]} Therefore, it is not an appropriate measure of central tendency for skewed distributions. ^{[8]}
The mean cannot be calculated for nominal or non-numerical ordinal data. Even though a mean can be calculated for numerically coded ordinal data, it often does not give a meaningful value, e.g. stage of cancer.
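The sensitivity of the mean to an extreme value in a small sample can be illustrated with a short Python sketch. The lengths of stay below are invented for illustration only.

```python
# Hypothetical lengths of hospital stay (days) for five patients.
stays = [3, 4, 4, 5, 6]

# The same sample with the last observation replaced by an extreme value.
stays_with_outlier = [3, 4, 4, 5, 60]

mean_stay = sum(stays) / len(stays)
mean_with_outlier = sum(stays_with_outlier) / len(stays_with_outlier)

print(mean_stay)          # 4.4
print(mean_with_outlier)  # 15.2
```

A single outlier moves the mean from 4.4 to 15.2 days, a value larger than four of the five observations, so it no longer describes a typical patient in this sample.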
Weighted mean
Weighted mean is calculated when certain values in a data set are more important than the others. ^{[9]} A weight w _{i} is attached to each of the values x _{i} to reflect this importance, and the weighted mean is the sum of the weighted values divided by the sum of the weights:

\bar{x}_{w} = \frac{\sum w_{i} x_{i}}{\sum w_{i}}

For example, when the weighted mean is used to represent the average duration of stay by a patient in a hospital, the total number of cases presenting to each ward is taken as the weight.
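The hospital-stay example can be sketched in Python as follows. The ward figures and variable names are hypothetical; each ward's mean stay is weighted by its number of cases.

```python
# Hypothetical wards: (mean duration of stay in days, number of cases).
wards = [(4.0, 120), (7.5, 40), (10.0, 40)]

# Weighted mean: sum of (weight x value) divided by the sum of the weights.
total_cases = sum(cases for _, cases in wards)
weighted_mean = sum(stay * cases for stay, cases in wards) / total_cases

# Unweighted mean of the three ward means, for comparison.
unweighted_mean = sum(stay for stay, _ in wards) / len(wards)

print(weighted_mean)              # 5.9
print(round(unweighted_mean, 2))  # 7.17
```

The weighted mean (5.9 days) is pulled towards the busiest ward, whereas the unweighted mean (about 7.17 days) treats all wards as equally important regardless of how many patients they admit.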
Geometric Mean
It is defined as the arithmetic mean of the values taken on a log scale, converted back to the original scale by taking the antilog. Equivalently, it is the n ^{th} root of the product of the n observations:

GM = \sqrt[n]{x_{1} x_{2} \cdots x_{n}} = \exp\left( \frac{1}{n} \sum_{i=1}^{n} \ln x_{i} \right)

GM is an appropriate measure when values change exponentially and for skewed distributions that can be made symmetrical by a log transformation. GM is more commonly used in microbiological and serological research. One important disadvantage of GM is that it cannot be used if any of the values are zero or negative.
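The following Python sketch shows both ways of obtaining the GM (via logs and via the n ^{th} root of the product) on a made-up series of doubling antibody titres. The helper name `geometric_mean` and the data are assumptions for illustration; the helper rejects zero or negative values, for which the logarithm is undefined.

```python
import math


def geometric_mean(values):
    """Geometric mean computed as exp(arithmetic mean of natural logs)."""
    if any(v <= 0 for v in values):
        raise ValueError("geometric mean needs strictly positive values")
    return math.exp(sum(math.log(v) for v in values) / len(values))


# Hypothetical antibody titres that rise in doubling steps.
titres = [10, 20, 40, 80, 160]

gm_from_logs = geometric_mean(titres)
# Equivalent definition: n-th root of the product of the n observations.
gm_from_root = math.prod(titres) ** (1 / len(titres))
arithmetic = sum(titres) / len(titres)

print(round(gm_from_logs, 6))  # 40.0
print(round(gm_from_root, 6))  # 40.0
print(arithmetic)              # 62.0
```

For these exponentially increasing values the GM (40) coincides with the middle titre, while the arithmetic mean (62) is pulled upwards by the largest values.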
Harmonic mean
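It is the reciprocal of the arithmetic mean of the reciprocals of the observations:

HM = \frac{n}{\sum_{i=1}^{n} \frac{1}{x_{i}}}

Equivalently, the reciprocal of HM is the mean of the reciprocals of the individual observations:

\frac{1}{HM} = \frac{1}{n} \sum_{i=1}^{n} \frac{1}{x_{i}}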
HM is appropriate in situations where the reciprocals of values are more useful. HM is used when we want to determine the average sample size of a number of groups, each of which has a different sample size.
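A short Python sketch of the HM applied to unequal group sizes is given below. The group sizes are hypothetical; the result is cross-checked against `statistics.harmonic_mean` from the Python standard library.

```python
from statistics import harmonic_mean

# Hypothetical sample sizes of three study groups.
group_sizes = [10, 20, 40]

# Mean of the reciprocals of the observations.
mean_of_reciprocals = sum(1 / n for n in group_sizes) / len(group_sizes)

# HM is the reciprocal of that mean.
hm = 1 / mean_of_reciprocals
am = sum(group_sizes) / len(group_sizes)

print(round(hm, 2))                          # 17.14
print(round(harmonic_mean(group_sizes), 2))  # 17.14
print(round(am, 2))                          # 23.33
```

The HM (about 17.1) is smaller than the arithmetic mean (about 23.3) because it gives more influence to the smallest group.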
Degree of Variation between the Means   
If all the values in a data set are the same, then all three means (arithmetic mean, GM and HM) are identical. As the variability in the data increases, the differences among these means also increase. For positive values, the arithmetic mean is never smaller than the GM, which in turn is never smaller than the HM; the three are equal only when every value is the same. ^{[5]}
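The relationship among the three means can be checked numerically with the Python standard library (`statistics.geometric_mean` needs Python 3.8 or later). The two data sets and the helper name `three_means` are illustrative assumptions.

```python
from statistics import geometric_mean, harmonic_mean, mean


def three_means(values):
    """Return (arithmetic mean, geometric mean, harmonic mean) as floats."""
    return float(mean(values)), geometric_mean(values), harmonic_mean(values)


# Hypothetical data: one set with no variability, one with variability.
identical = [5, 5, 5, 5]
variable = [2, 4, 8]

am_same, gm_same, hm_same = three_means(identical)
am_var, gm_var, hm_var = three_means(variable)

# With identical values the three means coincide (up to rounding error).
print(round(am_same, 3), round(gm_same, 3), round(hm_same, 3))

# With variable positive values: arithmetic mean > GM > HM.
print(round(am_var, 3), round(gm_var, 3), round(hm_var, 3))
```

For the variable data set the three means are approximately 4.667, 4.0 and 3.429, in that order, whereas for the identical values all three equal 5.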
The other measures of central tendency (median and mode) and the guidelines for selecting the appropriate measure of central tendency will be dealt with in the subsequent issue.
References   
1.  Manikandan S. Frequency distribution. J Pharmacol Pharmacother 2011;2:54-6.
2.  Gravetter FJ, Wallnau LB. Statistics for the behavioral sciences. 5 ^{th} ed. Belmont: Wadsworth - Thomson Learning; 2000.
3.  Sundar Rao PS, Richard J. Introduction to biostatistics and research methods. 4 ^{th} ed. New Delhi, India: Prentice Hall of India Pvt Ltd; 2006.
4.  Sundaram KR, Dwivedi SN, Sreenivas V. Medical statistics: Principles and methods. 1 ^{st} ed. New Delhi, India: B.I Publications Pvt Ltd; 2010.
5.  Norman GR, Streiner DL. Biostatistics: The bare essentials. 2 ^{nd} ed. Hamilton: B.C. Decker Inc; 2000.
6.  Glaser AN. High yield biostatistics. 1 ^{st} ed. New Delhi, India: Lippincott Williams and Wilkins; 2000.
7.  Dawson B, Trapp RG. Basic and clinical biostatistics. 4 ^{th} ed. New York: McGraw Hill; 2004.
8.  Swinscow TD, Campbell MJ. Statistics at square one. 10 ^{th} ed. New Delhi, India: Viva Books Private Limited; 2003.
9.  Petrie A, Sabin C. Medical statistics at a glance. 3 ^{rd} ed. Oxford: Wiley-Blackwell; 2009.
[Table 1]



