Descriptive Statistics Calculator (2024)

Calculator Use

What are Descriptive Statistics?

Descriptive statistics summarize certain aspects of a data set or a population using numeric calculations. Examples of descriptive statistics include:

  • mean, average
  • midrange
  • standard deviation
  • quartiles

This calculator generates descriptive statistics for a data set. Enter data values separated by commas or spaces. You can also copy and paste data from spreadsheets or text documents. See allowable data formats in the table below.

Descriptive Statistics Formulas and Calculations

This calculator uses the formulas and methods below to find the statistical values listed.

Minimum

Ordering a data set x1 ≤ x2 ≤ x3 ≤ ... ≤ xn from lowest to highest value, the minimum is the smallest value x1.

\[ \text{Min} = x_1 = \text{min}(x_i)_{i=1}^{n} \]

Maximum

Ordering a data set x1 ≤ x2 ≤ x3 ≤ ... ≤ xn from lowest to highest value, the maximum is the largest value xn.

\[ \text{Max} = x_n = \text{max}(x_i)_{i=1}^{n} \]

Range

The range of a data set is the difference between the minimum and maximum.

\[ \text{Range} = x_n - x_1 \]

Sum

The sum is the total of all data values x1 + x2 + x3 + ... + xn

\[ \text{Sum} = \sum_{i=1}^{n}x_i \]

Size, Count

Size or count is the number of data points in a data set.

\[ \text{Size} = n = \text{count}(x_i)_{i=1}^{n} \]

Mean

The mean of a data set is the sum of all of the data divided by the size. The mean is also known as the average.

For a Population

\[ \mu = \dfrac{\sum_{i=1}^{n}x_i}{n}\]

For a Sample

\[ \overline{x} = \dfrac{\sum_{i=1}^{n}x_i}{n}\]

Median

Ordering a data set x1 ≤ x2 ≤ x3 ≤ ... ≤ xn from lowest to highest value, the median is the numeric value separating the upper half of the ordered sample data from the lower half. If n is odd the median is the center value. If n is even the median is the average of the 2 center values.

If n is odd the median is the value at position p where

\[ p = \dfrac{n + 1}{2} \] \[ \widetilde{x} = x_p \]

If n is even the median is the average of the values at positions p and p + 1 where

\[ p = \dfrac{n}{2} \] \[ \widetilde{x} = \dfrac{x_{p} + x_{p+1}}{2} \]

Mode

The mode is the value or values that occur most frequently in the data set. A data set can have more than one mode, and it can also have no mode.

Standard Deviation

Standard deviation is a measure of dispersion of data values from the mean. The formula for standard deviation is the square root of the sum of squared differences from the mean divided by the size of the data set.

For a Population

\[ \sigma = \sqrt{\dfrac{\sum_{i=1}^{n}(x_i - \mu)^{2}}{n}} \]

For a Sample

\[ s = \sqrt{\dfrac{\sum_{i=1}^{n}(x_i - \overline{x})^{2}}{n - 1}} \]

Variance

Variance measures dispersion of data from the mean. The formula for variance is the sum of squared differences from the mean divided by the size of the data set.

For a Population

\[ \sigma^{2} = \dfrac{\sum_{i=1}^{n}(x_i - \mu)^{2}}{n} \]

For a Sample

\[ s^{2} = \dfrac{\sum_{i=1}^{n}(x_i - \overline{x})^{2}}{n - 1} \]

Midrange

The midrange of a data set is the average of the minimum and maximum values.

\[ \text{MR} = \dfrac{x_{min} + x_{max}}{2} \]

Quartiles

Quartiles separate a data set into four sections. The median is the second quartile Q2. It divides the ordered data set into higher and lower halves. The first quartile, Q1, is the median of the lower half not including Q2. The third quartile, Q3, is the median of the higher half not including Q2. This is one of several methods for calculating quartiles.[1]

Interquartile Range

The range from Q1 to Q3 is the interquartile range (IQR).

\[ IQR = Q_3 - Q_1 \]

Outliers

Potential outliers are values that lie above the Upper Fence or below the Lower Fence of the sample set.

\[ \text{Upper Fence} = Q_3 + 1.5 \times IQR \] \[ \text{Lower Fence} = Q_1 - 1.5 \times IQR \]

Sum of Squares

The sum of squares is the sum of the squared differences between data values and the mean.

For a Population

\[ SS = \sum_{i=1}^{n}(x_i - \mu)^{2} \]

For a Sample

\[ SS = \sum_{i=1}^{n}(x_i - \overline{x})^{2} \]

Mean Absolute Deviation

Mean absolute deviation[2] is the sum of the absolute value of the differences between data values and the mean, divided by the sample size.

For a Population

\[ MAD = \dfrac{\sum_{i=1}^{n}|x_i - \mu|}{n} \]

For a Sample

\[ MAD = \dfrac{\sum_{i=1}^{n}|x_i - \overline{x}|}{n} \]

Root Mean Square

The root mean square describes the magnitude of a set of numbers. The formula for root mean square is the square root of the sum of the squared data values divided by n.

\[ RMS = \sqrt{\dfrac{\sum_{i=1}^{n}x_i^{2}}{n}} \]

Standard Error of the Mean

Standard error of the mean is calculated as the standard deviation divided by the square root of the count n.

For a Population

\[ {SE}_{\mu} = \dfrac{\sigma}{\sqrt{n}} \]

For a Sample

\[ {SE}_{\overline{x}} = \dfrac{s}{\sqrt{n}} \]

Skewness

Skewness[3] describes how far to the left or right a data set distribution is distorted from a symmetrical bell curve. A distribution with a long left tail is left-skewed, or negatively-skewed. A distribution with a long right tail is right-skewed, or positively-skewed.

For a Population

\[ \gamma_{1} = \dfrac{\sum_{i=1}^{n}(x_i - \mu)^{3}}{n\sigma^{3}} \]

For a Sample

\[ \gamma_{1} = \dfrac{n}{(n-1)(n-2)} \sum_{i=1}^{n} \left(\dfrac{x_i - \overline{x}}{s}\right)^{3} \]

Kurtosis

Kurtosis[3] describes the extremeness of the tails of a population distribution and is an indicator of data outliers. High kurtosis means that a data set has tail data that is more extreme than a normal distribution. Low kurtosis means the tail data is less extreme than a normal distribution.

For a Population

\[ \beta_{2} = \dfrac{\sum_{i=1}^{n}(x_i - \mu)^{4}}{n\sigma^{4}} \]

For a Sample

\[ \beta_{2} = \dfrac{n(n+1)}{(n-1)(n-2)(n-3)} \sum_{i=1}^{n} \left(\dfrac{x_i - \overline{x}}{s}\right)^{4} \]

Kurtosis Excess

Excess kurtosis describes the height of the tails of a distribution rather than the extremity of the length of the tails. Excess kurtosis means that the distribution has a high frequency of data outliers.

For a Population

\[ \alpha_{4} = \dfrac{\sum_{i=1}^{n}(x_i - \mu)^{4}}{n\sigma^{4}} - 3 \]

For a Sample (This is just Kurtosis in MS Excel and Google Sheets)

\[ \alpha_{4} = \dfrac{n(n+1)}{(n-1)(n-2)(n-3)} \sum_{i=1}^{n} \left(\dfrac{x_i - \overline{x}}{s}\right)^{4} - \dfrac{3(n-1)^{2}}{(n-2)(n-3)} \]

Coefficient of Variation

The coefficient of variation describes dispersion of data around the mean. It is the ratio of the standard deviation to the mean. The coefficient of variation is calculated as the standard deviation divided by the mean.

For a Population

\[ CV = \dfrac{\sigma}{\mu} \]

For a Sample

\[ CV = \dfrac{s}{\overline{x}} \]

Relative Standard Deviation

Relative standard deviation describes the variance of a subset of data from the mean. It is expressed as a percentage. Relative standard deviation is calculated as the standard deviation times 100 divided by the mean.

For a Population

\[ RSD = \left[ \dfrac{100 \times \sigma}{\mu} \right] \% \]

For a Sample

\[ RSD = \left[ \dfrac{100 \times s}{\overline{x}} \right] \% \]

Frequency

Frequency is the number of occurrences for each data value in the data set. Frequency is used to find the mode of a data set.

Acceptable Data Formats

Type
Unit

Your Format Input
Options

Actual Input Processed

Column (New Lines)

42
54
65
47
59
40
53

42, 54, 65, 47, 59, 40, 53

Comma Separated (CSV)

42,
54,
65,
47,
59,
40,
53,

or

42, 54, 65, 47, 59, 40, 53

42, 54, 65, 47, 59, 40, 53

Spaces

42 54
65 47
59 40
53

or

42 54 65 47 59 40 53

42, 54, 65, 47, 59, 40, 53

Mixed Delimiters

42
54 65,,, 47,,59,
40 53

42, 54, 65, 47, 59, 40, 53

References

[1] Wikipedia contributors. "Quartile." Wikipedia, The Free Encyclopedia. Last visited 28 May, 2020.

[2] Weisstein, Eric W. "Mean Deviation." From MathWorld--A Wolfram Web Resource. Mean Deviation. Last visited 28 May, 2020.

[3] Information Technology Lab, National Institute of Standards and Technology. Section 1.3.5.11 Measures of Skewness and Kurtosis. From the Engineering Statistics Handbook. Last visited 28 May, 2020.

Descriptive Statistics Calculator (2024)

FAQs

Can Excel calculate descriptive statistics? ›

To calculate descriptive statistics for a column of data, click on the Data ribbon. Click on Data Analysis in the Analysis section. Select Descriptive Statistics, then click OK. Click on the Input Range selection button, then select the range of cells for the column.

How to write descriptive statistics? ›

Generally, when writing descriptive statistics, you want to present at least one form of central tendency (or average), that is, either the mean, median, or mode. In addition, you should present one form of variability, usually the standard deviation.

How do you calculate percentages in descriptive statistics? ›

Percentage is calculated by taking the frequency in the category divided by the total number of participants and multiplying by 100%. To calculate the percentage of males in Table 3, take the frequency for males (80) divided by the total number in the sample (200). Then take this number times 100%, resulting in 40%.

What is an example of a descriptive statistic? ›

Descriptive statistics examples in a research study include the mean, median, and mode. Studies also frequently cite measures of dispersion including the standard deviation, variance, and range. These values describe a data set just as it is, so it is called descriptive statistics.

What is the formula for sample size in descriptive statistics? ›

Number to enroll * %retained = desired sample size, n • Number to enroll = desired sample size / %retained; • 954 /0.80 =1193 for treatment arm ; 477 /0.80 = 597 for control arm • Total n = 1,790 • Can round up to 1200 and 600 in treatment and control, respectively.

Can Excel be used to calculate statistics? ›

Microsoft Excel is a powerful statistical analysis tool that allows you to quickly and accurately calculate innumerable statistical measures. This lesson reviewed standard deviation, variance, kurtosis, standard error, skewness and confidence.

Is Excel good for statistical analysis? ›

Excel is the best tool for storing data. Many statistical programs such as R, SPSS, SAS and Minitab can easily export data into those programs. This makes it much easier to store data for analyses. That being said, many statisticians, including me, would not use Excel for statistical analyses.

How to interpret descriptive statistics? ›

Interpret the key results for Display Descriptive Statistics
  1. Step 1: Describe the size of your sample.
  2. Step 2: Describe the center of your data.
  3. Step 3: Describe the spread of your data.
  4. Step 4: Assess the shape and spread of your data distribution.
  5. Compare data from different groups.

What is the best way to display descriptive statistics? ›

There are several ways of presenting descriptive statistics in your paper. These include graphs, central tendency, dispersion and measures of association tables. Graphs: Quantitative data can be graphically represented in histograms, pie charts, scatter plots, line graphs, sociograms and geographic information systems.

What is a table of descriptive statistics? ›

Creates a table of descriptive statistics (i.e., the mean, N, number of missing values, standard deviation, range, sum, minimum, 25th percentile, median, 75th percentile, and maximum).

What is the main purpose of descriptive statistics? ›

The purpose of a descriptive statistic is to summarize data. Descriptive stats only make statements about the set of data from which they were calculated; they never go beyond the data you have.

What is the formula for descriptive statistics? ›

This is calculated by summing all of the data values and dividing by the total number of data items you have. It is normally called the mean or the average. If you have a data consisting of n observations (x1,...,xn) ( x 1 , . . . , x n ) then the mean (¯x) is given by the formula: ¯x=1nn∑i=1 xi.

What is the standard score in descriptive statistics? ›

Standard score, also known as a z-score, is computed by subtracting the population mean of the individual raw score and dividing the remainder by the standard deviation resulting in scores that have a mean of 50 and a standard deviation of 10.

How do you calculate the sample proportion in descriptive statistics? ›

The sample proportion P is given by P=X/N, where X denotes the number of successes and N denotes the size of the sample in question. This sample proportion reflects that particular sample, and other samples of the population may result in different sample proportions.

Top Articles
Latest Posts
Article information

Author: Jeremiah Abshire

Last Updated:

Views: 5826

Rating: 4.3 / 5 (54 voted)

Reviews: 85% of readers found this page helpful

Author information

Name: Jeremiah Abshire

Birthday: 1993-09-14

Address: Apt. 425 92748 Jannie Centers, Port Nikitaville, VT 82110

Phone: +8096210939894

Job: Lead Healthcare Manager

Hobby: Watching movies, Watching movies, Knapping, LARPing, Coffee roasting, Lacemaking, Gaming

Introduction: My name is Jeremiah Abshire, I am a outstanding, kind, clever, hilarious, curious, hilarious, outstanding person who loves writing and wants to share my knowledge and understanding with you.