EasyUnitConverter.com

Five Number Summary Calculator

Calculate the five-number summary (minimum, Q1, median, Q3, maximum) from your dataset. The five-number summary provides a concise description of the distribution's center, spread, and range, and forms the basis for box plot construction. See also our Interquartile Range Calculator, Box Plot Calculator, Median Calculator, and Percentile Calculator.

How to Use the Five Number Summary Calculator

The five-number summary is a set of five descriptive statistics that provides a complete picture of a dataset's distribution. It consists of the minimum, first quartile (Q1), median (Q2), third quartile (Q3), and maximum. Together, these five values tell you about the center (median), spread (IQR and range), and shape (symmetry or skewness) of your data. The five-number summary is the foundation for constructing box plots, one of the most useful data visualization tools in statistics.

To use this calculator, simply enter your data values separated by commas. The calculator sorts the data and computes all five summary statistics along with additional measures including the IQR, range, outlier fences, mean, and standard deviation. You can enter any number of values (minimum 4 required for meaningful quartile calculation). The calculator uses linear interpolation for quartile computation, which is the most common method used by statistical software.

The five-number summary is particularly valuable for comparing distributions, identifying skewness, and detecting outliers. If the median is closer to Q1 than Q3, the data is right-skewed. If the median is closer to Q3, the data is left-skewed. If the distances are roughly equal, the data is approximately symmetric. The summary also reveals the spread at different parts of the distribution — a large gap between Q3 and Max compared to Min and Q1 suggests a long right tail.

Five Number Summary Formulas

Five-Number Summary:

{Min, Q1, Median, Q3, Max}

Quartile Positions (interpolation method):

Q1: position = 0.25 × (n - 1)

Q2: position = 0.50 × (n - 1)

Q3: position = 0.75 × (n - 1)

Interpolation:

value = x[floor(pos)] + frac(pos) × (x[floor(pos)+1] - x[floor(pos)])

Additional Measures:

IQR = Q3 - Q1

Range = Max - Min

Mid-hinge = (Q1 + Q3) / 2

Tri-mean = (Q1 + 2×Median + Q3) / 4

Example Calculation

Calculate the five-number summary for: 5, 7, 8, 12, 13, 14, 18, 21, 23, 27, 30

Sorted data: 5, 7, 8, 12, 13, 14, 18, 21, 23, 27, 30

n = 11

Minimum = 5

Q1: position = 0.25 × 10 = 2.5 → Q1 = 8 + 0.5×(12-8) = 10

Median: position = 0.50 × 10 = 5 → Median = 14

Q3: position = 0.75 × 10 = 7.5 → Q3 = 21 + 0.5×(23-21) = 22

Maximum = 30

Five-Number Summary: 5, 10, 14, 22, 30

IQR = 22 - 10 = 12

Range = 30 - 5 = 25

Lower Fence = 10 - 1.5×12 = -8 (no lower outliers)

Upper Fence = 22 + 1.5×12 = 40 (no upper outliers)

Summary Statistics Reference

StatisticFormulaUse
Minimummin(x)Smallest observation
Q1 (25th %ile)Median of lower halfLower quartile boundary
Median (Q2)Middle valueCentral tendency (robust)
Q3 (75th %ile)Median of upper halfUpper quartile boundary
Maximummax(x)Largest observation
IQRQ3 - Q1Spread of middle 50%
RangeMax - MinTotal spread
Mid-hinge(Q1 + Q3) / 2Center of middle 50%

Real-World Applications

Exploratory Data Analysis

The five-number summary (min, Q1, median, Q3, max) provides a complete picture of data distribution in just five values. Statistician John Tukey introduced it as part of EDA to quickly assess center, spread, symmetry, and potential outliers without assumptions about the distribution shape.

Standardized Test Reporting

College Board reports SAT score distributions using five-number summaries. For example: Min=400, Q1=900, Median=1060, Q3=1220, Max=1600. This tells students where they stand relative to all test-takers and whether the score distribution is symmetric or skewed.

Environmental Monitoring

Air quality reports use five-number summaries to describe pollutant concentrations over a period. The summary immediately shows whether typical days (median) are safe even if some extreme days (max) exceed thresholds, and how much day-to-day variability exists (IQR).

Additional Solved Examples

Example: Daily Sales Revenue

A shop records daily revenue ($) for 14 days: 320, 450, 280, 510, 390, 620, 440, 370, 480, 550, 410, 340, 690, 460. Compute the five-number summary.

Sorted: 280, 320, 340, 370, 390, 410, 440, 450, 460, 480, 510, 550, 620, 690

n = 14

Min = 280

Q1 = (370 + 390)/2 = 380 (average of 4th and 5th values in lower half)

Median = (440 + 450)/2 = 445 (average of 7th and 8th)

Q3 = (510 + 550)/2 = 530

Max = 690

Answer: Five-number summary: (280, 380, 445, 530, 690). The IQR is 530-380 = 150. Since Q3-Median (85) > Median-Q1 (65), the upper half is slightly more spread out, suggesting mild right-skew. The shop typically earns between $380 and $530 per day.

Example: Detecting Skewness from the Summary

Dataset A: (10, 30, 50, 70, 90). Dataset B: (10, 20, 30, 50, 150). Compare their distributions using five-number summaries.

A: Min=10, Q1=30, Med=50, Q3=70, Max=90

B: Min=10, Q1=20, Med=30, Q3=50, Max=150

A: Q1-Min=20, Med-Q1=20, Q3-Med=20, Max-Q3=20 (symmetric)

B: Q1-Min=10, Med-Q1=10, Q3-Med=20, Max-Q3=100 (right-skewed)

Answer: Dataset A is perfectly symmetric (equal spacing between all five numbers). Dataset B is strongly right-skewed: the upper whisker (Max-Q3 = 100) is much longer than the lower whisker (Q1-Min = 10). The five-number summary reveals skewness at a glance.

Common Mistakes

Forgetting to sort data before calculating

The five-number summary requires sorted data. Picking the "middle value" of unsorted data is a common error. Always sort from smallest to largest before identifying min, Q1, median, Q3, and max.

Including the median in both halves when finding Q1 and Q3

For odd n, the median should typically be excluded from both halves when calculating Q1 and Q3 (exclusive method). Including it in both halves (inclusive method) gives slightly different results. Be aware of which convention your software uses.

Confusing the five-number summary with a box plot

A box plot visually represents the five-number summary but also identifies outliers. The whiskers on a box plot extend to the most extreme non-outlier values, not necessarily min and max. The five-number summary always uses the actual min and max.

Key Takeaways

  • The five-number summary (min, Q1, median, Q3, max) provides a complete non-parametric description of data distribution.
  • It reveals center (median), spread (IQR = Q3-Q1), range (max-min), and symmetry (comparing distances between values).
  • It is robust to outliers in the sense that Q1 and Q3 are not affected by extreme values beyond the quartiles.
  • Symmetric data: Q1-Min is approximately equal to Q3-Max, and Median-Q1 is approximately equal to Q3-Median.
  • The five-number summary forms the basis for box-and-whisker plots, one of the most useful visualizations in statistics.

Frequently Asked Questions

What does the five-number summary tell you about distribution shape?

The five-number summary reveals skewness through the relative spacing of its components. If (Median - Q1) ≈ (Q3 - Median), the distribution is roughly symmetric. If (Q3 - Median) > (Median - Q1), it's right-skewed (positive skew). If (Median - Q1) > (Q3 - Median), it's left-skewed (negative skew). Similarly, comparing (Q1 - Min) with (Max - Q3) reveals tail behavior.

How is the five-number summary different from descriptive statistics?

Traditional descriptive statistics (mean, standard deviation) assume roughly symmetric data and are sensitive to outliers. The five-number summary uses order statistics (ranks) rather than arithmetic operations, making it robust to outliers and appropriate for any distribution shape. It provides a non-parametric description that doesn't assume normality. For symmetric data without outliers, both approaches give similar information.

Why might different calculators give different quartile values?

There are at least 9 different methods for computing quartiles, and they can give different results for small datasets. The main differences are: (1) Whether to include or exclude the median when splitting data into halves, (2) How to interpolate between adjacent values, (3) Whether to use inclusive or exclusive percentile definitions. For large datasets (n > 30), all methods converge to essentially the same values.

What is the minimum number of data points needed?

Technically, you need at least 5 data points for a meaningful five-number summary (one for each statistic). However, with very few data points, the quartiles may not be very informative. For reliable quartile estimates, at least 20-30 data points are recommended. With fewer than 10 points, the five-number summary should be interpreted cautiously as the quartile estimates have high uncertainty.

How do I use the five-number summary to draw a box plot?

To draw a box plot: (1) Draw a box from Q1 to Q3, (2) Draw a line inside the box at the median, (3) Calculate fences: lower = Q1 - 1.5×IQR, upper = Q3 + 1.5×IQR, (4) Draw whiskers from the box to the most extreme data points within the fences, (5) Plot any points beyond the fences as individual dots (outliers). The box represents the middle 50% of data, and the whiskers show the range of non-outlier values.

Can the five-number summary be used for comparing groups?

Yes, comparing five-number summaries (or their visual representation as side-by-side box plots) is an excellent way to compare distributions across groups. You can quickly see differences in center (median), spread (IQR), range, symmetry, and outlier patterns. This is often more informative than comparing just means and standard deviations, especially when distributions are non-normal or have different shapes.