What is a Box and Whisker Plot?
Also called: box plot, box and whisker diagram
A box and whisker plot is a graphical method of displaying variation in a set of data. In most cases, a histogram provides a sufficient display, but a box and whisker plot can provide additional detail while allowing multiple sets of data to be displayed in the same graph. A variant that plots unusually extreme values as separate points is called a box and whisker plot with outliers.
Why Use a Box and Whisker Plot?
Box and whisker plots are compact and easy to read: each data set is reduced to a handful of summary statistics, so data from multiple sources can be displayed side by side in a single graph. This makes it easier to compare data from different categories and supports more effective decision-making.
When to Use a Box and Whisker Plot
Use box and whisker plots when you have multiple data sets from independent sources that are related to each other in some way. Examples include:
- Test scores between schools or classrooms
- Data from before and after a process change
- Similar features on one part, such as camshaft lobes
- Data from duplicate machines manufacturing the same products
How to Make a Box and Whisker Plot
A box and whisker plot is constructed from five summary statistics:
- Minimum value – the smallest value in the data set
- First quartile – the value below which the lower 25% of the data are contained
- Median value – the middle value of the data set when the values are sorted in order
- Third quartile – the value above which the upper 25% of the data are contained
- Maximum value – the largest value in the data set
For example, for a set of 20 data points sorted from smallest (113) to largest (136), the five required statistics are as follows.
|Statistic|Value|
|Minimum value (data point 1)|113|
|1st quartile|124|
|Median value|126.5|
|3rd quartile|130|
|Maximum value (data point 20)|136|
Note that for a data set with an even number of values, the median is calculated as the average of the two middle values.
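The five statistics can be computed in a few lines of code. The following is a minimal Python sketch, not a definitive implementation: the function name `five_number_summary` and the sample values are hypothetical, and the lower and upper quartiles are assumed to be the medians of the lower and upper halves of the sorted data. Other quartile conventions exist and can give slightly different results.

```python
from statistics import median


def five_number_summary(values):
    """Return (minimum, lower quartile, median, upper quartile, maximum).

    Assumption: quartiles are the medians of the lower and upper halves
    of the sorted data. Other conventions can give slightly different values.
    """
    data = sorted(values)
    if not data:
        raise ValueError("need at least one value")
    half = len(data) // 2
    lower = data[:half]               # values below the median position
    upper = data[len(data) - half:]   # values above the median position
    q1 = median(lower) if lower else data[0]
    q3 = median(upper) if upper else data[0]
    return data[0], q1, median(data), q3, data[-1]


# Hypothetical values, not the 20-point data set from the example above.
sample = [7, 1, 5, 3, 9, 11, 13, 15]
print(five_number_summary(sample))  # (1, 4.0, 8.0, 12.0, 15)
```

Because the sample has an even number of values, the median (8.0) is the average of the two middle values, 7 and 9.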
Here are the data represented in box and whisker plot format.
Left figure: The center box represents the middle 50% of the data set and is drawn between the lower and upper quartile values. The median value is displayed as a line inside the "box." The maximum and minimum values are displayed with vertical lines ("whiskers") connecting the points to the center box.
Right figure: For comparison, a histogram of the data is also shown, showing the frequency of each value in the data set.
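To make the construction concrete, the sketch below draws a horizontal, text-only version of the plot from the five statistics given in the example (113, 124, 126.5, 130, 136). It is only an illustration: the function name `ascii_box_plot`, the character choices, and the `width` parameter are assumptions, and the result is not a reproduction of the figure described above.

```python
def ascii_box_plot(minimum, q1, med, q3, maximum, width=47):
    """Render five summary statistics as a one-line text box plot."""
    if not minimum <= q1 <= med <= q3 <= maximum:
        raise ValueError("statistics must be in non-decreasing order")
    span = maximum - minimum

    def column(value):
        # Map a data value to a character column between 0 and width - 1.
        if span == 0:
            return 0
        return round((value - minimum) * (width - 1) / span)

    lo, a, m, b, hi = (column(v) for v in (minimum, q1, med, q3, maximum))
    row = [" "] * width
    for i in range(lo, hi + 1):
        row[i] = "-"           # whiskers run from minimum to maximum
    for i in range(a, b + 1):
        row[i] = "="           # box runs from lower to upper quartile
    row[lo] = row[hi] = "|"    # whisker end caps
    row[a], row[b] = "[", "]"  # box edges
    row[m] = "M"               # median marker
    return "".join(row)


# Statistics taken from the example in the text.
print(ascii_box_plot(113, 124, 126.5, 130, 136))
```

The box between `[` and `]` covers the middle 50% of the data, and `M` marks the median.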
Box and Whisker Plot Example
Suppose you wanted to compare the performance of three lathes responsible for the rough turning of a motor shaft. The design specification is 18.85 +/- 0.1 mm.
Diameter measurements from a sample of shafts taken from each roughing lathe are displayed in a box and whisker plot.
- Lathe 1 appears to be making good parts, and is centered in the tolerance.
- Lathe 2 appears to have excess variation, and is making shafts below the minimum diameter.
- Lathe 3 is performing with relatively less variation than Lathe 2; however, it is centered on the lower side of the specification and is making shafts below specification.
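The same comparison can be sketched numerically by checking each lathe's extreme values against the specification limits of 18.85 +/- 0.1 mm. This is a minimal Python sketch under stated assumptions: the `assess` function is hypothetical, and the diameter values are invented to mimic the pattern described above. They are not the measurements behind the original plot.

```python
from statistics import median

NOMINAL = 18.85    # design specification from the example, in mm
TOLERANCE = 0.1
LOWER_LIMIT = round(NOMINAL - TOLERANCE, 2)   # 18.75 mm
UPPER_LIMIT = round(NOMINAL + TOLERANCE, 2)   # 18.95 mm


def assess(measurements):
    """Summarize one machine's measurements against the specification limits."""
    low, high = min(measurements), max(measurements)
    return {
        "min": low,
        "median": median(measurements),
        "max": high,
        "below_spec": low < LOWER_LIMIT,
        "above_spec": high > UPPER_LIMIT,
    }


# Hypothetical diameters in mm (illustrative only).
lathes = {
    "Lathe 1": [18.82, 18.84, 18.85, 18.86, 18.88],
    "Lathe 2": [18.70, 18.78, 18.85, 18.92, 18.94],
    "Lathe 3": [18.73, 18.76, 18.78, 18.80, 18.82],
}

for name, diameters in lathes.items():
    print(name, assess(diameters))
```

With these invented values, Lathe 1 stays inside the limits, while Lathes 2 and 3 are both flagged as producing shafts below the minimum diameter. Lathe 3 has a narrower range than Lathe 2 but a median below the nominal value.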