Disclaimer: This is work in progress intended to consolidate information from various sources and may deviate from them. Please consult the sources for the exact content!
Last updated: 15-Jun-2024
| Bar & Column Charts with Variations | 
- {definition} graphical representation of categorical data with rectangular figures (aka boxes) whose heights (column chart) or lengths (bar chart) are proportional to the values that they represent
 - {benefit} allow to visually encode/decode quantitative information-size as magnitude and area based on the relative position of the end of the box along the common scale
 - if the width of the box is the same, it's enough to compare the length
 - ⇒ the basis of comparison is one-dimensional [1]
 - ⇐ orient the reader to the relative magnitudes of the boxes
 - area is typically encoded when the width varies
 - ⇐ encoding by area is a poor encoding method as it can mislead
 - can represent negative and positive values
 - one of the most useful, simple, and adaptable techniques in graphic presentation [1]
 - easily understood by readers
 - sometimes avoided because they are so common
 - almost everything could be a bar chart
 - the length of each bar is proportional to the quantity or amount of each category represented [1]
 - ⇒the zero line must be shown [1]
 - ⇒the scale must not be broken [1]
 - {exception} an excessively long bar in a series of bars may be broken off at the end, and the amount involved shown directly beyond it [1]
 - {benefit} allow to visually represent categorical data
 - ⇒ occasionally represented without scales, grid lines or tick marks
 - the more data elements are presented, the more difficult it becomes to navigate and/or display the data
 - {benefit} allow us to easily compare magnitudes
 - sometimes without looking at the actual values
 - {type} bar chart
 - the box is shown horizontally
 - represents magnitude by length
 - allows comparing different items as of a specific time
 - {type} column chart
 - the box is shown vertically
 - represents magnitude by height
 - allows comparing different items over time
 - ⇐ it still displays discrete points
 - recommended for comparing similar items for different time periods [2]
 - effective way to show most types of comparisons [2]
 - {subtype} stacked chart
 - variation of bar/column charts in which the boxes of a dimension's components are staked over each other
 - {exception} spaces can be used between boxes if the values aren't cumulative [3]
 - {benefit} allows encoding a further dimension where the values are staked within the same box
 - {drawback} do not show data structure well
 - ⇒ make it challenging to compare values across boxes
 - {subtype} 100-percent chart
 - variation of stacked chart in which the magnitude totals to 100%
 - {benefit} allows to display part to whole relationships
 - ⇐ preferable to circle chart's angle and area comparison [1]
 - {subtype} clustered chart (aka grouped chart)
 - variation of bar/column charts that allows encoding further quantitative information in distinct boxes tacked together which occasionally overlap
 - ⇐ if there's space, it is usually kept to a minimum
 - e.g. can be used to display multiple data series
 - can be used with a secondary axis
 - {benefit} allows comparisons within the cluster/group as well between clusters/groups
 - {drawback} more challenging to make comparisons across points
 - {subtype} area chart (variable-width/variwide chart/graph)
 - variation of bar/column charts in which the height/width have significance being proportional to some measure or characteristics of the data elements represented [3]
 - {benefit} allow encoding a further dimension as part of the area
 - {subtype} deviation chart
 - variation of bar/column charts that display positive and negative values
 - {subtype} joined chart
 - variation of bar/column charts in which the boxes are tacked together
 - {benefit} allow to better use the space available
 - {subtype} paired chart
 - variation of bar/column charts in which the boxes are paired in mirror based on an axis
 - e.g. the values of one data series are displayed to the left, while the values for a second data series are displayed to the right
 - {benefit} allows to study the correlation and/or other relationships between the values of two data series
 - the hidden axes can have different scales
 - {subtype} circular chart (aka radial chart)
 - variation of bar/column charts in which the boxes are wrapped into a circle, the various categories being uniformly spaced along the radial or category axis [3]
 - the value scale can have any upper or lower value and can progress in either direction [3]
 - {benefit} useful to represent data that have a circular dimension in an aesthetic form
 - e.g. months, hours
 - {subtype} waterfall chart (aka progressing chart)
 - variation of bar/column charts in which the boxes are displayed progressively, the start of a box corresponding the end of the previous box
 - time and activity charts can be considered as variations of this subtype [3]
 - {advantage} allows to determine cumulative values, respectively the increase/decrease between consecutive boxes
 - {subtype}composite chart (aka mixed chart, combination chart, overlay chart)
 - variation of bar/column charts in besides boxes are used other graphic types of encoding (line, area)
 - ⇐ the different data graphics are overlaid on one another [3]
 - {benefit} allows to improve clarity or highlight the relationships between several data series [3]
 - {drawback} overlaying can result in clutter
 - used to
 - display totals, averages or frequencies
 - display time series
 - display the relationship between two or more items
 - make a comparison among several items
 - make a comparison between parts and the whole
 - can be confounded with
 - [histograms]
 - show distribution through the frequency of quantitative values against defined intervals of quantitative values
 - used for continuous numerical data or data that can be effectively modelled as continuous
 - it doesn't have spaces between bars
 - ⇐ older use of bar/column charts don't use spaces
 - if this aspect is ignored, histograms can be considered as a special type of area chart
 - [vertical line chart] (aka price chart, bar chart)
 - vertical line charts are sometimes referred as bar charts (see [3])
 - things to consider
 - distance between bars
 - the more distant the bars, the more difficult it becomes to make comparisons and the accuracy of judgment decreases
 - sorting
 - sorting the bars/columns by their size facilitates comparisons, though it can impede items' search, especially when there are many categories involved
 - {exception} not recommended for time series
 - clutter
 - displaying too many items in a cluster and/or too many labels can lead to clutter
 - {recommendation} display at maximum 3-4 clustered boxes
 - color
 - one should follow the general recommendations
 - trend lines
 - can be used especially with time series especially to represent the linear regression line
 - dual axis
 - {benefit} allows to compare the magnitudes of two data series by employing a secondary axis
 - overlapping
 - overlapping boxes can make charts easier to read
 - symbols
 - can be used to designate reference points of comparison for each of the bars [3]
 - {alternative} pie chart
 - can be used to dramatize comparisons in relation to the whole [2]
 - one should consider the drawbacks
 - {alternative} choropleth maps
 - more adequate for geographical dimensions
 - provide minimal encoding
 - {alternative} line charts
 - can be much more informative
 - provides an optimal dat-ink ratio
 - reduces the chart junk feeling
 - {alternative} dot plots
 - are closer to the original data
 
References:
[1] Anna C Rogers (1961) "Graphic Charts Handbook"
[2] Robert Lefferts (1981) "Elements of Graphics: How to prepare charts and graphs for effective reports"
[3] Robert L Harris (1996) "Information Graphics: A Comprehensive Illustrated Reference"
