For another assignment from Reproducible Research, I've prepared a self contained analysis from a large and fairly dirty data set, as if it were to be delivered to a Government decision maker.
By 'self contained', we mean that every aspect from start to finish must be self-evident & reproducable.
Similar to my previous post, here I also would have preferred to tidy the data in R and build the visualizations with Tableau.
The graphical systems in R are robust. And I find them super convenient to quickly plot the distribution of some data, or to spot check my work along the way. But R (in my opinion) is no place to produce "communication" level visualizations.
Visualizations built in R are simply too rigid and static for communication purposes. The moment one reaches a "tidy" data set that is for analysis, then it's time to switch to Tableau for communication.
If you would like to download my reproducible code to give it a run, click here.