- The raw data.
- A tidy data set
- A code book describing each variable and its values in the tidy data set.
- An explicit and exact recipe you used to go from 1 -> 2,3.
Jeffrey Leek, Assistant Professor of Biostatistics
Johns Hopkins Bloomberg School of Public Health
You know the raw data is in the right format if you
Some other important tips
Some other important tips
In some cases it will not be possible to script every step. In that case you should provide instructions like: