ABSTRACT

Chapter 9 discusses data cleaning, the final step in the workflow. The chapter focuses primarily on identifying and treating (by removal or otherwise) discordant values, such as errors of value and outliers, as well as observations with missing data. This step completes the preparation of the data for statistical tests, modeling, or visualization.