ABSTRACT

Common formatting tasks include selecting subsets of rows and columns from the data table, calculating new variables from the raw data values, computing summary statistics, and combining data from different sources. Mutate and transmute are used to add new columns derived from the values in existing columns. Almost all of the data used in environmental geography applications can be stored and manipulated in data frames, but the format of the data frame can differ. Table joins are a concept shared across many data science disciplines and are implemented in relational database management systems such as MySQL. There are many helpful resources that can help expand knowledge of dplyr, tidyr, and other tidyverse packages.