ABSTRACT

This addition to the Data Science Series applies the principles of data science and the R language to the particular needs of water professionals. The book provides unique data and examples relevant to managing a water utility, drawn from the author’s extensive experience.

Data Science for Water Utilities: Data as a Source of Value is an applied, practical guide that shows water professionals how to use data science to solve urban water management problems. Content develops through four case studies. The first looks at analysing water quality to ensure public health. The second considers customer feedback. The third introduces smart meter data. The last analyses data using basic machine learning. The guide flows easily from basic principles through code that increases in complexity with each case study.

Readers will be familiar with analysing data but do not need coding experience to use this book. The title will be essential reading for anyone seeking a practical introduction to data science and creating value with R.

TABLE OF CONTENTS

Chapter 1: Introduction (10 pages)
Chapter 2: Basics of the R Language (12 pages)
Chapter 3: Loading and Exploring Data (12 pages)
Chapter 4: Descriptive Statistics (14 pages)
Chapter 5: Visualising Data with ggplot2 (16 pages)
Chapter 6: Sharing Results (16 pages)
Chapter 7: Managing Dirty Data (14 pages)
Chapter 8: Analysing the Customer Experience (16 pages)
Chapter 9: Basic Linear Regression (14 pages)
Chapter 10: Clustering Customers to Define Segments (16 pages)
Chapter 11: Working with Dates and Times (14 pages)
Chapter 12: Detecting Outliers and Anomalies (12 pages)
Chapter 13: Introduction to Machine Learning (16 pages)
Chapter 14: In Closing (4 pages)