ABSTRACT

R is a language and environment for statistical computing and graphics. It was developed by Ross Ihaka and Robert Gentleman at the University of Auckland, New Zealand in the 1990s. R can run on a wide variety of operating systems including Windows, MacOS, and UNIX. The main way science differs from the other types of knowledge is that all knowledge in science is provisional and is capable of being disproved, revised, or reinforced based on the outcome of additional scientific study. With the explosion in the accessibility of personal computers over the last few decades of the 20th century advanced analytical and data storage methods, once only available to a handful of professional scientists, are now available to billions of people. As a result a new discipline, data science, has developed. Organisations and individuals who understand the value of data and incorporate data science into their work flows are in a position to improve their products, methods, and decision making.