ABSTRACT

This chapter investigates whether the recently minted term data science implies that statistics is a subset of a more developed and expanded domain, or whether the term is merely a buzzed-up cloaking of the current state of statistics. It asks whether a data scientist is a super-statistician, whose new appellation signifies a larger skill set than that of the current statistician, or whether the term data scientist is, trivially, a re-imaging of the professional with a pretentious catchword of little exact meaning. The Internet accounts for big data, which include not only numbers but also text, voice, images, and so on. Big data, in turn, account for the necessity of the computer and for the birth and continuing acceleration of high-performance statistical programs. The chapter conducts a text-by-text comparative investigation to determine whether data science is identical, or at least similar, to statistics.