ABSTRACT

This chapter discusses how to extract data from PDFs and online sources using Google Sheets and point-and-click tools, and it reviews the legal issues surrounding scraping. In the exercises at the end of the chapter, readers will download and open files in various formats (covering product recalls, unemployment rates and other topics), scrape data from websites and PDFs, and create a dataset showing how many inventors in their state or area have received patents in recent years.