ABSTRACT

This chapter covers parsing, cleaning, and processing text data. It introduces common character functions for text processing and illustrates them with examples that remove stop words and assign sentiment values. Data sets are then built by reading and processing data from websites. Finally, regular expressions are introduced, and the website processing example is redone using regular expression functions.