ABSTRACT

The purposes of this part of the textbook is fivefold:

To introduce the reader to the core concepts of topic modeling and text classification

To provide an introduction to three libraries used for traditional topic modeling (Scikit Learn, Gensim, and spaCy) for those with limited Python knowledge

To detail the problems and solutions to working with various topic modeling problems

To provide an overview of transformer-based topic modeling

To provide code that will be easily reproducible for readers who wish to apply these methods to their own domains.