ABSTRACT

The goal of this chapter is to introduce the text mining capabilities of RapidMiner through a use case. The use case involves mining reviews for hotels at TripAdvisor.com, a popular web portal. We will be demonstrating basic text mining in RapidMiner using the text mining extension. We will present two different RapidMiner processes, namely Process01 and Process02, which respectively describe how text mining can be combined with association mining and cluster modeling. While it is possible to construct each of these processes from scratch by inserting the appropriate operators into the process view, we will instead import these two processes readily from existing model files.