ABSTRACT

This chapter introduces Waikato Environment for Knowledge Analysis (Weka), a powerful analytics tool containing algorithms for data preprocessing, supervised learning, clustering, association rule mining, and visualization. Weka is written in the Java programming language and is publicly available under the terms of the GNU General Public License. Weka offers four graphical user interfaces (GUI's). The chapter focuses on Weka's Explorer interface as it provides the fastest way to get you started mining one's own data. The data sets found in the Weka data set library represent a good starting point to help you learn about the basic data mining algorithms. Data sets not part of this library but used in this chapter are contained in the file datasetsWeka.zip available at our website. Links to additional data sets can be found by clicking on datasets listed under the Further Information heading at Weka's home page site.