ABSTRACT

We begin with an overview of ML research and different paradigms in the Želd. This chapter presents the principles for applying ML to text, focusing on concrete techniques that range from various ways of representing documents and selecting features to particular classiŽcation and clustering algorithms.