ABSTRACT

In this chapter you will learn what a corpus is (plural: corpora) and what the four methods are to which nearly all aspects of corpus-linguistic work can be reduced in some way.

Before we start to actually look at corpus linguistics, we have to clarify our terminology a little. While the actual programming tasks do not differ between them, in this book I will distinguish between a corpus, a text archive, and an example collection.