ABSTRACT

This chapter analyses how various tools for corpus creation and analysis can lead to the observation of linguistic regularities in corpora from different angles. It explores an overview of the tools available for the creation of corpus resources and for the retrieval and analysis of data from corpora, and then examines the main types of computer-assisted analyses which can be carried out using corpus tools. The construction, distribution and analysis of corpus resources all rely on both hardware and software tools. The chapter focuses on monolingual corpora, the tools and techniques are also relevant for the creation and analysis of multilingual corpora, both parallel and comparable. It discusses the general purpose applications that are found under all operating systems, not all applications specifically created for corpus annotation, management and analysis will work in all software environments. This chapter examines how statistics, visualization formats and linguistic annotation can be used to derive information from corpora.