The Making of the CorDis Corpus: Compilation and Markup

doi:10.4324/9780203868157-9

ABSTRACT

This chapter sets out to describe how different subcorpora were integrated to form the unified body of texts known as the CorDis Corpus. In particular, it focuses on the process whereby CorDis was made an XML-valid, TEI-conformant corpus that can be easily interrogated using Xaira. In discussing specific examples illustrating the practice of markup, the chapter highlights the import of annotation as a way to enhance reliability of research and (re-)usability of data.1