Introducing Compara, the Portuguese-English Parallel Corpus

doi:10.4324/9781315760162-6

ABSTRACT

This paper is an introduction to the Portuguese-English parallel corpus, Compara. Compara is a machine-searchable and open-ended collection of Portuguese-English and English-Portuguese source texts and translations. It was made for people who have never used corpora before as well as for experienced corpus users. Compara’s encoding and alignment criteria allow users to inspect translators’ notes and investigate when and where translators have chosen to join, separate, delete, add and reorder sentences. Also, the corpus has been specifically designed to accommodate more than one translation per source text. Compara is freely accessible on the WWW.