ABSTRACT

Part-of-speech (PoS) tagging is a well-known problem and a common step in many natural language processing applications such as machine translation, word sense disambiguation, and syntactic parsing. A PoS tagger is a program that attempts to assign the correct PoS tag or lexical category to all words of a given text, typically by relying on the assumption that a word can be assigned a single PoS tag by looking at the PoS tags of neighbouring words.