ABSTRACT

Strongly related to text segmentation is word segmentation, which is defi ned as the problem of dividing a string of written language into its component words. In English, as well as in many other languages, using some form of the Greek or Latin alphabet, the space between words is a good approximation of a word delimiter.