ABSTRACT

This monograph is a translation of two seminal works on corpus-based studies of Mandarin Chinese words and parts of speech.

The original books were published as two pioneering technical reports by Chinese Knowledge and Information Processing group (CKIP) at Academia Sinica in 1993 and 1996, respectively. Since then, the standard and PoS tagset proposed in the CKIP report have become the de facto standard in Chinese corpora and computational linguistics, in particular in the context of traditional Chinese texts.

This new translation represents and develops the principles and theories originating from these pioneering works. The results can be applied to numerous fields; Chinese syntax and semantics, lexicography, machine translation and other language engineering bound applications.

Suitable for graduate and scholars in the fields of linguistics and Chinese, Mandarin Chinese Words and Parts of Speech provides a comprehensive survey of the issues around wordhood and PoS.

 

Chapter 6, 7, 8, 9, 10, 11, 12, 13, 14 and the appendixes V-VII of this book is freely available as a downloadable Open Access PDF under a Creative Commons Attribution-Non Commercial-No Derivatives 4.0 license available at https://www.taylorfrancis.com

chapter |6 pages

Overview

From Linguistics Studies to Language Resources Annotation and Processing

part 1|99 pages

Words, Segmentation Units, and Segmentation Standards

chapter 1|4 pages

Introduction

chapter 3|63 pages

“Sou” WenJieZi

Studies on Identification of Words and Segmentation Units

chapter 5|9 pages

Comparison of Two Segmentation Standards

part 2|126 pages

PoS Analysis of Contemporary Chinese

chapter 7|49 pages

V: Verbs

chapter 8|2 pages

A: Non-Predicative Adjectives

chapter 9|33 pages

N: Content Words

chapter 10|15 pages

D: Adverbs

chapter 11|7 pages

P: Preposition

chapter 12|7 pages

C: Conjunction

chapter 13|2 pages

T: Particles

chapter 14|1 pages

I: Interjections

part 3|4 pages

Resources