ABSTRACT

Proteins express information through their constituent number N: the number of amino acids, irrespective of identity. The chapter concentrates on the length properties of proteins, which typically receive only glancing attention when sequences and folded structures are inspected. The tools of Chapter Two are directed (again!) to archetypal molecules and sets. The compressibility of a sequence is explored through information-conserving transforms. This discussion is followed by the application of probability functions to the N-distributions expressed by proteomes. The chapter closes with attention to the information asymmetry of distributions.