Graph Analysis | 9 | Understanding Complex Datasets

ABSTRACT

In the previous chapter, we considered what might be called attributed data: sets of records, each of which speciﬁed values of the attributes of each object. When such data is clustered, the similarity between records is based on a combination of the similarity of the attributes. The simplest, of course, is Euclidean distance, where the squares of the diﬀerences between attributes are summed to give an overall similarity (and then a square root is taken).