Anonymizing Transaction Data
So far, we have considered relational data where all records have a ﬁxed set of attributes. In real life scenarios, there is often a need to publish unstructured data. In this chapter, we examine anonymization techniques for one type of unstructured data, transaction data. Like relational data, transaction data D consists of a set of records, t1, . . . , tn. Unlike relational data, each record ti, called a transaction, is an arbitrary set of items drawn from a universe I. For example, a transaction can be a web query containing several query terms, a basket of purchased items in a shopping transaction, a click stream in an online session, an email or a text document containing several text terms. Transaction data is a rich source for data mining . Examples are association rule mining , user behavior prediction , recommender systems (www.amazon.com), information retrieval  and personalized web search , and many other web based applications .