ABSTRACT

Often data have to be retained for various reasons including for regulatory compliance. The data retained may have sensitive information and could violate user privacy. Furthermore, manipulating such big data, such as combining sets of different types of data, could result in security and privacy violations. For example, while the raw data remove personally identifiable information, the derived data may contain private and sensitive information. The raw data about a person may be combined with the person’s address, which may be sufficient to identify the person.