
Contents

3.1 Introduction
3.2 TV Content Analysis for Harmful Content Detection
    3.2.1 Extracting Harmful Clues from a Single Modality
        3.2.1.1 Audio Analysis Approaches
        3.2.1.2 Visual Analysis Approaches
        3.2.1.3 Textual Approaches
    3.2.2 Combining Low- to Medium-Level Extracted Clues in Multimodal Approaches
    3.2.3 Higher Level Semantics Extraction: The Role of Ontologies
3.3 Knowledge-Based Framework for Violence Identification in Movies
    3.3.1 Preprocessing-Segmentation Semantics
    3.3.2 Audiovisual Semantics
        3.3.2.1 Visual Semantics
        3.3.2.2 Audio Semantics
    3.3.3 Domain Ontology Definition
    3.3.4 Inferencing Procedure
    3.3.5 Implementation and Experimentation
    3.3.6 Extensions-Future Work
3.4 Conclusions
References

3.1 Introduction

Although controversial, television is probably the most common medium of information and entertainment. Everyone has access to TV content through a number of end-user devices (TV sets, mobile phones, PCs) and over a number of different communication channels, yet still has limited control over the received content. Personalization services enhancing the overall viewer experience are nowadays made possible and offered by a number of media service providers. However, little progress has been made on the development of intelligent, efficient, human-like technological solutions for the automatic identification and filtering of undesirable broadcast TV content, which could further facilitate the protection of sensitive user groups (e.g., children).

To better understand the underlying processes and provide a more intuitive description of the benefits and functionalities arising from such technologies, an example use case is presented involving Bob and Mary's family: themselves and their two children, a daughter, Pat, who is still a kid, and a son, Tom, who is a teenager. This use case further illustrates how advanced TV content filtering services would operate. In this use case, Bob has just bought a brand new TV set with content filtering capabilities. He plugs it into the power outlet and turns it on. The profile settings screen appears. Bob creates four distinct user profiles, one for each of his family members, and browses through a harmful content hierarchy to select the content that should be filtered out for each of them, including himself. As Pat is still a kid, all content classified as harmful should be filtered out; thus the top class of the hierarchy is selected. As Tom is a teenager who likes action movies with fights and explosions and is old enough to watch partial nudity, the corresponding harmful content subclasses are deselected for him. However, sexual actions, total nudity, and murders remain disallowed. Mary does not like murders, blood splatter, and boxing; therefore, the corresponding classes are selected for her to be filtered out. Finally, Bob creates an empty profile for himself, as he wishes to view any type of received content. The next step is to filter out (i.e., skip or blur) the selected categories of received content for the corresponding user profile. For this purpose, the TV set (or set-top box) is accompanied by dedicated filtering software that also allows for user profile storage and harmful TV content filtering on mobile phones and PCs, thus providing the desired protection from objectionable content for Bob and Mary's family.
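To make the profile logic of the use case more concrete, the following Python sketch models a toy harmful-content hierarchy and per-profile class selection and deselection. The hierarchy, category names, and function names are illustrative assumptions for this example only and do not correspond to the chapter's actual ontology or implementation.

```python
# Minimal sketch of profile-based harmful-content filtering, assuming a
# purely illustrative category hierarchy (not the chapter's ontology).

HARMFUL_CONTENT_HIERARCHY = {
    "HarmfulContent": ["Violence", "Nudity", "SexualAction"],
    "Violence": ["Fight", "Explosion", "Murder", "BloodSplatter", "Boxing"],
    "Nudity": ["PartialNudity", "TotalNudity"],
}

def expand(category, hierarchy=HARMFUL_CONTENT_HIERARCHY):
    """Return a category together with all of its subcategories."""
    result = {category}
    for child in hierarchy.get(category, []):
        result |= expand(child, hierarchy)
    return result

def blocked_categories(selected, deselected=()):
    """Selecting a class blocks its whole subtree; deselecting re-allows one."""
    blocked = set()
    for category in selected:
        blocked |= expand(category)
    for category in deselected:
        blocked -= expand(category)
    return blocked

# Pat: the top class is selected, so everything harmful is filtered out.
pat = blocked_categories({"HarmfulContent"})

# Tom: top class selected, but fights, explosions, and partial nudity allowed.
tom = blocked_categories({"HarmfulContent"},
                         deselected={"Fight", "Explosion", "PartialNudity"})

# Mary: only specific classes are selected for filtering.
mary = blocked_categories({"Murder", "BloodSplatter", "Boxing"})

def should_filter(segment_label, profile_blocked):
    """A detected segment is skipped or blurred if its label is blocked."""
    return segment_label in profile_blocked

print(should_filter("Murder", tom))   # True: murders stay disallowed for Tom
print(should_filter("Fight", tom))    # False: fights were deselected for Tom
```

In this sketch a profile is simply the set of blocked classes derived from the user's selections; the actual filtering step would match detected segment labels against that set and skip or blur the corresponding content.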