ABSTRACT

This chapter focuses on extracting valuable information from the unstructured data that makes up much of the content on the deep web. It reviews policy guidelines for extracting and analyzing unstructured deep web content. Content analysis of the extracted data sorts it into specific categories for further analysis or use, and can identify repeating patterns, usability, and credibility, among other characteristics. Log analysis involves the critical evaluation of content collected, processed, and stored within information systems. Risk analysis and mitigation are essential, especially when accessing, mining, and logging website content from the dark web, because access to and use of such content carry several risks that must be handled with care and diligence. Text analytics has become possible through advances in computing that enable the use of natural language processing.