ABSTRACT

There is a lot of data on the Web, and this data has to be managed and mined so that the relevant nuggets of information can be extracted. In addition, the Web usage patterns have to be mined so that one can identify who is browsing the Web. Finally, the Web structure has to be mined so that Web searches can be made more efficient. Lots of technologies have to work together to support Web data mining. First, we need to index and retrieve the data; that is, we need to efficiently manage the digital libraries hosted on the Web. Next, we need support for E-commerce technologies. Third, we need to develop semantic Web technologies to understand the Web pages. This chapter describes all these aspects of Web data management and mining, as illustrated in Figure 8.1.