ABSTRACT

This chapter focuses on extracting structured data from web pages. A computer program for extracting such data is called a wrapper. In the 1990s, research community started taking interest in the Želd of IE. Roughly, there are three basic approaches for IE. These are as follows:

• Manual approach: Using web page and its source code, computer programmer identiŽes patterns and designs program to extract target data. But this approach is not scalable to a large number of websites.