ABSTRACT

With the tremendous growth in available information to the masses, the question is how users can searchtheusefulinformationintheshortesttime[PW09,NGW08,PYW10].Inotherwords,makinguse ofconsolidatedinformationrequiressuchsubstantialeªortssincethewebpagesaregeneratedforvisualizationandnotfordataexchange[KCSG07,W10].Toreachthisgoalrequiresmethodsdevelopedto optimizeauser’ssearchingprocess.¥ischapterintroducesamethodknownasthedataminingrobot (DMR)toextractandprocessdatabyusingPERLscriptinglanguage.¥eDMRcanbeunderstood quicklyasaso¯wareprogramthatservesforminingdataautomatically.Particularly,withdatamining fromservers,thismethoddoesnotuseanybrowsertohandletheWeb,butdoessodirectlybyusing PERLmodules(so¯wareprogramsarewrittenforaspeci¦cfunction)suchasLWP.¥euseofthese modules turns the DMR into an eªective solution to extract data with an accelerated speed.