ABSTRACT

SIMPR (Structured Information Management: Processing and Retrieval) is a project in the ESPRIT II programme of the Commission of the European Community. The SIMPR system provides software support for the creation, management and querying of very large information bases on CD-ROM. The information stored will typically be technical manuals, libraries of technical reports or other full-text documents. A full-text document is one with no prerequisites on its content or format. Each of these documents is composed of a number of texts. Each text is processed in two stages. It is first indexed to extract words and phrases with a high meaning content. Then, the subject(s) of the text are identified and appropriate classificators are attributed to the text.