ABSTRACT

String-matching algorithms are critical to several scientific fields. Beside text processing and databases, emerging applications such as DNA protein sequence analysis, data mining, information-security software, antivirus, and machine learning all exploit string matching algorithms [1]. All these applications usually process a large quantity of textual data, require high performance, and/or predictable execution times. Among all the string matching

algorithms, one of the most studied, especially for text processing and security applications, is the Aho-Corasick algorithm.