ABSTRACT

Analysis of DNA sequence data is important for comprehending the meaningful biological information buried in the data set. Due to the high time and computational complexity, developing efficient and effective analysis methods for processing large-scale DNA sequence data has always been a crucial task in the post-genomic era. With the recent progresses in computer science and electrical engineering, multiple novel methods have been introduced to the biology regime and applied to the DNA sequence analysis. These methods significantly improved the efficiency and accuracy of DNA sequence data analysis. Among these methods, data mining technique, which uncovers the information hidden in the data, is regarded as one of the most promising and effective approach. This paper provides a comprehensive review of recent progress on DNA sequence analysis technologies, and discusses the use of data mining method in this area.