ABSTRACT

Usually when bioinformatics researchers search for disease associated genes, they would use two types of data: a genotype data which codes the genetic information of each subject in the sample, and a phenotype data which codes the affection status of each subject. A genotype data is a m × n twodimensional array, in which every element is 0, 1 or 2 (coding for three kinds of genotypes). A phenotype data is a m × 1 vector, in which every element is either 0 (unaffected) or 1 (affected). Here m is the number of individuals in the sample, while n is the number of loci (gene position in the chromosomes) collected.