ABSTRACT

In parallel with the EST sequencing, a major cDNA project was launched in Japan. The first public release of the full-length Triticeae cDNA sequence database in 2008 contained 15 871 sequences (https://trifldb.psc.riken.jp), including 8530 putative full-length coding sequences and their annotations from wheat. The latest set incorporates 16 608 wheat fl-cDNAs, from which a set of 16 408 non-redundant potential protein-coding fl-cDNAs has been identified using stringent CD-Hit clustering at 98% identity or failed prediction of protein sequences with OrfPredictor (Manickavelu et al. 2012).