These databases were constructed by extracting the organism-specific ESTs from dbEST, removing poly(A) sequences from the ends, and trimming 5' and 3' regions containing more than 25% N's in a 20-base-pair window. These "quality" sequences were then aligned using the cap2 program, and the resulting consensus sequences were placed in a database that is available on the web. The parasitic organisms chosen each have between 3,000 and 15,000 ESTs. The aim is to provide useful information and analyses to the scientific community without curating the results in any way. A brief tutorial from a poster presentation on the Toxoplasma gondii database can be viewed here.
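As a rough illustration of the cleaning step described above, the following Python sketch strips terminal poly(A) runs and trims N-rich ends. It is not the code used to build these databases. The function names, the minimum run length of 10 bases, the treatment of a leading poly(T) run as the reverse-strand form of a poly(A) tail, and the one-base-at-a-time sliding of the window are all assumptions made for the example; only the 20-base window and the 25% threshold come from the description.

```python
def strip_poly_a(seq, min_run=10):
    """Remove a trailing run of A's and a leading run of T's.

    min_run is a hypothetical cutoff; shorter runs are left in place.
    """
    upper = seq.upper()
    tail = len(upper) - len(upper.rstrip("A"))
    if tail >= min_run:
        seq = seq[:len(seq) - tail]
        upper = upper[:len(upper) - tail]
    head = len(upper) - len(upper.lstrip("T"))
    if head >= min_run:
        seq = seq[head:]
    return seq


def trim_n_rich_ends(seq, window=20, max_n_frac=0.25):
    """Trim each end while its terminal window holds more than max_n_frac N's.

    The window slides inward one base at a time (an assumed strategy).
    """
    upper = seq.upper()
    start, end = 0, len(upper)
    limit = max_n_frac * window
    # 5' end: drop bases while the leading window is N-rich.
    while end - start >= window and upper[start:start + window].count("N") > limit:
        start += 1
    # 3' end: drop bases while the trailing window is N-rich.
    while end - start >= window and upper[end - window:end].count("N") > limit:
        end -= 1
    return seq[start:end]


def clean_est(seq):
    """Apply both steps in the order given in the text."""
    return trim_n_rich_ends(strip_poly_a(seq))
```

Under these assumptions, a read such as `"N" * 10 + "ACGT" * 10 + "A" * 12` would lose its poly(A) tail and its first five N's, since trimming stops as soon as the leading 20-base window contains no more than five N's.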
Toxoplasma gondii v3.0: All Toxoplasma gondii ESTs from dbEST as of October 10, 2000: 5,168 consensus sequences.
Eimeria tenella: All Eimeria tenella ESTs from dbEST as of November 11, 2000: 1,679 consensus sequences.