These databases were constructed by extracting the organism-specific ESTs from dbEST, removing poly(A) sequences from the ends, and trimming 5' and 3' regions with more than 25% Ns in a 20-base-pair window. These "quality" sequences were then aligned using the cap2 program, and the resulting consensus sequences were placed in a database that is available on the web. A number of parasitic organisms were chosen, each with between 3,000 and 15,000 ESTs. The aim is to provide useful information and analyses to the scientific community without curating the results in any way. A brief tutorial on the Toxoplasma gondii database, taken from a poster presentation, can be viewed here.
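The cleaning steps described above (poly(A) removal, then trimming of N-rich ends) can be illustrated with a minimal Python sketch. This is not the code used to build the databases. The function names, the 10-base minimum run length for poly(A), and the choice to trim one base at a time until the terminal window passes are all assumptions made for illustration; only the 20-base window and the 25% threshold come from the description. Poly(T) runs from reverse-strand reads are not handled here.

```python
def strip_poly_a(seq, min_run=10):
    """Remove runs of A at either end of the sequence.

    min_run is an assumed cutoff: shorter runs are kept, since a few
    terminal A's may be genuine sequence.
    """
    seq = seq.upper()
    tail = len(seq) - len(seq.rstrip("A"))
    if tail >= min_run:
        seq = seq[:len(seq) - tail]
    head = len(seq) - len(seq.lstrip("A"))
    if head >= min_run:
        seq = seq[head:]
    return seq


def trim_n_rich_ends(seq, window=20, max_n_frac=0.25):
    """Trim the 5' and 3' ends while the terminal window is N-rich.

    A window is N-rich when more than max_n_frac of its bases are N.
    Sequences shorter than the window are returned unchanged (an
    assumption; the original description does not cover this case).
    """
    start, end = 0, len(seq)
    # 5' end: drop one base at a time until the leading window passes.
    while (end - start >= window
           and seq[start:start + window].count("N") / window > max_n_frac):
        start += 1
    # 3' end: same procedure on the trailing window.
    while (end - start >= window
           and seq[end - window:end].count("N") / window > max_n_frac):
        end -= 1
    return seq[start:end]


def clean_est(seq):
    """Apply both steps in the order given in the description."""
    return trim_n_rich_ends(strip_poly_a(seq))
```

Because trimming stops as soon as the terminal window holds 25% Ns or fewer, a few Ns can remain at the ends of the cleaned sequence. A stricter pipeline might strip those as well, but the description does not say whether that was done.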
Comments or Questions