Release 9 of the Database of Transcribed Sequences (DoTS) for Homo sapiens was completed on Wed Sep 1 23:29:30 EDT 2004. The data sources include: * GenBank (Release 142) * NRDB (2004-07-06) * dbEST (2004-07-06) * Pfam (2003-11-07) * ProDom (prodom2003.1) * CDD (2003-11-07) * Gene Ontology (GO) consortium ontologies and assignments * NCBI gene trap tag records from 8 original sources: GGTC, Baygenomics, SIGTR, MFGC, CMHD, Lexicon Genetics, and the H.E.Ruley and P.Soriano labs * UCSC Genome Bioinformatics Group (genome version hg16) The files available are: humDoTS_rel9.fasta.gz The sequence for the consensus transcripts humDoTS_rel9_predictedProteins.fasta.gz The predicted protein translation of each assembled transcript humDoTS_rel9_DTAnatomy.gz The anatomy percent for assembled transcripts humDoTS_rel9_bestNRDBHits.dat.gz The best hit in NRDB for each consensus transcript humDoTS_rel9_accessionsPerAssembly.dat.gz The Genbank accessions of ESTs and mRNAs contained in each assembled transcript humDoTS_rel9_DTperDG.dat.gz Assembled transcripts belonging to each gene humDoTS_rel9_mRNAaccessionsPerAssembly.dat.gz The Genbank accessions of mRNAs contained in each assembled transcript humDoTS_rel9_brainTerms.dat Brain anatomy terms for which there are DoTS Transcripts. (Tab delimited: term, term ID, count of DTs) humDoTS_rel9_LL2DoTS.gz A mapping of DoTS to LocusLink humDoTS_rel9_predictedProteinDetails.txt.gz A tab delimited file of the translation details for proteins predicted using FrameFinder humDoTS_rel9_manuallyReviewedTranscriptsReport.txt.gz A tab delimited report for all manually reviewed DoTS Transcripts