Last Updated: February 5, 2002

This directory includes files containing GO Function predictions for
several organisms as well as SwissProt. The following files are provided
(description of format found below):

musDoTS:
    musDoTS_ProDom_NoIEA.pred
    musDoTS_ProDom_ALL.pred
    musDoTS_CDD_NoIEA.pred
    musDoTS_CDD_ALL.pred

humDoTS:
    humDoTS_Union_NO_IEA.pred

SwissProt:
    SP_ProDom_NoIEA.pred
    SP_ProDom_ALL.pred
    SP_CDD_NoIEA.pred
    SP_CDD_ALL.pred

C.Elegans:
    CE_Union_NO_IEA.pred

A.Thaliana:
    AT_Union_NO_IEA.pred

S.Cerevisiae:
    SC_Union_NO_IEA.pred

File Format
===========

The format of these files generally follows the tab-delimited Annotation
File Format used by the Gene Ontology Consortium. See:

    http://www.geneontology.org/GO.annotation.html#file

for a detailed description. Here we will explicitly state how we are
using each field.

Field               Our usage
========================================================
DB                  will be CBIL in all cases
DB_Object           a unique identifier for the AA sequence in GUS
                    (Genomics Unified Schema), our data warehouse
DB_Object_Symbol    an identifier from the source database:
                        DoTS          DT.XXXXXX
                        SwissProt     Accession
                        C.Elegans     CEXXXXXXX
                        A.Thaliana    ATXXXXXXX
                        S.Cerevisiae  SCXXXXXXX
GOId                the GO id of the "leaf" molecular function terms
DB:Reference        Here we include evidence for the prediction.
                    The format is as follows:

                        CBIL:AGF.< >|EVID:< >,RS.< >,R.< >,PVR=< >

                    The < > are filled with the appropriate identifiers:

                    Field   Usage
                    ----------------------------------------
                    AGF     an internal identifier for the prediction
                    EVID    domain source id
                    RS      identifier for a rule set in our database
                            that was used to generate the prediction.
                            A rule set is generated for a domain. Each
                            rule set can have one or more rules
                            associated with it.
                    R       identifier for the specific rule that
                            generated the prediction
                    PVR     -log(similarity pval)/-log(rule pval)
EVID                always IEA
DB_Object_Name      Name or identifier from source database or, in the
                    case of DoTS, the description.
DB_Object_Type      will either be protein or transcript
taxon               the NCBI taxon id for the organism
Date                date prediction was made (YYYYMMDD format)

=============================================================

The CBIL Software and Data License, Version 1.0

Copyright (c) 2001 The Computational Biology and Informatics Laboratory
(CBIL) at the University of Pennsylvania. All rights reserved.

Redistribution and use of software in source and binary forms or of data,
with or without modification, are permitted provided that the following
conditions are met:

1. Redistributions of source code, binary software or data alone must
   retain the above copyright notice, this list of conditions and the
   following disclaimer.

2. Redistributions as part of a package or product must reproduce the
   above copyright notice, this list of conditions and the following
   disclaimer in the documentation and/or other materials provided with
   the distribution.

3. The end-user documentation included with the redistribution, if any,
   must include the following acknowledgment:
       "This product includes software and/or data developed by CBIL at
       the Center for Bioinformatics at the University of Pennsylvania
       (http://www.pcbi.upenn.edu)."
   Alternately, this acknowledgment may appear in the software or with
   the data itself, if and wherever such third-party acknowledgments
   normally appear.

4. The names "CBIL" and "Penn Center for Bioinformatics" must not be used
   to endorse or promote products derived from this software/data without
   prior written permission.

THIS SOFTWARE/DATA IS PROVIDED ``AS IS'' AND ANY EXPRESSED OR IMPLIED
WARRANTIES, INCLUDING, BUT NOT LIMITED TO, THE IMPLIED WARRANTIES OF
MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE ARE DISCLAIMED.
IN NO EVENT SHALL CBIL OR THE PENN CENTER FOR BIOINFORMATICS BE LIABLE
FOR ANY DIRECT, INDIRECT, INCIDENTAL, SPECIAL, EXEMPLARY, OR
CONSEQUENTIAL DAMAGES (INCLUDING, BUT NOT LIMITED TO, PROCUREMENT OF
SUBSTITUTE GOODS OR SERVICES; LOSS OF USE, DATA, OR PROFITS; OR BUSINESS
INTERRUPTION) HOWEVER CAUSED AND ON ANY THEORY OF LIABILITY, WHETHER IN
CONTRACT, STRICT LIABILITY, OR TORT (INCLUDING NEGLIGENCE OR OTHERWISE)
ARISING IN ANY WAY OUT OF THE USE OF THIS SOFTWARE, EVEN IF ADVISED OF
THE POSSIBILITY OF SUCH DAMAGE.

====================================================================
This license is based on the open source license from the
Apache Software Foundation (http://www.apache.org).
====================================================================
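
Example: parsing the DB:Reference field
=======================================

The DB:Reference template given under File Format
(CBIL:AGF.< >|EVID:< >,RS.< >,R.< >,PVR=< >) can be split into its parts
with a short script. The following is a minimal sketch, not part of the
CBIL distribution. The names parse_db_reference and Evidence are
hypothetical, the sample value in the usage note is made up for
illustration, and the sketch assumes that identifiers contain no "|" or
"," characters and that PVR is written as a plain number.

```python
import re
from typing import NamedTuple

# Mirrors the template CBIL:AGF.< >|EVID:< >,RS.< >,R.< >,PVR=< >
# Assumption: no identifier contains '|' or ','.
_DB_REFERENCE = re.compile(
    r"^CBIL:AGF\.(?P<agf>[^|]+)"
    r"\|EVID:(?P<evid>[^,]+)"
    r",RS\.(?P<rs>[^,]+)"
    r",R\.(?P<r>[^,]+)"
    r",PVR=(?P<pvr>[^,]+)$"
)


class Evidence(NamedTuple):
    """Parts of one DB:Reference value (hypothetical container)."""
    agf: str       # internal identifier for the prediction
    evid: str      # domain source id
    rule_set: str  # identifier of the rule set used
    rule: str      # identifier of the specific rule used
    pvr: float     # -log(similarity pval) / -log(rule pval)


def parse_db_reference(value: str) -> Evidence:
    """Split a DB:Reference value into its labeled parts.

    Raises ValueError if the value does not follow the template.
    """
    match = _DB_REFERENCE.match(value.strip())
    if match is None:
        raise ValueError(f"unrecognized DB:Reference value: {value!r}")
    return Evidence(
        agf=match["agf"],
        evid=match["evid"],
        rule_set=match["rs"],
        rule=match["r"],
        # Assumption: PVR is numeric; float() raises ValueError otherwise.
        pvr=float(match["pvr"]),
    )
```

For instance, parse_db_reference("CBIL:AGF.12345|EVID:PD000123,RS.678,R.9012,PVR=1.35")
yields an Evidence whose rule_set is "678" and whose pvr is 1.35. Only the
DB:Reference field is handled here, because the files "generally follow"
the GO annotation format and the exact column positions should be checked
against the files themselves.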