blastx_gbnr_files/2982_blastx.Z


BLASTX 2.0a10MP-WashU [15-May-1997] [Build 16:56:44 May 19 1997]

Reference:  Gish, Warren (1994-1997).  unpublished.
Gish, Warren and David J. States (1993).  Identification of protein coding
regions by database similarity search.  Nat. Genet. 3:266-72.

Notice:  statistical significance is estimated under the assumption that the
equivalent of one entire reading frame in the query sequence codes for protein
and that significant alignments will involve only coding reading frames.

Query=  2982
        (368 letters)

  Translating both strands of query sequence in all 6 reading frames

Database:  ../_tempdbs/nrdb
           267,798 sequences; 79,948,537 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done

                                                                     Smallest
                                                                       Sum
                                                     Reading  High  Probability
  Sequences producing High-scoring Segment Pairs:        Frame Score  P(N)      N

A sp|P15577|NU2M_PARTE NADH-UBIQUINONE OXIDOREDUCTASE CH... -3    76  0.0053    2
A pir||JS0233          NADH dehydrogenase (ubiquinone) (... -3    76  0.0053    2
A gnl|PID|d1021423     (AB001684) ORF46a [Chlorella vulg... -3    74  0.045     1
A gnl|PID|d1021481     (AB001684) ORF67 [Chlorella vulga... -2    59  0.83      1
A gnl|PID|d1021500     (AB001684) ORF41c [Chlorella vulg... +3    56  0.98      1
A gnl|PID|e248975      (Z74033) F38B7.4 [Caenorhabditis ... +2    72  0.99      1
A pir||I40760          hypothetical protein 3 - Campylob... -3    55  0.995     1
A gi|456382            (U05812) 3'-end [Herpetomonas mus... -3    60  0.998     1
A gnl|PID|d1021580     (AB001684) ORF54d [Chlorella vulg... -3    53  0.99991   1
A gnl|PID|d1021467     (AB001684) ORF49b [Chlorella vulg... -3    53  0.99991   1
A gnl|PID|d1021566     (AB001684) ORF51c [Chlorella vulg... -2    53  0.99993   1



>sp|P15577|NU2M_PARTE NADH-UBIQUINONE OXIDOREDUCTASE CHAIN 2 >pir||S07734 NADH
            dehydrogenase (ubiquinone) (EC 1.6.5.3) chain 2 - Paramecium
            tetraurelia mitochondrion (SGC6) >gi|515876 (X15917) ND2 protein
            (AA 1-193) (unusual start codon) [Paramecium aurelia] >gi|1019630
            (M26930) NADH dehydrogenase subunit 2 [Paramecium aurelia]
            Length = 193

  Minus Strand HSPs:

 Score = 76 (26.8 bits), Expect = 0.0053, Sum P(2) = 0.0053
 Identities = 20/63 (31%), Positives = 32/63 (50%), Frame = -3

Query:   213 LNKXLINKRIS*VPFSXFI**SMXFWFLFFSFFLFFWF-IKLNNPKDIXSLYYXXFFFFF 37
             LN  +IN   S +    F+  ++  + L  + F FF F +K+   K +  +YY    FFF
Sbjct:    24 LNLLMINLMFSKLGAIFFL--NLALYLLALALFFFFLFNVKVALLKSVSQIYYFNNIFFF 81

Query:    36 FFFL 25
              FF+
Sbjct:    82 KFFV 85

 Score = 43 (15.1 bits), Expect = 0.0053, Sum P(2) = 0.0053
 Identities = 7/13 (53%), Positives = 9/13 (69%), Frame = -1

Query:    47 FFFFFFFYFEENL 9
             F  FFF +F+ NL
Sbjct:   105 FLIFFFLFFKTNL 117


>pir||JS0233 NADH dehydrogenase (ubiquinone) (EC 1.6.5.3) chain 2 - Paramecium
            tetraurelia mitochondrion (SGC6)
            Length = 193

  Minus Strand HSPs:

 Score = 76 (26.8 bits), Expect = 0.0053, Sum P(2) = 0.0053
 Identities = 20/63 (31%), Positives = 32/63 (50%), Frame = -3

Query:   213 LNKXLINKRIS*VPFSXFI**SMXFWFLFFSFFLFFWF-IKLNNPKDIXSLYYXXFFFFF 37
             LN  +IN   S +    F+  ++  + L  + F FF F +K+   K +  +YY    FFF
Sbjct:    24 LNLLMINLMFSKLGAIFFL--NLALYLLALALFFFFLFNVKVALLKSVSQIYYFNNIFFF 81

Query:    36 FFFL 25
              FF+
Sbjct:    82 KFFV 85

 Score = 43 (15.1 bits), Expect = 0.0053, Sum P(2) = 0.0053
 Identities = 7/13 (53%), Positives = 9/13 (69%), Frame = -1

Query:    47 FFFFFFFYFEENL 9
             F  FFF +F+ NL
Sbjct:   105 FLIFFFLFFKTNL 117


>gnl|PID|d1021423 (AB001684) ORF46a [Chlorella vulgaris]
            Length = 46

  Minus Strand HSPs:

 Score = 74 (26.0 bits), Expect = 0.046, P = 0.045
 Identities = 13/35 (37%), Positives = 19/35 (54%), Frame = -3

Query:   138 WFLFFSFFLFFWFIKLNNPKDIXSLYYXXFFFFFF 34
             +F+FFSFF  F+FIK         ++    FF F+
Sbjct:     8 FFIFFSFFFLFFFIKKKERASFAFIFLSFLFFCFY 42

 Score = 72 (25.3 bits), Expect = 0.075, P = 0.072
 Identities = 16/37 (43%), Positives = 21/37 (56%), Frame = -3

Query:   138 WFLFFSFFLFFWFIKLNNPKDIXSLYYXXFFFFFFFF 28
             +F+FFSFF  F+FIK    K+  S  +    F FF F
Sbjct:     8 FFIFFSFFFLFFFIK---KKERASFAFIFLSFLFFCF 41


>gnl|PID|d1021481 (AB001684) ORF67 [Chlorella vulgaris]
            Length = 67

  Minus Strand HSPs:

 Score = 59 (20.8 bits), Expect = 1.8, P = 0.83
 Identities = 17/47 (36%), Positives = 22/47 (46%), Frame = -2

Query:   130 FLFFLPLLLVYKIK*SQRYKXVILXXLFFF------FFFFSILRRTY 8
             FLFFL L   YK K  ++ K +    L FF      F FF +  R +
Sbjct:     2 FLFFLLLFFFYKKKKIKKIKILFFSTLLFFKAFEKNFVFFRLFFRFF 48

 Score = 56 (19.7 bits), Expect = 4.0, P = 0.98
 Identities = 12/34 (35%), Positives = 16/34 (47%), Frame = -3

Query:   132 LFFSFFLFFWFIKLNNPKDIXSLYYXXFFFFFFF 31
             +F  F L F+F K    K I  L++    FF  F
Sbjct:     1 MFLFFLLLFFFYKKKKIKKIKILFFSTLLFFKAF 34


>gnl|PID|d1021500 (AB001684) ORF41c [Chlorella vulgaris]
            Length = 41

  Plus Strand HSPs:

 Score = 56 (19.7 bits), Expect = 4.0, P = 0.98
 Identities = 11/25 (44%), Positives = 18/25 (72%), Frame = +3

Query:    18 LKIEKKKKKKKXXHSITXLYLWDYL 92
             LK +K KKKKK   ++  L+L+D++
Sbjct:    10 LKKKKIKKKKKKKENVEFLFLFDFV 34


>gnl|PID|e248975 (Z74033) F38B7.4 [Caenorhabditis elegans]
            Length = 225

  Plus Strand HSPs:

 Score = 72 (25.3 bits), Expect = 4.3, P = 0.99
 Identities = 21/64 (32%), Positives = 33/64 (51%), Frame = +2

Query:    86 LFNFI-NQKKRKKE-KKRNQXHIDYYIKX--ENGTYEIRLLIXFLFKSFLMFSLFPNFIG 253
             +F F  NQ K  KE +KR + +I  +I+   ++  Y I +   F F SF  +  +  F G
Sbjct:   113 IFKFYKNQNKTDKESRKRIKKNIYLFIQTVLQDSLYLIDISFTFYFNSFYDYRFWTFFCG 172

Query:   254 IFVY 265
              FV+
Sbjct:   173 TFVW 176


>pir||I40760 hypothetical protein 3 - Campylobacter jejuni >gi|535807 (Z36940)
            hypothetical protein [Campylobacter jejuni]
            Length = 60

  Minus Strand HSPs:

 Score = 55 (19.4 bits), Expect = 5.3, P = 0.99
 Identities = 17/43 (39%), Positives = 23/43 (53%), Frame = -3

Query:   129 FFSFF-LFFWFIKLNNPKDIXSLYYXXFFFFFFFFLF*GELIXK 1
             +FSF  L F    LN  K+    +Y  F  +F FF+F  +LI K
Sbjct:     6 YFSFLKLDFEIYHLNTSKN----FYGFFILYFSFFIF--KLIYK 43


>gi|456382 (U05812) 3'-end [Herpetomonas muscarum]
            Length = 87

  Minus Strand HSPs:

 Score = 60 (21.1 bits), Expect = 6.3, P = 1.0
 Identities = 13/36 (36%), Positives = 18/36 (50%), Frame = -3

Query:   135 FLFFSFFLFFWFIKLNNPKDIXSLYYXXFFFFFFFF 28
             F+F   FL FW ++L+    I   +   FF F  FF
Sbjct:    42 FIFSYQFLGFWVVRLHLYSCINIAFLVCFFIFILFF 77


>gnl|PID|d1021580 (AB001684) ORF54d [Chlorella vulgaris]
            Length = 54

  Minus Strand HSPs:

 Score = 53 (18.7 bits), Expect = 9.3, P = 1.0
 Identities = 8/14 (57%), Positives = 11/14 (78%), Frame = -3

Query:    63 YYXXFFFFFFFFLF 22
             ++  FFFFFF F+F
Sbjct:    21 FFLIFFFFFFLFVF 34

 Score = 53 (18.7 bits), Expect = 9.3, P = 1.0
 Identities = 12/21 (57%), Positives = 14/21 (66%), Frame = -3

Query:    81 KDIXSLYYXXFF-FFFFFFLF 22
             K + SL +  F  FFFFFFLF
Sbjct:    12 KLLFSLMFFFFLIFFFFFFLF 32


>gnl|PID|d1021467 (AB001684) ORF49b [Chlorella vulgaris]
            Length = 49

  Minus Strand HSPs:

 Score = 53 (18.7 bits), Expect = 9.3, P = 1.0
 Identities = 11/25 (44%), Positives = 13/25 (52%), Frame = -3

Query:   147 MXFWFLFFSFFLFFWFIKLNNPKDI 73
             M   F FF  F FF+F K  + K I
Sbjct:     7 MFLIFFFFLIFFFFFFYKKKDQKQI 31


>gnl|PID|d1021566 (AB001684) ORF51c [Chlorella vulgaris]
            Length = 51

  Minus Strand HSPs:

 Score = 53 (18.7 bits), Expect = 9.5, P = 1.0
 Identities = 18/45 (40%), Positives = 23/45 (51%), Frame = -2

Query:   139 LVSFLFFLPLLLVYKIK*SQRYKXVILXXLFFFFFFFSILRRTYK 5
             L+SF+F    +L   IK S      IL  L FF FFFS  ++  K
Sbjct:     8 LISFIFLEKKILW--IKKSFFLLSFILIFLIFFSFFFSKKKKKKK 50


Parameters:
  matrix=/usr/local/src/bio/blast/blast2/matrix/aa/BLOSUM62

  ctxfactor=5.98
  E=10

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   Std.    0   BLOSUM62                                 0.318   0.135   0.401
   +3      0   BLOSUM62        0.318   0.135   0.401    0.368   0.173   0.621
               q=9  r=2        0.244  0.0300   0.180
   +2      0   BLOSUM62        0.318   0.135   0.401    0.350   0.163   0.536
               q=9  r=2        0.244  0.0300   0.180
   +1      0   BLOSUM62        0.318   0.135   0.401    0.359   0.164   0.607
               q=9  r=2        0.244  0.0300   0.180
   -1      0   BLOSUM62        0.318   0.135   0.401    0.375   0.175   0.628
               q=9  r=2        0.244  0.0300   0.180
   -2      0   BLOSUM62        0.318   0.135   0.401    0.360   0.164   0.580
               q=9  r=2        0.244  0.0300   0.180
   -3      0   BLOSUM62        0.318   0.135   0.401    0.375   0.179   0.725
               q=9  r=2        0.244  0.0300   0.180

  Query
  Frame  MatID  Length  Eff.Length   E    S W   T  X     E2  S2
   +3      0      122       112      10. 60 3  12 22    0.12 32
                                                  29   0.096 35
   +2      0      122       113      10. 60 3  12 22    0.12 32
                                                  29   0.097 35
   +1      0      122       113      10. 60 3  12 22    0.12 32
                                                  29   0.097 35
   -1      0      122       113      10. 60 3  12 22    0.12 32
                                                  29   0.097 35
   -2      0      122       113      10. 60 3  12 22    0.12 32
                                                  29   0.097 35
   -3      0      122       112      10. 60 3  12 22    0.12 32
                                                  29   0.096 35


Statistics:

  Database:  ../_tempdbs/nrdb
    Title:  ../_tempdbs/nrdb
    Release date:  unknown
    Posted date:  1:59 PM EST Feb 4, 1998
    Format:  BLAST
  # of letters in database:  79,948,537
  # of sequences in database:  267,798
  # of database sequences satisfying E:  11
  No. of states in DFA:  581 (57 KB)
  Total size of DFA:  141 KB (192 KB)
  Time to generate neighborhood:  0.02u 0.00s 0.02t  Elapsed: 00:00:00
  No. of processors used:  6
  Search cpu time:  76.72u 0.69s 77.41t  Elapsed: 00:00:20
  Total cpu time:  76.76u 0.72s 77.48t  Elapsed: 00:00:20
  Start:  Sat Feb  7 04:02:43 1998   End:  Sat Feb  7 04:03:03 1998

Date: 16:18:58 on Sun 17 Dec 2017.
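
Note on reading the scores: the bit scores and P values printed in this report can be cross-checked from the raw scores and Expect values. The sketch below is an illustration, not WU-BLAST's own code. It assumes that bits = S * Lambda / ln 2 with Lambda = 0.244 (the value on the "q=9  r=2" rows of the Parameters table, chosen because it reproduces the printed bits), and that P = 1 - exp(-E), the usual Poisson relationship between an Expect value and a probability. The function names are hypothetical.

```python
import math

# Assumption: Lambda from the "q=9  r=2" rows of the Parameters table.
# It is used here because it reproduces the bit scores printed in the report.
LAMBDA_USED = 0.244


def bits_from_raw(score, lam=LAMBDA_USED):
    """Convert a raw HSP score to bits as S * Lambda / ln 2 (assumed form)."""
    return score * lam / math.log(2)


def p_from_expect(expect):
    """Convert an Expect value to a probability, P = 1 - exp(-E)."""
    # -expm1(-E) equals 1 - exp(-E) but keeps precision for small E.
    return -math.expm1(-expect)


if __name__ == "__main__":
    # (raw score, bits as printed in the report)
    for raw, printed in [(76, 26.8), (74, 26.0), (43, 15.1), (53, 18.7)]:
        print(f"S={raw:3d}  computed={bits_from_raw(raw):.1f}  printed={printed}")

    # (Expect, P as printed in the report)
    for expect, printed in [(0.046, 0.045), (1.8, 0.83), (9.3, 0.99991)]:
        print(f"E={expect:<6} computed={p_from_expect(expect):.5f}  printed={printed}")
```

Under these assumptions the computed values agree with the printed ones to the precision shown. Hits with Sum P(2) are scored over two HSPs together, so the single-HSP conversion above does not apply to their P values.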