Command line Training Set First Motif Summary of Motifs Termination Explanation


MEME - Motif discovery tool

MEME version 3.0 (Release date: 2004/07/26 08:17:15)

For further information on how to interpret these results or to get a copy of the MEME software please access http://meme.sdsc.edu.

This file may be used as input to the MAST algorithm for searching sequence databases for matches to groups of motifs. MAST is available for interactive use and downloading at http://meme.sdsc.edu.


REFERENCE

If you use this program in your research, please cite:

Timothy L. Bailey and Charles Elkan, "Fitting a mixture model by expectation maximization to discover motifs in biopolymers", Proceedings of the Second International Conference on Intelligent Systems for Molecular Biology, pp. 28-36, AAAI Press, Menlo Park, California, 1994.


TRAINING SET

DATAFILE= CBF1_YPD.fsa
ALPHABET= ACGT
Sequence name            Weight Length  Sequence name            Weight Length  
-------------            ------ ------  -------------            ------ ------  
iYJR009C-1               1.0000    724  iYFR029W                 1.0000    329  
iYLL010C                 1.0000    571  iYGR129W                 1.0000    331  
iYKL192C                 1.0000    510  iYIL127C                 1.0000    350  
iYJL210W                 1.0000    263  iYNL095C                 1.0000    765  
iYGR203W                 1.0000    262  iYBR049C                 1.0000    393  

COMMAND LINE SUMMARY

This information can also be useful in the event you wish to report a
problem with the MEME software.

command: meme CBF1_YPD.fsa -dna -nmotifs 5 -minw 7 -maxw 11 -revcomp 

model:  mod=         zoops    nmotifs=         5    evt=           inf
object function=  E-value of product of p-values
width:  minw=            7    maxw=           11    minic=        0.00
width:  wg=             11    ws=              1    endgaps=       yes
nsites: minsites=        2    maxsites=       10    wnsites=       0.8
theta:  prob=            1    spmap=         uni    spfuzz=        0.5
em:     prior=   dirichlet    b=            0.01    maxiter=        50
        distance=    1e-05
data:   n=            4498    N=              10
strands: + -
sample: seed=            0    seqfrac=         1
Letter frequencies in dataset:
A 0.311 C 0.189 G 0.189 T 0.311 
Background letter frequencies (from dataset with add-one prior applied):
A 0.311 C 0.189 G 0.189 T 0.311 

P N
MOTIF 1     width = 10     sites = 10     llr = 127     E-value = 1.8e-009

SimplifiedA:a::::a::6
pos.-specificCa:a::::883
probabilityG:::a:a::11
matrixT::::a::21:
bits 2.4    
2.2    
1.9    
1.7       
Information 1.4         
content 1.2         
(18.3 bits)1.0         
0.7          
0.5          
0.2          
0.0
Multilevel CACGTGACCA
consensus TC
sequence
NAME STRAND START P-VALUE    SITES 
iYLL010C-3944.23e-07 GTTCTAAGACCACGTGACCAGAAAAGGACG
iYFR029W+974.23e-07 CTTATAGAAGCACGTGACCACAATTCACCC
iYJR009C-1+3804.23e-07 AAAAAAAGGTCACGTGACCAGAAAAGTCAC
iYNL095C-3246.79e-07 TTTGCGCGACCACGTGACCCGCTGCGGCTG
iYIL127C-1476.79e-07 TGAATGTATACACGTGACCCTAATGCTCTT
iYJL210W-619.35e-07 GACCCTTTACCACGTGACCGATTTTGACAA
iYBR049C+3011.63e-06 TGCTATATATCACGTGATCAATTTTTCCGA
iYGR203W+171.63e-06 TACACAATCGCACGTGATCATATATTTGCC
iYGR129W+732.48e-06 CATCGACCGTCACGTGACGAGCTGAGGTTA
iYKL192C+1843.86e-06 TGATATATGTCACGTGACTCTGATAAAACT

Motif 1 block diagrams

NameLowest
p-value
   Motifs
iYLL010C 4.2e-07

-1
iYFR029W 4.2e-07

+1
iYJR009C-1 4.2e-07

+1
iYNL095C 6.8e-07

-1
iYIL127C 6.8e-07

-1
iYJL210W 9.4e-07

-1
iYBR049C 1.6e-06

+1
iYGR203W 1.6e-06

+1
iYGR129W 2.5e-06

+1
iYKL192C 3.9e-06

+1
SCALE
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
1 25 50 75 100 125 150 175 200 225 250 275 300 325 350 375 400 425 450 475 500 525 550 575 600 625 650 675 700 725 750

Motif 1 in BLOCKS format

BL   MOTIF 1 width=10 seqs=10
iYLL010C                 (  394) CACGTGACCA  1 
iYFR029W                 (   97) CACGTGACCA  1 
iYJR009C-1               (  380) CACGTGACCA  1 
iYNL095C                 (  324) CACGTGACCC  1 
iYIL127C                 (  147) CACGTGACCC  1 
iYJL210W                 (   61) CACGTGACCG  1 
iYBR049C                 (  301) CACGTGATCA  1 
iYGR203W                 (   17) CACGTGATCA  1 
iYGR129W                 (   73) CACGTGACGA  1 
iYKL192C                 (  184) CACGTGACTC  1 
//


Motif 1 position-specific scoring matrix

log-odds matrix: alength= 4 w= 10 n= 4408 bayes= 10.2567 E= 1.8e-009 
  -997    241   -997   -997 
   168   -997   -997   -997 
  -997    241   -997   -997 
  -997   -997    241   -997 
  -997   -997   -997    168 
  -997   -997    241   -997 
   168   -997   -997   -997 
  -997    208   -997    -64 
  -997    208    -91   -164 
    95     67    -91   -997 

Motif 1 position-specific probability matrix

letter-probability matrix: alength= 4 w= 10 nsites= 10 E= 1.8e-009 
 0.000000  1.000000  0.000000  0.000000 
 1.000000  0.000000  0.000000  0.000000 
 0.000000  1.000000  0.000000  0.000000 
 0.000000  0.000000  1.000000  0.000000 
 0.000000  0.000000  0.000000  1.000000 
 0.000000  0.000000  1.000000  0.000000 
 1.000000  0.000000  0.000000  0.000000 
 0.000000  0.800000  0.000000  0.200000 
 0.000000  0.800000  0.100000  0.100000 
 0.600000  0.300000  0.100000  0.000000 





Time  3.77 secs.


P N
MOTIF 2     width = 9     sites = 10     llr = 94     E-value = 5.5e+003

SimplifiedAa99a5a2aa
pos.-specificC::1::::::
probabilityG:1::5:8::
matrixT:::::::::
bits 2.4
2.2
1.9
1.7     
Information 1.4      
content 1.2        
(13.6 bits)1.0         
0.7         
0.5         
0.2         
0.0
Multilevel AAAAAAGAA
consensus GA
sequence
NAME STRAND START P-VALUE    SITES 
iYKL192C-991.01e-05 GGTTCTGAGTAAAAGAGAAAGGTGGTGAT
iYGR129W-21.01e-05 TATTCTAAATAAAAGAGAAG
iYGR203W-622.68e-05 GATAAAGAGGAAAAAAGAACAAAATTGTC
iYJL210W+1132.68e-05 TGCTCAAGAAAAAAAAGAAACAATAAAGA
iYIL127C+1962.68e-05 CCCCGTGGCAAAAAAAGAAGACACCAATT
iYLL010C+3112.68e-05 CTTTCCAGAAAAAAAAGAAACAAAGCAAG
iYJR009C-1-6332.68e-05 ATCACAAGCTAAAAAAGAACCAAGTTTAC
iYNL095C-233.90e-05 TATAGAATCAAACAGAGAAAGGGATTGAA
iYFR029W+375.57e-05 TACAGGTATTAAAAGAAAACACTATCAAC
iYBR049C+3301.27e-04 AGGAGCTTCTAGAAGAAAATGAGACACAA

Motif 2 block diagrams

NameLowest
p-value
   Motifs
iYKL192C 1e-05

-2
iYGR129W 1e-05

-2
iYGR203W 2.7e-05

-2
iYJL210W 2.7e-05

+2
iYIL127C 2.7e-05

+2
iYLL010C 2.7e-05

+2
iYJR009C-1 2.7e-05

-2
iYNL095C 3.9e-05

-2
iYFR029W 5.6e-05

+2
iYBR049C 0.00013

+2
SCALE
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
1 25 50 75 100 125 150 175 200 225 250 275 300 325 350 375 400 425 450 475 500 525 550 575 600 625 650 675 700 725 750

Motif 2 in BLOCKS format

BL   MOTIF 2 width=9 seqs=10
iYKL192C                 (   99) AAAAGAGAA  1 
iYGR129W                 (    2) AAAAGAGAA  1 
iYGR203W                 (   62) AAAAAAGAA  1 
iYJL210W                 (  113) AAAAAAGAA  1 
iYIL127C                 (  196) AAAAAAGAA  1 
iYLL010C                 (  311) AAAAAAGAA  1 
iYJR009C-1               (  633) AAAAAAGAA  1 
iYNL095C                 (   23) AACAGAGAA  1 
iYFR029W                 (   37) AAAAGAAAA  1 
iYBR049C                 (  330) AGAAGAAAA  1 
//


Motif 2 position-specific scoring matrix

log-odds matrix: alength= 4 w= 9 n= 4418 bayes= 9.03604 E= 5.5e+003 
   168   -997   -997   -997 
   153   -997    -91   -997 
   153    -91   -997   -997 
   168   -997   -997   -997 
    68   -997    141   -997 
   168   -997   -997   -997 
   -64   -997    208   -997 
   168   -997   -997   -997 
   168   -997   -997   -997 

Motif 2 position-specific probability matrix

letter-probability matrix: alength= 4 w= 9 nsites= 10 E= 5.5e+003 
 1.000000  0.000000  0.000000  0.000000 
 0.900000  0.000000  0.100000  0.000000 
 0.900000  0.100000  0.000000  0.000000 
 1.000000  0.000000  0.000000  0.000000 
 0.500000  0.000000  0.500000  0.000000 
 1.000000  0.000000  0.000000  0.000000 
 0.200000  0.000000  0.800000  0.000000 
 1.000000  0.000000  0.000000  0.000000 
 1.000000  0.000000  0.000000  0.000000 





Time  7.18 secs.


P N
MOTIF 3     width = 10     sites = 2     llr = 30     E-value = 1.5e+005

SimplifiedA::::::::::
pos.-specificC:aa55::::a
probabilityGa::55a:aa:
matrixT::::::a:::
bits 2.4       
2.2       
1.9       
1.7        
Information 1.4          
content 1.2          
(21.3 bits)1.0          
0.7          
0.5          
0.2          
0.0
Multilevel GCCCCGTGGC
consensus GG
sequence
NAME STRAND START P-VALUE    SITES 
iYBR049C-2463.76e-07 TTGGGCCCATGCCGGGTGGCAAACCAAAGG
iYIL127C+1853.76e-07 TCTAGTACCCGCCCCGTGGCAAAAAAAGAA

Motif 3 block diagrams

NameLowest
p-value
   Motifs
iYBR049C 3.8e-07

-3
iYIL127C 3.8e-07

+3
SCALE
| | | | | | | | | | | | | | | |
1 25 50 75 100 125 150 175 200 225 250 275 300 325 350 375

Motif 3 in BLOCKS format

BL   MOTIF 3 width=10 seqs=2
iYBR049C                 (  246) GCCGGGTGGC  1 
iYIL127C                 (  185) GCCCCGTGGC  1 
//


Motif 3 position-specific scoring matrix

log-odds matrix: alength= 4 w= 10 n= 4408 bayes= 11.1053 E= 1.5e+005 
  -765   -765    240   -765 
  -765    240   -765   -765 
  -765    240   -765   -765 
  -765    140    140   -765 
  -765    140    140   -765 
  -765   -765    240   -765 
  -765   -765   -765    168 
  -765   -765    240   -765 
  -765   -765    240   -765 
  -765    240   -765   -765 

Motif 3 position-specific probability matrix

letter-probability matrix: alength= 4 w= 10 nsites= 2 E= 1.5e+005 
 0.000000  0.000000  1.000000  0.000000 
 0.000000  1.000000  0.000000  0.000000 
 0.000000  1.000000  0.000000  0.000000 
 0.000000  0.500000  0.500000  0.000000 
 0.000000  0.500000  0.500000  0.000000 
 0.000000  0.000000  1.000000  0.000000 
 0.000000  0.000000  0.000000  1.000000 
 0.000000  0.000000  1.000000  0.000000 
 0.000000  0.000000  1.000000  0.000000 
 0.000000  1.000000  0.000000  0.000000 





Time 10.59 secs.


P N
MOTIF 4     width = 10     sites = 2     llr = 30     E-value = 1.1e+005

SimplifiedA::::::a:a:
pos.-specificC::::aa:a::
probabilityGaaa::::::a
matrixT:::a::::::
bits 2.4       
2.2       
1.9       
1.7          
Information 1.4          
content 1.2          
(21.9 bits)1.0          
0.7          
0.5          
0.2          
0.0
Multilevel GGGTCCACAG
consensus
sequence
NAME STRAND START P-VALUE    SITES 
iYBR049C+1262.56e-07 ACTTTAGGTTGGGTCCACAGCAGGGTAATA
iYJR009C-1+2382.56e-07 TAAGGTGAGGGGGTCCACAGATATAACATC

Motif 4 block diagrams

NameLowest
p-value
   Motifs
iYBR049C 2.6e-07

+4
iYJR009C-1 2.6e-07

+4
SCALE
| | | | | | | | | | | | | | | | | | | | | | | | | | | | |
1 25 50 75 100 125 150 175 200 225 250 275 300 325 350 375 400 425 450 475 500 525 550 575 600 625 650 675 700

Motif 4 in BLOCKS format

BL   MOTIF 4 width=10 seqs=2
iYBR049C                 (  126) GGGTCCACAG  1 
iYJR009C-1               (  238) GGGTCCACAG  1 
//


Motif 4 position-specific scoring matrix

log-odds matrix: alength= 4 w= 10 n= 4408 bayes= 11.1053 E= 1.1e+005 
  -765   -765    240   -765 
  -765   -765    240   -765 
  -765   -765    240   -765 
  -765   -765   -765    168 
  -765    240   -765   -765 
  -765    240   -765   -765 
   168   -765   -765   -765 
  -765    240   -765   -765 
   168   -765   -765   -765 
  -765   -765    240   -765 

Motif 4 position-specific probability matrix

letter-probability matrix: alength= 4 w= 10 nsites= 2 E= 1.1e+005 
 0.000000  0.000000  1.000000  0.000000 
 0.000000  0.000000  1.000000  0.000000 
 0.000000  0.000000  1.000000  0.000000 
 0.000000  0.000000  0.000000  1.000000 
 0.000000  1.000000  0.000000  0.000000 
 0.000000  1.000000  0.000000  0.000000 
 1.000000  0.000000  0.000000  0.000000 
 0.000000  1.000000  0.000000  0.000000 
 1.000000  0.000000  0.000000  0.000000 
 0.000000  0.000000  1.000000  0.000000 





Time 14.09 secs.


P N
MOTIF 5     width = 9     sites = 2     llr = 27     E-value = 2.0e+005

SimplifiedA:a:5:::::
pos.-specificC:::::a:a:
probabilityGa:a5a:a:a
matrixT:::::::::
bits 2.4       
2.2       
1.9       
1.7        
Information 1.4        
content 1.2        
(19.6 bits)1.0         
0.7         
0.5         
0.2         
0.0
Multilevel GAGAGCGCG
consensus G
sequence
NAME STRAND START P-VALUE    SITES 
iYNL095C+3514.98e-07 AAATGATGATGAGGGCGCGGTCTAACTTT
iYGR203W-1181.32e-06 ACGGGTTATGGAGAGCGCGTATATAATGA

Motif 5 block diagrams

NameLowest
p-value
   Motifs
iYNL095C 5e-07

+5
iYGR203W 1.3e-06

-5
SCALE
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
1 25 50 75 100 125 150 175 200 225 250 275 300 325 350 375 400 425 450 475 500 525 550 575 600 625 650 675 700 725 750

Motif 5 in BLOCKS format

BL   MOTIF 5 width=9 seqs=2
iYNL095C                 (  351) GAGGGCGCG  1 
iYGR203W                 (  118) GAGAGCGCG  1 
//


Motif 5 position-specific scoring matrix

log-odds matrix: alength= 4 w= 9 n= 4418 bayes= 11.1085 E= 2.0e+005 
  -765   -765    240   -765 
   168   -765   -765   -765 
  -765   -765    240   -765 
    68   -765    140   -765 
  -765   -765    240   -765 
  -765    240   -765   -765 
  -765   -765    240   -765 
  -765    240   -765   -765 
  -765   -765    240   -765 

Motif 5 position-specific probability matrix

letter-probability matrix: alength= 4 w= 9 nsites= 2 E= 2.0e+005 
 0.000000  0.000000  1.000000  0.000000 
 1.000000  0.000000  0.000000  0.000000 
 0.000000  0.000000  1.000000  0.000000 
 0.500000  0.000000  0.500000  0.000000 
 0.000000  0.000000  1.000000  0.000000 
 0.000000  1.000000  0.000000  0.000000 
 0.000000  0.000000  1.000000  0.000000 
 0.000000  1.000000  0.000000  0.000000 
 0.000000  0.000000  1.000000  0.000000 





Time 17.57 secs.


P N
SUMMARY OF MOTIFS


Combined block diagrams: non-overlapping sites with p-value < 0.0001

NameCombined
p-value
   Motifs
iYJR009C-1 1.34e-05

+4
+1
-1
-2
iYFR029W 1.38e-03

+2
+1
iYLL010C 4.39e-03

+2
-1
-1
iYGR129W 4.10e-03

-2
+1
+2
iYKL192C 5.76e-03

-2
+1
-2
iYIL127C 2.79e-06

-1
+1
+3
+2
+2
iYJL210W 2.81e-03

-1
+2
iYNL095C 1.21e-04

-2
-1
+5
iYGR203W 1.99e-05

+1
-2
-5
iYBR049C 1.36e-07

+4
-3
+1
SCALE
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
1 25 50 75 100 125 150 175 200 225 250 275 300 325 350 375 400 425 450 475 500 525 550 575 600 625 650 675 700 725 750

Stopped because nmotifs = 5 reached.


CPU: ncc64010


EXPLANATION OF MEME RESULTS

The MEME results consist of:

MOTIFS

For each motif that it discovers in the training set, MEME prints the following information:


Go to top