FILTERS:
pvalue:0.1
Annotation-based Co-occurring Motif Pattern: crmMmus988 (988 (POU2F1__Pax-5))

This page describes one module in the cisRED database, and is divided into three sections.
  1. Module overview
    module size, module composition, and module discovery parameter settings.
  2. Module details
    atomic motifs in this module, and a description and Gene Ontology (GO) information about the search regions which contain this module.
  3. Definition of terms
    the types of objects contained in this database, and how these objects relate to each other.

Overview of module crmMmus988 (988 (POU2F1__Pax-5))

Your filter settings are not applied to the contents of these tables.
Parameters used for module discovery
Maximum allowable annotation p-value 0.0004
Maximum allowable discovery p-value 0.005
Window width 300 bp
Minimum distance between motif starts 10 bp
Module must be found in at least this many search regions 10

The number of times a group name appears under the heading 'Groups in this module' indicates the number of motifs from that group that must participate in each instance of the module.
Module summary statistics
Module ID (crmMmus) 988
Module size 2-way
Groups in this module crtMmus: 200135 (POU2F1)    200143 (Pax-5)   
Number of instances of this module 23
Number of search regions containing instances of this module 23
GO term FDR threshold 0.01
Total number of GO terms with FDR < threshold assigned to genes containing this module 0

Motifs in module crmMmus988 (988 (POU2F1__Pax-5))

Each row in this table represents one instance of the module. Each instance is a set of atomic motifs which all appear within the same search region. Where available, functional information about the gene associated with each a promoter-based search region is also provided.

Aside from viewing the module contents as presented here, you can:

Yellow is used in this table to highlight atomic motif craMmus69378, from which you requested this page.

Your filter settings are not applied to the contents of this table.
Motifs in group Motifs in group Gene / Region Gene / Region description
craMmus82443
craMmus82482
craMmus82521
craMmus82553
craMmus82236 ENSMUSG00000021453 growth arrest and DNA-damage-inducible 45 gamma [Source:MarkerSymbol;Acc:MGI:1346325]
craMmus106701 craMmus106187
craMmus106219
craMmus106244
craMmus106279
craMmus106310
craMmus106341
craMmus106372
craMmus106400
craMmus106427
craMmus106449
craMmus106476
craMmus106505
craMmus106527
craMmus106551
craMmus106583
craMmus106606
ENSMUSG00000025280 polymerase (RNA) III (DNA directed) polypeptide A [Source:MarkerSymbol;Acc:MGI:2681836]
craMmus143345
craMmus144029
craMmus144029
craMmus144059
ENSMUSG00000030507 developing brain homeobox 1 [Source:MarkerSymbol;Acc:MGI:94867]
craMmus60388
craMmus60444
craMmus59249 ENSMUSG00000036760
craMmus330152
craMmus330226
craMmus331513
craMmus331589
craMmus331634
craMmus331669
craMmus331708
craMmus331776
craMmus331806
craMmus331840
craMmus331870
craMmus331903
craMmus331944
craMmus331979
craMmus332018
craMmus332063
craMmus332093
craMmus332123
craMmus332160
craMmus332194
craMmus332235
craMmus332272
ENSMUSG00000042258 ISL1 transcription factor, LIM/homeodomain [Source:MarkerSymbol;Acc:MGI:101791]
craMmus32772 craMmus32749 ENSMUSG00000046272 olfactory receptor 1444 [Source:MarkerSymbol;Acc:MGI:3031278]
craMmus344486 craMmus344531
craMmus344551
craMmus344576
craMmus344608
craMmus344775
craMmus344799
craMmus344846
ENSMUSG00000048001 hairy and enhancer of split 5 (Drosophila) [Source:MarkerSymbol;Acc:MGI:104876]
craMmus345882
craMmus345922
craMmus346415
craMmus346445
craMmus346475
ENSMUSG00000048216 G protein-coupled receptor 85 [Source:MarkerSymbol;Acc:MGI:1927851]
craMmus253857 craMmus253892
craMmus253930
craMmus253969
craMmus254007
craMmus254041
craMmus254072
craMmus254097
craMmus254126
craMmus254159
craMmus254193
craMmus254250
craMmus254273
ENSMUSG00000049234 olfactory receptor 101 [Source:MarkerSymbol;Acc:MGI:2177484]
craMmus69346
craMmus69378
craMmus68867
craMmus68896
craMmus68929
craMmus68960
craMmus68993
ENSMUSG00000055639 dachshund 1 (Drosophila) [Source:MarkerSymbol;Acc:MGI:1277991]
craMmus400188 craMmus400414 ENSMUSG00000058621 olfactory receptor 643 [Source:MarkerSymbol;Acc:MGI:3030477]
craMmus422636
craMmus422671
craMmus422768
craMmus423388
craMmus423424
ENSMUSG00000062987 olfactory receptor 715 [Source:MarkerSymbol;Acc:MGI:3030549]
craMmus424724
craMmus425120
craMmus425199
craMmus425231
craMmus424260 ENSMUSG00000063452 olfactory receptor 800 [Source:MarkerSymbol;Acc:MGI:3030634]
craMmus426442 craMmus426343 ENSMUSG00000063760
craMmus425162 craMmus424735 ENSMUSG00000063804
craMmus425045 craMmus425263
craMmus425289
craMmus425311
craMmus425343
craMmus425364
craMmus425387
ENSMUSG00000063949 olfactory receptor 205 [Source:MarkerSymbol;Acc:MGI:3030039]
craMmus430807
craMmus430838
craMmus430872
craMmus430912
craMmus430953
craMmus430992
craMmus431070
craMmus431103
craMmus431138
craMmus431173
craMmus430418
craMmus430454
craMmus430486
craMmus430594
craMmus432525
craMmus432568
ENSMUSG00000066753 dynamin 3, opposite strand [Source:MarkerSymbol;Acc:MGI:3052332]
craMmus430585
craMmus430615
craMmus430700
craMmus430897
craMmus430924
ENSMUSG00000067028 RIKEN cDNA C230078M14Rik gene [Source:MarkerSymbol;Acc:MGI:3606070]
craMmus218659
craMmus218683
craMmus218703
craMmus218726
craMmus218748
craMmus218771
craMmus218853 ENSMUSG00000068819 olfactory receptor 1131 [Source:MarkerSymbol;Acc:MGI:3030965]
craMmus441518
craMmus441562
craMmus441609
craMmus441656
craMmus441703
craMmus440970
craMmus441011
craMmus441052
craMmus441099
craMmus441146
craMmus441192
craMmus441236
craMmus441273
ENSMUSG00000069662 myristoylated alanine rich protein kinase C substrate [Source:MarkerSymbol;Acc:MGI:96907]
craMmus440088 craMmus440000 ENSMUSG00000070377 olfactory receptor 43 [Source:MarkerSymbol;Acc:MGI:1333770]
craMmus171584 craMmus171493
craMmus171517
ENSMUSG00000070460 olfactory receptor 291 [Source:MarkerSymbol;Acc:MGI:3030125]
craMmus441404 craMmus440886
craMmus440931
craMmus440969
ENSMUSG00000070491

Definition of terms

Each cisRED database is based on a target genome, which is the genome of a target species, such as human, mouse, rat, or c.elegans.

The cisRED pipeline begins with the motif discovery stage, which operates on a large number of independent input sequence sets. An input sequence set consists of a sequence on the target genome, called the search region, and corresponding sequences from many other species. Search regions are generally 2-3k in size and are often located near a gene's transcription start site.

An atomic motif (or motif), is a set (or 'stack') of conserved sequences from some or all species of an input sequence set. A motif target sequence is the sequence that the target genome contributes to a motif. A motif can be represented by its sequence set/stack, or can be represented in a simplified form as a consensus sequence, a position frequency matrix, or a sequence logo.

A motif group (or group) is a set of similar motif target sequences. cisRED identifies groups using two different approaches: annotation and 'de novo'. Each motif group is labeled with a unique ID number. Currently, we consider annotation-based groups and modules to be more reliable than 'de novo' groups and modules. Work to improve genome-scale 'de novo' grouping is ongoing.

'De novo' groups of similar motif target sequences were identified by calculating edit distance dissimilarities between pairs of site sequences, for all significant site sequences, then hierarchically clustering the resulting similarity matrix using the OPTICS algorithm. An atomic motif is assigned to zero or one 'de novo' group.
Annotation-based groups of similar motif target sequences were identified by annotating against known site sequences (rather than against alignment matrices) in TRANSFAC, JASPAR, and ORegAnno. An atomic motif can belong to any number of annotation-based groups.

A module or pattern is a set of co-occuring motif group labels. A module is called either 'annotation-based' or 'de novo' depending on the type of groups it contains. (Currently, all groups in a module must be of the same type.) The size of a module is the number of co-occurring groups in that module. More precisely, a module may contain one or more references to a group label, and each group label reference counts towards the size of the group. A module of size N is called an N-way module.

A module instance (or pattern instance) is a list of motifs, all found in the same search region and in close proximity to each other, in which the motif target sequence of each motif in this list has been assigned to be in at least one of the module's group labels.

The module discovery process is based on a number of parameters, including the size of the 'window' (measured in bp), within which all the motifs in a module instance must be found, and the minimum number of search regions that must contain instances of the module.

Questions or comments: cisred@bcgsc.ca