FILTERS:
pvalue:0.1
Search by Symbol

Search for any of the following:
      - Ensembl gene IDs (e.g. ENSMUSG00000000001)
      - gene symbols (e.g. NP_001013818.1, Uty)
      - cisRED motif IDs (e.g. craMmus1)
      - cisRED group IDs (e.g. crtMmus4000)

Search by Sequence

Find motif sequences on either strand that contain an IUPAC query sequence, e.g. TAARNGCMT.

Search Groups & Modules
Find associated with at least one of these JASPAR, TRANSFAC or ORegAnno models:
Browse All Regions

View a list of all the search regions in this database, and the number of motifs found in each region.

Browse by Location

View a list of all the search regions in this database found in the specified genomic region.

chr: from to
Browse Groups & Modules

Browse summary information about motif groups or modules in this database.

Show

About the Mouse 4 Database

Usage note: Filters and cookies
cisRED manages your 'Filter' settings via a browser 'cookie'. You must allow your web browser to accept cookies from cisred.org for your filter settings to take effect.

The cisRED mouse v4 database holds ~223K conserved DNA sequence motifs (empirical p-value < 0.1) in promoter regions of ~17.5K target genes (Ensembl v47, NCBI m37, mm9).

The v4 motifs were generated by transforming the coordinates of the motifs in the mouse v3.1 database from the NCBI m35 genome build to the m37 genome build, using UCSC's web lift-over tool. In the conversion, 86 motifs (< 0.04% of the ~223k v3.1 motifs) were lost. At this point, we have not rerun the pattern discovery algorithm, so the v4 database contains no regulatory module predictions.

The v3.1 motifs were computationally discovered in comparative genomics sequence sets consisting of regions around the transcription start site (TSS) of a single, canonical transcript for each gene, and corresponding regions from other species. Input sequence sets were generated from genome sequence data for 38 vertebrate species (34 of which were mammalian), using data from Ensembl, ENCODE and low coverage read files. A typical input sequence set contained genome sequence data 16 vertebrate species. Search regions for motif discovery were -1.5Kb/+200b relative to a TSS, net of most types of repeats and of coding sequences, which were masked.

An overview of the pipeline and database is available in an NAR 2006 publication.

Access to this database is not available at db.cisred.org at this time.

Direct SQL queries can be run against the database cisred_Mmus_4 at db.cisred.org, with the username 'anonymous', and no password.

Questions or comments: cisred@bcgsc.ca