FILTERS:
pvalue:1.0
Region for Stat1 Peak: Stat1_PK_23
Go to UCSC

This page describes one search region that corresponds to a peak generated by chromatin immunoprecipication and massively parallel Solexa sequencing.

Page Contents

Overview
Genomic Context
Image of Motifs and Modules
Motifs Found in this Region
Search region overview
Search region location chr5:131,860,350-131,860,950 (+) (601 bp)
Assembly Human, Ensembl v35 (NCBI 35)
Region ID Stat1_PK_23
Modules
(Co-occurring motif patterns)
451 annotation-based module(s)
No 'de novo' modules exist in this database.

Genomic Context: Human genome assembly from Aug 2004. (UCSC hg17, NCBI 35, Ensembl v35)

The image shown below was created to provide context for this search region by showing a 100000 bp span of the genome, centered on this search region.

UCSC Genome Browser Image    
  • The first track shows the location of the search region in red.
  • The next two (or four) green tracks show the extent of the ChIP-seq data reported in this image, and the height of the ChIP-seq peaks located within that extent.
  • The last track shows known gene locations in this 100000 bp span of the genome.

This is a ChIP-seq-based search region and thus gene locations were not a factor in selecting the location of this region. However, the identity of any genes that overlap with, or are closest to, this search region are still of interest, and are described in the table below.

Gene near this search region
Ensembl ID ENSG00000113522 (RAD50) (NM_005732) (NM_133482)
transcripts:
1) ENST00000265335
Gene description DNA repair protein RAD50 (EC 3.6.-.-) (hRAD50). [Source:Uniprot/SWISSPROT;Acc:Q92878]
Gene type protein_coding
Transcript location(s) 1) chr5: 131,920,529-132,007,492 (+) (86,964 bp)
Distance from the region's midpoint to each transcript's TSS and TTS: 1) to TSS: -59,877, to TTS: -146,840
Gene near this search region
Ensembl ID ENSG00000184204
transcripts:
1) ENST00000331619
Gene type protein_coding
Transcript location(s) 1) chr5: 136,850-138,430 (-) (1,581 bp)
Distance from the region's midpoint to each transcript's TSS and TTS: 1) to TSS: -131,722,220, to TTS: -131,723,800
The distance from a cisred search region to a gene, is measured from the center of the search region to either the transcription termination site (TTS) or the transcription start site (TSS), whichever is closer.
A distance is negative if the TTS/TTS is downstream of the search region's center, relative to the gene's strand.
A distance is positive if the TTS/TTS is upstream of the search region's center, relative to the gene's strand.

A transcript's location is always reported relative to the positive strand.

Image of Motifs and Modules

The image shown below illustrates the location of the motifs and modules within this search region.
However, the number of modules in a region may exceed the genome browser's track limit. Thus, for each type of module (pattern), the text in the yellow box below indicates whether or not any modules of that type exist, and if so, whether or not all the modules of that type are displayed. (If it indicates that only the top 20 modules are displayed, this ranking is based on the number of instances of each module.)

User-specified filter settings do not affect the features displayed in the UCSC genome browser view.

About the patterns shown in this image:
Note: This search region contains more patterns than are displayed in this UCSC genome browser image.
There are no 'de novo' patterns to show.
Only the top 20 ranked (TRANSFAC) annotation-based patterns are shown.
There are no (JASPAR) annotation-based patterns to show.

UCSC Genome Browser Image
Click on the image to view this region at UCSC.
    This image uses the Human genome assembly from Aug 2004. (UCSC hg17, NCBI 35, Ensembl v35) and contains a number of custom tracks followed by several UCSC tracks. The custom tracks show:
  • The long red and gray bar at the top shows the nominal 'search region' within which comparative genomics discovery methods were applied. Motif predictions were not made in coding exons and most types of repeats. In this database, the locations of masks applied to the search region are shown in gray.
  • The numbered brown blocks are 'atomic' motifs, i.e. conserved DNA sequence motifs that were identified by discovery methods and post-processing operations. Motifs are shaded to indicate the discovery p-value; a darker motif was more significant at the discovery stage.
  • Following the motif discovery stage, motifs were filtered by membership in co-occurring patterns, and patterns were ranked by genome-scale properties. Motifs instances that occur in highly-ranked putative regulatory modules may be more reliable predictions of functional genomic elements.
  • The connected sets of blue boxes are co-occurring patterns, i.e. putative regulatory modules. Modules are shaded to indicate the number of times that a pattern is found in the target genome; a darker module is associated with more search regions. An pattern is called either 'de novo' or annotation-based, depending on the type of groups that appear in the pattern. (see next point)
  • Before patterns (i.e. modules) can be identified from atomic motifs, groups of similar motifs must be found; modules are then identified as co-occurring 'group labels' that satisfy certain criteria. Such groups can be identified a) by annotating atomic motifs with known binding site resources from TRANSFAC, JASPAR or ORegAnno; or b) by 'de novo' clustering. Currently we show both types of patterns, but we consider annotation-based groups and patterns to be more reliable than 'de novo' groups and patterns. Work to improve genome-scale 'de novo' grouping is ongoing.

Motifs Found in this Region

The contents of the table below can be modified by changing your filter settings.
Currently, this table only contains motifs which:
+ have a discovery p-value < 1.0
+ were found using any of these algorithms: MotifSampler, Consensus.oops, Consensus.omops, Consensus.zmops, MEME.tcm, MEME.oops, MEME.zoops

Showing 36 out of 36 atomic motifs.

Group(s)
crtHsap#(name) [p-value]
Motif
craHsap#
Discovery
p-value
Location Width (+)motif (-)motif
0 annotated groups 3561 2.14E-01 chr5:131,860,360-131,860,372 13  Seq Logo
TTGCAGCCTGAGA
Seq Logo
TCTCAGGCTGCAA
2 annotated group(s):
201028 (NRSF) [2.19E-04]
201017 (Pbx1) [3.92E-04]
3562 1.05E-02 chr5:131,860,427-131,860,432 6  Seq Logo
GGACGG
Seq Logo
CCGTCC
0 annotated groups 3563 5.76E-01 chr5:131,860,443-131,860,454 12  Seq Logo
sCkGrGsTCnGC
Seq Logo
GCnGAwCyCmGw
13 annotated group(s):
200277 (Lmo2) [2.67E-05]
200005 (AP-4) [6.18E-05]
200698 (HEB) [7.93E-05]
...
3564 4.53E-01 chr5:131,860,493-131,860,504 12  Seq Logo
ynGCAGCTGGnG
Seq Logo
CnCCAGCTGCnr
4 annotated group(s):
200649 (MAZ) [6.43E-04]
200008 (Sp1) [6.47E-04]
200720 (CAC-binding_protein) [7.15E-04]
...
3566 5.31E-01 chr5:131,860,503-131,860,516 14  Seq Logo
CGGkGGGGsGGGGC
Seq Logo
GCCCCwCCCCmCCG
6 annotated group(s):
200444 (VDR) [2.67E-04]
200649 (MAZ) [6.43E-04]
200008 (Sp1) [6.47E-04]
...
3565 4.10E-01 chr5:131,860,507-131,860,514 8  Seq Logo
GGGGCrGG
Seq Logo
CCyGCCCC
3 annotated group(s):
200054 (NF-kappaB) [1.80E-04]
200733 (Smad4) [7.81E-04]
10071 (RORalfa-1) [8.06E-04]
3567 1.43E-01 chr5:131,860,557-131,860,568 12  Seq Logo
GCkGrGsTCnGC
Seq Logo
GCnGAwCyCmGC
4 annotated group(s):
201001 (DEAF-1) [6.76E-05]
10051 (Irf-2) [2.06E-04]
200453 (IRF-7A) [3.53E-04]
...
3568 4.41E-01 chr5:131,860,612-131,860,619 8  Seq Logo
TCGGTTTC
Seq Logo
GAAACCGA
9 annotated group(s):
201050 (ISRE_3) [3.90E-05]
201050 (ISRE_3) [3.90E-05]
201001 (DEAF-1) [6.76E-05]
...
3569 9.28E-03 chr5:131,860,613-131,860,620 8  Seq Logo
sGGTTTCs
Seq Logo
wGAAACCw
11 annotated group(s):
201050 (ISRE_3) [3.90E-05]
201050 (ISRE_3) [3.90E-05]
200062 (IRF-1) [6.25E-05]
...
3570 9.15E-03 chr5:131,860,614-131,860,621 8  Seq Logo
GGTTyCsG
Seq Logo
CwGrAACC
12 annotated group(s):
201050 (ISRE_3) [3.90E-05]
201050 (ISRE_3) [3.90E-05]
200062 (IRF-1) [6.25E-05]
...
3571 3.29E-01 chr5:131,860,616-131,860,623 8  Seq Logo
TTTCGGTT
Seq Logo
AACCGAAA
7 annotated group(s):
201050 (ISRE_3) [3.90E-05]
201050 (ISRE_3) [3.90E-05]
201001 (DEAF-1) [6.76E-05]
...
3572 3.29E-01 chr5:131,860,616-131,860,627 12  Seq Logo
TTTCGGTTTCTT
Seq Logo
AAGAAACCGAAA
10 annotated group(s):
201050 (ISRE_3) [3.90E-05]
201050 (ISRE_3) [3.90E-05]
200062 (IRF-1) [6.25E-05]
...
3573 1.04E-02 chr5:131,860,619-131,860,626 8  Seq Logo
CGGTTTCT
Seq Logo
AGAAACCG
7 annotated group(s):
201050 (ISRE_3) [3.90E-05]
201050 (ISRE_3) [3.90E-05]
201001 (DEAF-1) [6.76E-05]
...
3574 3.29E-01 chr5:131,860,620-131,860,627 8  Seq Logo
GGTTTCTT
Seq Logo
AAGAAACC
2 annotated group(s):
201001 (DEAF-1) [6.76E-05]
200453 (IRF-7A) [3.53E-04]
3575 3.29E-01 chr5:131,860,621-131,860,628 8  Seq Logo
GTTTCTTG
Seq Logo
CAAGAAAC
1 annotated group(s):
201001 (DEAF-1) [6.76E-05]
3576 1.00E-08 chr5:131,860,621-131,860,633 13  Seq Logo
GTTTCTTGCAAAC
Seq Logo
GTTTGCAAGAAAC
1 annotated group(s):
200964 (CAR) [9.32E-04]
3577 2.05E-02 chr5:131,860,634-131,860,641 8  Seq Logo
GCsGCGGC
Seq Logo
GCCGCwGC
1 annotated group(s):
200135 (POU2F1) [8.44E-04]
3579 1.00E-08 chr5:131,860,644-131,860,660 17  Seq Logo
CAGGGAAATCGAGGAGT
Seq Logo
ACTCCTCGATTTCCCTG
3 annotated group(s):
201001 (DEAF-1) [3.19E-04]
201028 (NRSF) [8.31E-04]
200135 (POU2F1) [8.44E-04]
3578 1.00E-08 chr5:131,860,647-131,860,654 8  Seq Logo
GGAAATCG
Seq Logo
CGATTTCC
18 annotated group(s):
200007 (Elk-1) [7.16E-05]
10028 (Elk-1) [1.33E-04]
10107 (p65) [1.95E-04]
...
3580 1.00E-08 chr5:131,860,654-131,860,674 21  Seq Logo
GAGGAGTTTCCGGGAAC...
Seq Logo
nnCGGTTCCCGGAAACT...
3 annotated group(s):
200480 (LUN-1) [9.51E-05]
200235 (AhR) [7.19E-04]
200235 (AhR) [8.20E-04]
3581 5.71E-01 chr5:131,860,673-131,860,685 13  Seq Logo
AACCrnsCCGGGA
Seq Logo
TCCCGGwnyGGTT
3 annotated group(s):
200113 (CREB) [3.29E-04]
200513 (ATF3) [5.36E-04]
200792 (Smad1) [7.86E-04]
3582 1.00E-08 chr5:131,860,688-131,860,699 12  Seq Logo
GCTGAGsTCTGC
Seq Logo
GCAGAwCTCAGC
4 annotated group(s):
200056 (NF-1) [7.67E-05]
200651 (NF-muE1) [3.71E-04]
200638 (HNF-4) [6.63E-04]
...
3583 1.00E-08 chr5:131,860,742-131,860,754 13  Seq Logo
CGGCCAGTGGCCm
Seq Logo
kGGCCACTGGCCG
8 annotated group(s):
200007 (Elk-1) [2.41E-05]
200056 (NF-1) [7.67E-05]
200993 (Tal-1) [2.39E-04]
...
3584 6.98E-03 chr5:131,860,752-131,860,759 8  Seq Logo
CCAGCTGG
Seq Logo
CCAGCTGG
11 annotated group(s):
200808 (Pax-1) [8.88E-05]
200644 (LBP-1) [3.16E-04]
200005 (AP-4) [3.71E-04]
...
3589 8.73E-03 chr5:131,860,756-131,860,782 27  Seq Logo
CTGGAAGTTCCAGCGGG...
Seq Logo
TTCCCCGGCTCCCGCTG...
10 annotated group(s):
200007 (Elk-1) [2.41E-05]
200056 (NF-1) [7.67E-05]
200808 (Pax-1) [8.88E-05]
...
3585 1.00E-08 chr5:131,860,759-131,860,764 6  Seq Logo
GAAGTT
Seq Logo
AACTTC
2 annotated group(s):
200005 (AP-4) [3.71E-04]
200261 (Olf-1) [5.03E-04]
3586 8.32E-02 chr5:131,860,766-131,860,773 8  Seq Logo
CAGCGGGA
Seq Logo
TCCCGCTG
0 annotated groups 3587 1.47E-01 chr5:131,860,769-131,860,780 12  Seq Logo
CGGGAGCyGGGG
Seq Logo
CCCCrGCTCCCG
1 annotated group(s):
200261 (Olf-1) [5.03E-04]
3588 8.42E-01 chr5:131,860,770-131,860,779 10  Seq Logo
GGGrGCCGGG
Seq Logo
CCCGGCyCCC
12 annotated group(s):
10101 (c-REL) [2.04E-04]
200053 (c-Rel) [2.13E-04]
200054 (NF-kappaB) [2.78E-04]
...
3593 1.00E-08 chr5:131,860,776-131,860,809 34  Seq Logo
yGGGGAAAACCGGCCCC...
Seq Logo
TCAGGTGGGGCTTTTCC...
0 annotated groups 3590 1.04E-02 chr5:131,860,785-131,860,790 6  Seq Logo
CCGGCC
Seq Logo
GGCCGG
3 annotated group(s):
201001 (DEAF-1) [1.05E-04]
201001 (DEAF-1) [1.42E-04]
200412 (ZEB_(1124_AA)) [3.04E-04]
3591 1.04E-02 chr5:131,860,799-131,860,806 8  Seq Logo
GCCCCACC
Seq Logo
GGTGGGGC
9 annotated group(s):
200277 (Lmo2) [6.83E-05]
201001 (DEAF-1) [1.05E-04]
200412 (ZEB_(1124_AA)) [1.34E-04]
...
3592 1.04E-02 chr5:131,860,800-131,860,807 8  Seq Logo
CCCCACCT
Seq Logo
AGGTGGGG
9 annotated group(s):
200693 (E12) [4.59E-05]
200693 (E12) [7.99E-05]
200412 (ZEB_(1124_AA)) [1.16E-04]
...
3594 1.14E-01 chr5:131,860,808-131,860,823 16  Seq Logo
GAATGCACCTGCCCms
Seq Logo
wkGGGCAGGTGCATTC
2 annotated group(s):
200720 (CAC-binding_protein) [2.65E-04]
10079 (SP1) [4.96E-04]
3595 8.22E-02 chr5:131,860,848-131,860,861 14  Seq Logo
kmAGGTGGGGCTTT
Seq Logo
AAAGCCCCACCTkm
3 annotated group(s):
200057 (COMP1) [2.79E-04]
200800 (AP-2) [2.90E-04]
200412 (ZEB_(1124_AA)) [5.18E-04]
3596 5.15E-03 chr5:131,860,871-131,860,877 7  Seq Logo
ACCTGCC
Seq Logo
GGCAGGT

Questions or comments: cisred@bcgsc.ca