FILTERS:
pvalue:0.1
Region for Gene ENSG00000143774 (HGNC Symbol: GUK1)
Go to Ensembl.org Go to UCSC

This page describes one promoter-based search region.

Page Contents

Overview
Genomic Context
Image of Motifs and Modules
Motifs Found in this Region
Search region overview
Search region location chr1:226,391,600-226,394,804 (+) (3205 bp)
Assembly Human, Ensembl v40 (NCBI 36)
Gene name GUK1
Ensembl gene ID ENSG00000143774
Gene description Guanylate kinase (EC 2.7.4.8) (GMP kinase). [Source:Uniprot/SWISSPROT;Acc:Q16774]
Gene description source HGNC Symbol
Modules
(Co-occurring motif patterns)
4 annotation-based module(s)
No 'de novo' modules exist in this database.

Genomic Context: Human genome assembly from Mar 2006. (UCSC hg18, NCBI 36, Ensembl v40)

The image shown below was created to provide context for this search region by showing a 100000 bp span of the genome, centered on this search region.

UCSC Genome Browser Image    
  • The first track shows the location of the search region in red.
  • The last track shows known gene locations in this 100000 bp span of the genome.

This promoter-based search region is located close to a particular gene of interest, described in the table below. Click on the gene id to open a page at ensembl.org with more information about this gene.

Target gene used to create this search region
Ensembl ID ENSG00000143774 (GUK1)
RefSeq ID NM_000858
Gene description Guanylate kinase (EC 2.7.4.8) (GMP kinase). [Source:Uniprot/SWISSPROT;Acc:Q16774]
Gene type protein_coding
Gene location chr1: 226,394,605-226,403,275 (+) (8,671 bp)
Distance from the region's midpoint to TSS -1401
The distance from a cisred search region to a gene, is measured from the center of the search region to either the transcription termination site (TTS) or the transcription start site (TSS), whichever is closer.
A distance is negative if the TTS/TTS is downstream of the search region's center, relative to the gene's strand.
A distance is positive if the TTS/TTS is upstream of the search region's center, relative to the gene's strand.

A transcript's location is always reported relative to the positive strand.

Image of Motifs and Modules

The image shown below illustrates the location of the motifs and modules within this search region.
However, the number of modules in a region may exceed the genome browser's track limit. Thus, for each type of module (pattern), the text in the yellow box below indicates whether or not any modules of that type exist, and if so, whether or not all the modules of that type are displayed. (If it indicates that only the top 20 modules are displayed, this ranking is based on the number of instances of each module.)

User-specified filter settings do not affect the features displayed in the UCSC genome browser view.

About the patterns shown in this image:
There are no 'de novo' patterns to show.
All instances of each (TRANSFAC) annotation-based pattern are shown.
There are no (JASPAR) annotation-based patterns to show.

UCSC Genome Browser Image
Click on the image to view this region at UCSC.
    This image uses the Human genome assembly from Mar 2006. (UCSC hg18, NCBI 36, Ensembl v40) and contains a number of custom tracks followed by several UCSC tracks. The custom tracks show:
  • The long red and gray bar at the top shows the nominal 'search region' within which comparative genomics discovery methods were applied. Motif predictions were not made in coding exons and most types of repeats. In this database, the locations of masks applied to the search region are shown in gray.
  • The numbered brown blocks are 'atomic' motifs, i.e. conserved DNA sequence motifs that were identified by discovery methods and post-processing operations. Motifs are shaded to indicate the discovery p-value; a darker motif was more significant at the discovery stage.
  • Following the motif discovery stage, motifs were filtered by membership in co-occurring patterns, and patterns were ranked by genome-scale properties. Motifs instances that occur in highly-ranked putative regulatory modules may be more reliable predictions of functional genomic elements.
  • The connected sets of blue boxes are co-occurring patterns, i.e. putative regulatory modules. Modules are shaded to indicate the number of times that a pattern is found in the target genome; a darker module is associated with more search regions. An pattern is called either 'de novo' or annotation-based, depending on the type of groups that appear in the pattern. (see next point)
  • Before patterns (i.e. modules) can be identified from atomic motifs, groups of similar motifs must be found; modules are then identified as co-occurring 'group labels' that satisfy certain criteria. Such groups can be identified a) by annotating atomic motifs with known binding site resources from TRANSFAC, JASPAR or ORegAnno; or b) by 'de novo' clustering. Currently we show both types of patterns, but we consider annotation-based groups and patterns to be more reliable than 'de novo' groups and patterns. Work to improve genome-scale 'de novo' grouping is ongoing.

Motifs Found in this Region

The contents of the table below can be modified by changing your filter settings.
Currently, this table only contains motifs which:
+ have a discovery p-value < 0.1
+ were found using any of these algorithms: MotifSampler, Consensus.oops, Consensus.omops, Consensus.zmops, MEME.tcm, MEME.oops, MEME.zoops

Showing 22 out of 22 atomic motifs.

Group(s)
crtHsap#(name) [p-value]
Motif
craHsap#
Discovery
p-value
Location Width (+)motif (-)motif
1 annotated group(s):
40073 (Evi-1) [3.60E-04]
33495 1.96E-02 chr1:226,391,726-226,391,737 12  Seq Logo
ATCyTTTCAAAT
Seq Logo
ATTTGAAArGAT
8 annotated group(s):
40114 (Lhx3a) [3.25E-04]
40155 (Pax-6) [4.64E-04]
50047 (HNF-3beta) [4.80E-04]
...
33500 2.48E-02 chr1:226,392,338-226,392,343 6  Seq Logo
AAATTA
Seq Logo
TAATTT
7 annotated group(s):
40032 (C-EBPdelta) [1.70E-04]
40114 (Lhx3a) [3.25E-04]
40155 (Pax-6) [4.64E-04]
...
33504 2.48E-02 chr1:226,392,341-226,392,346 6  Seq Logo
TTAAAT
Seq Logo
ATTTAA
5 annotated group(s):
40038 (Cdc5) [9.07E-05]
40164 (POU2F1) [1.90E-04]
40155 (Pax-6) [6.36E-04]
...
33512 2.48E-02 chr1:226,392,355-226,392,360 6  Seq Logo
TTAAAT
Seq Logo
ATTTAA
1 annotated group(s):
40016 (AR) [7.85E-04]
33520 2.48E-02 chr1:226,392,744-226,392,749 6  Seq Logo
TTTATT
Seq Logo
AATAAA
3 annotated group(s):
40153 (Pax-2) [5.03E-04]
40003 (AhR) [9.32E-04]
40160 (Pbx1a) [9.40E-04]
33527 2.48E-02 chr1:226,392,791-226,392,796 6  Seq Logo
AATAAA
Seq Logo
TTTATT
6 annotated group(s):
40083 (FOXP3) [4.06E-05]
40176 (RORalpha2) [1.56E-04]
50072 (RORalfa-2) [1.82E-04]
...
33534 2.48E-02 chr1:226,392,827-226,392,832 6  Seq Logo
TAATTT
Seq Logo
AAATTA
9 annotated group(s):
40039 (Cdx-2) [5.13E-05]
40095 (HNF-1alpha) [1.38E-04]
40095 (HNF-1alpha) [1.65E-04]
...
33542 2.48E-02 chr1:226,392,962-226,392,967 6  Seq Logo
TTTATT
Seq Logo
AATAAA
1 annotated group(s):
40039 (Cdx-2) [9.86E-05]
33551 2.48E-02 chr1:226,393,479-226,393,484 6  Seq Logo
AATAAA
Seq Logo
TTTATT
8 annotated group(s):
40007 (aMEF-2) [1.22E-04]
40122 (MEF-2A) [1.62E-04]
40143 (Nkx6-2) [3.16E-04]
...
33565 2.48E-02 chr1:226,393,505-226,393,510 6  Seq Logo
TAATTT
Seq Logo
AAATTA
1 annotated group(s):
40018 (ATF-1) [3.26E-04]
33573 4.36E-02 chr1:226,394,204-226,394,213 10  Seq Logo
CTGTGACGTA
Seq Logo
TACGTCACAG
6 annotated group(s):
40048 (CREB) [8.70E-05]
40021 (ATF6) [1.11E-04]
40001 (120-kDa_CRE-binding_protein) [1.82E-04]
...
33584 5.04E-03 chr1:226,394,239-226,394,251 13  Seq Logo
sGCkGTGACGTAG
Seq Logo
CTACGTCACmGCw
1 annotated group(s):
40181 (Smad1) [5.40E-04]
33593 3.18E-02 chr1:226,394,260-226,394,272 13  Seq Logo
sGCCGGGCCsGCG
Seq Logo
CGCwGGCCCGGCw
0 annotated groups 33614 9.95E-02 chr1:226,394,321-226,394,332 12  Seq Logo
kGCGGCGCCsGC
Seq Logo
GCwGGCGCCGCm
0 annotated groups 33607 3.72E-03 chr1:226,394,325-226,394,336 12  Seq Logo
kGGmCGGyGAGk
Seq Logo
mCTCrCCGkCCm
1 annotated group(s):
40189 (Spz1) [7.85E-04]
33626 7.59E-02 chr1:226,394,327-226,394,338 12  Seq Logo
GCCGGCGCGGCG
Seq Logo
CGCCGCGCCGGC
7 annotated group(s):
40055 (DP-1) [3.98E-04]
40055 (DP-1) [4.29E-04]
40090 (HES-1) [4.94E-04]
...
33638 5.33E-03 chr1:226,394,367-226,394,382 16  Seq Logo
kCmCrGCGyCGCGCCG
Seq Logo
CGGCGCGrCGCyGkGm
1 annotated group(s):
40139 (Nkx2-2) [9.11E-04]
33651 3.50E-02 chr1:226,394,486-226,394,497 12  Seq Logo
mAGTACTTCyCT
Seq Logo
AGrGAAGTACTk
1 annotated group(s):
40154 (Pax-5) [5.69E-04]
33676 5.28E-03 chr1:226,394,575-226,394,590 16  Seq Logo
kGmTGsTGCGGCGCys
Seq Logo
wrGCGCCGCAwCAkCm
3 annotated group(s):
40058 (E2F-1) [1.35E-04]
40048 (CREB) [4.45E-04]
40154 (Pax-5) [5.69E-04]
33661 6.63E-02 chr1:226,394,578-226,394,585 8  Seq Logo
CGGTGAGT
Seq Logo
ACTCACCG
2 annotated group(s):
40058 (E2F-1) [1.35E-04]
40154 (Pax-5) [5.69E-04]
33669 1.06E-02 chr1:226,394,583-226,394,590 8  Seq Logo
CrGCGCCG
Seq Logo
CGGCGCyG
1 annotated group(s):
40003 (AhR) [5.21E-04]
33681 7.23E-02 chr1:226,394,585-226,394,597 13  Seq Logo
CCGCTGTGACGTA
Seq Logo
TACGTCACAGCGG

Questions or comments: cisred@bcgsc.ca