Gene Set: GGCNKCCATNK_UNKNOWN

Standard name GGCNKCCATNK_UNKNOWN
Systematic name M11715
Brief description Genes having at least one occurence of the highly conserved motif M88 GGCNKCCATNK in the region spanning up to 4 kb around their transcription start sites. The motif does not match any known transcription factor binding site (v7.4 TRANSFAC).
Full description or abstract Comprehensive identification of all functional elements encoded in the human genome is a fundamental need in biomedical research. Here, we present a comparative analysis of the human, mouse, rat and dog genomes to create a systematic catalogue of common regulatory motifs in promoters and 3' untranslated regions (3' UTRs). The promoter analysis yields 174 candidate motifs, including most previously known transcription-factor binding sites and 105 new motifs. The 3'-UTR analysis yields 106 motifs likely to be involved in post-transcriptional regulation. Nearly one-half are associated with microRNAs (miRNAs), leading to the discovery of many new miRNA genes and their likely target genes. Our results suggest that previous estimates of the number of human miRNA genes were low, and that miRNAs regulate at least 20% of human genes. The overall results provide a systematic view of gene regulation in the human, which will be refined as additional mammalian genomes become available.
Collection C3: regulatory target gene sets
      TFT: All transcription factor targets
            TFT:TFT_Legacy: Legacy transcription factor targets
Source publication Pubmed 15735639   Authors: Xie X,Lu J,Kulbokas EJ,Golub TR,Mootha V,Lindblad-Toh K,Lander ES,Kellis M
Exact source  
Related gene sets (show 169 additional gene sets from the source publication)

(show 158 gene sets from the same authors)
External links  
Organism Homo sapiens
Contributed by Xiaohui Xie (Broad Institute)
Source platform HUMAN_GENE_SYMBOL
Dataset references  
Download gene set format: grp | text | gmt | gmx | xml
Compute overlaps (show collections to investigate for overlap with this gene set)
Compendia expression profiles GTEx compendium
Human tissue compendium (Novartis)
Global Cancer Map (Broad Institute)
NCI-60 cell lines (National Cancer Institute)
Advanced query Further investigate these 121 genes
Gene families Categorize these 121 genes by gene family
Show members (show 122 members mapped to 121 genes)
Version history 7.1: Moved to TFT_Legacy sub-collection.

See MSigDB license terms here. Please note that certain gene sets have special access terms.