CpG island

CpG island

In genetics, CpG islands or CG islands are genomic regions that contain a high frequency of CpG sites but to date objective definitions for CpG islands are limited. In mammalian genomes, CpG islands are typically 300-3,000 base pairs in length. They are in and near approximately 40% of promoters of mammalian genes.[1] About 70% of human promoters have a high CpG content. Given the GC frequency however, the number of CpG dinucleotides is much lower than expected.[2] The "p" in CpG refers to the phosphodiester bond between the cytosine and the guanine, which indicates that the C and the G are next to each other in sequence regardless of being single- or double- stranded. More explicitly, both C and G would be on the same strand of DNA/RNA covalently bonded (chemically connected) by a phosphodiester bond (a strong bond). This differs from the easily confused base-pairing of C and G which share three hydrogen bonds (weaker bond) across two separate strands of DNA (also known as complementary base pairing).

The usual formal definition of a CpG island is a region with at least 200 bp and with a GC percentage that is greater than 50% and with an observed/expected CpG ratio that is greater than 60%, where the value of expected CpG is calculated by formula (GC content/2)2.[3] Another recent study revised the rules of CpG island prediction in order to exclude other GC-rich genomic sequences such as Alu repeats. Based on an extensive search on the complete sequences of human chromosomes 21 and 22, DNA regions greater than 500 bp with a GC content greater than 55% and observed CpG/expected CpG of 0.65 were more likely to be the true CpG islands associated with the 5' regions of genes.[4]

CpG islands are characterized by CpG dinucleotide content of at least 60% of that which would be statistically expected (~4–6%), whereas the rest of the genome has much lower CpG frequency (~1%), a phenomenon called CG suppression. Unlike CpG sites in the coding region of a gene, in most instances, the CpG sites in the CpG islands of promoters are unmethylated if genes are expressed. This observation led to the speculation that methylation of CpG sites in the promoter of a gene may inhibit the expression of a gene. Methylation is central to imprinting alongside histone modifications.[5]

CpG islands typically occur at or near the transcription start site of genes, particularly housekeeping genes, in vertebrates.[2] Normally a C (cytosine) base followed immediately by a G (guanine) base (a CpG) is rare in vertebrate DNA because the cytosines in such an arrangement tend to be methylated. This methylation helps distinguish the newly synthesized DNA strand from the parent strand, which aids in the final stages of DNA proofreading after duplication. However, over evolutionary time methylated cytosines tend to turn into thymines because of spontaneous deamination. While there is a special enzyme in human (Thymine-DNA glycosylase, or TDG) that specifically replaces T's from T/G mismatches, it is not sufficiently effective to prevent the relatively rapid mutation of the dinucleotides. The result is that CpGs are relatively rare unless there is selective pressure to keep them or a region is not methylated for some reason, perhaps having to do with the regulation of gene expression.

See also

References

  1. ^ Fatemi M, Pao MM, Jeong S, Gal-Yam EN, Egger G, Weisenberger DJ, Jones PA (2005). "Footprinting of mammalian promoters: use of a CpG DNA methyltransferase revealing nucleosome positions at a single molecule level". Nucleic Acids Res 33 (20): e176. doi:10.1093/nar/gni180. PMC 1292996. PMID 16314307. http://www.pubmedcentral.nih.gov/articlerender.fcgi?tool=pmcentrez&artid=1292996. 
  2. ^ a b Saxonov S, Berg P, Brutlag DL (2006). "A genome-wide analysis of CpG dinucleotides in the human genome distinguishes two distinct classes of promoters". Proc Natl Acad Sci USA 103 (5): 1412–1417. doi:10.1073/pnas.0510310103. PMC 1345710. PMID 16432200. http://www.pubmedcentral.nih.gov/articlerender.fcgi?tool=pmcentrez&artid=1345710. 
  3. ^ Gardiner-Garden M, Frommer M (1987). "CpG islands in vertebrate genomes". Journal of molecular biology 196 (2): 261–82. doi:10.1016/0022-2836(87)90689-9. PMID 3656447. 
  4. ^ Takai D, Jones PA (2002). "Comprehensive analysis of CpG islands in human chromosomes 21 and 22.". Proc Natl Acad Sci USA 99 (6): 3740–5. doi:10.1073/pnas.052410099. PMC 122594. PMID 11891299. http://www.pnas.org/cgi/content/full/99/6/3740. 
  5. ^ Feil R, Berger F (2007). "Convergent evolution of genomic imprinting in plants and mammals". Trends Genet 23 (4): 192–199. doi:10.1016/j.tig.2007.02.004. PMID 17316885. 

External links



Wikimedia Foundation. 2010.

Игры ⚽ Нужно решить контрольную?

Look at other dictionaries:

  • CpG island — (= CG island) Region of genomic DNA rich in the dinucleotide C G. Methylation of the C in the dinucleotide is maintained through cell divisions, and profoundly affects the degree of transcription of the nearby genes, and is important in… …   Dictionary of molecular biology

  • CpG — can be: CpG site methylated sequences of DNA significant in gene regulation CpG Oligodeoxynucleotide unmethylated sequences of DNA that have immunostimulatory properties CpG island regions of DNA that contain several CpG sites This disambiguation …   Wikipedia

  • CpG site — Cytosine Phospha …   Wikipedia

  • CG island — (= CpG island) Region of genomic DNA rich in CG dinucleotides. Frequently found near genes that are transcribed at high level …   Dictionary of molecular biology

  • New Brighton, Staten Island — View of Brighton, Staten Island, New York, an 1857 engraving New Brighton, formerly an independent village, is today a neighborhood located on the North Shore of Staten Island in New York City, USA. The neighborhood comprises an older industrial… …   Wikipedia

  • Gola Island — Infobox Irish Place name = Gola Island gaeilge = Gabhla map pin coords = left: 37px; top: 42px north coord = 55.050941 | west coord = 8.235626 | irish grid = B847228 area = | elevation = province = Ulster county = County Donegal rural pop = 0… …   Wikipedia

  • DNA methylation — Illustration of a DNA molecule that is methylated at the two center cytosines. DNA methylation plays an important role for epigenetic gene regulation in development and disease. DNA methylation is a biochemical process that is important for… …   Wikipedia

  • Methylation — In the chemical sciences, methylation denotes the addition of a methyl group to a substrate or the substitution of an atom or group by a methyl group. Methylation is a form of alkylation with, to be specific, a methyl group, rather than a larger… …   Wikipedia

  • MECP2 — Methyl CpG binding protein 2 (Rett syndrome) PDB rendering based on 1qk9 …   Wikipedia

  • Methylated DNA immunoprecipitation — (MeDIP or mDIP) is a large scale (chromosome or genome wide) technique that is used to enrich for methylated DNA sequences. It consists of isolating methylated DNA fragments via an antibody raised against 5 methylcytosine (5mC). This technique… …   Wikipedia

Share the article and excerpts

Direct link
Do a right-click on the link above
and select “Copy Link”