Analysis of Methylated DNA

Author

Piero Palacios Bernuy

Published

March 7, 2024

Abstract

This document is part of a series of the analysis of Omics data. Especifycally, here is showed how to analyze Illumina 450k array and Methyl-Seq data with Bioconductor packages. Also, it’s showcased how to make plots of the methylation data in the context of genomic positions and genes strutctures.

Keywords

450k array, methylation, CpG, CpGs islands

1 Methylation Analysis

1.1 Methylation

Epigenetics is the study of how your behaviors and environment can cause changes that affect the way your genes work. Unlike genetic changes, epigenetic changes are reversible and do not change your DNA sequence, but they can change how your body reads a DNA sequence (CDC,2024).

Talking about the genes, these can only be expressed when the chromatin is in it’s relaxed form (euchromatin) and reversely, these cannot be expressed in it’s compacted form (hetero-chromatin). Methylation of cytosine’s control part of this process of the chromatin and it’s really important to understand is a gene (or the surroundings) is methylated.

DNA Methylation

DNA Methylation

1.2 CpG’s

The cytosine’s (C) that are methylated are followed by a guanine (G). This pattern is called CpG in the context of epigenetics. These CpG’s tend to cluster into groups (CpG islands) and, these Cpg islands tend to be close to the promoter region of the genes.

1.3 Bisulfite treatment

Bisulfite Treatment

Bisulfite Treatment

If a C is methylated the bisulfite treatment won´t change the cytosine but, if the C is not methylated, the C will suffer a transition to a T.

2 Methyl-Seq

Analyzing bisulfite sequencing data requires a server or an HPC. U can use the methylseq pipeline from nfcore to get the .cov files to proceed with the downstream analysis on R.

To see how to analyze Whole-genome bisulfite sequencing (WGBS) with the (Hansen, Langmead, and Irizarry 2012), click on the script below the image.

2.0.1 WGBS

Methylation DMR

Methylation DMR

3 Illumina 450k array

3.1 Methylation 450k array

To see how to analyze from a Illumina 450k array with the (Fortin, Triche, and Hansen 2016) package, click to the script below the image.

3.2 Methylation 450k array - Multi-resolution Analysis

To see how to analyze at a multi-resolution level from a Illumina 450k array with the (Fortin, Triche, and Hansen 2016) package, click to the script below the table.

Methylation DMR

Methylation DMR
Code
mcols(granges(dat$object))$type |> 
    table() |> 
    prop.table() |> 
    as.data.frame() |> 
    dplyr::rename("Island Status" = Var1) |> 
    gt::gt()
Source: DNA methylation measurement

Citation

BibTeX citation:
@online{palacios bernuy2024,
  author = {Palacios Bernuy, Piero},
  title = {Analysis of {Methylated} {DNA}},
  date = {2024-03-07},
  langid = {en},
  abstract = {This document is part of a series of the analysis of Omics
    data. Especifycally, here is showed how to analyze Illumina 450k
    array and Methyl-Seq data with Bioconductor packages. Also, it’s
    showcased how to make plots of the methylation data in the context
    of genomic positions and genes strutctures.}
}
For attribution, please cite this work as:
Palacios Bernuy, Piero. 2024. “Analysis of Methylated DNA.” An Open Source Portfolio. March 7, 2024.

References

Fortin, Jean-Philippe, Jr Triche Timothy J., and Kasper D Hansen. 2016. Preprocessing, normalization and integration of the Illumina HumanMethylationEPIC array with minfi.” Bioinformatics 33 (4): 558–60. https://doi.org/10.1093/bioinformatics/btw691.
Hansen, Kasper D., Benjamin Langmead, and Rafael A. Irizarry. 2012. BSmooth: From Whole Genome Bisulfite Sequencing Reads to Differentially Methylated Regions 13: R83. https://doi.org/10.1186/gb-2012-13-10-r83.