Thesis
Identifying the DNA binding specificity of mixed lineage leukaemia in leukaemia
- Abstract:
-
Chromosomal translocations of the KMT2A gene generate MLL fusion proteins that drive aberrant gene expression and define an aggressive, poor-prognosis subset of acute leukaemia. Although MLL binds unmethylated CpG-island promoters through its CXXC domain, only a subset of these potential sites is occupied in the fusion context. This thesis investigates the determinants of that selectivity across three regulatory layers: DNA sequence, co-factors and chromatin environment, and DNA methylation, including 5-methylcytosine (5mC) and 5-hydroxymethylcytosine (5hmC).
Attribution analyses from DNA-sequence models revealed a shared CpG baseline across cell types, characterised by strong positive correlations with CG-rich motifs such as VEZF1, KLFs, and MAZ. In the non-fusion MLL context, this CpG signal was refined by negatively attributed motifs, including MYC and KLF6, restricting binding to a selective subset of CpG islands. In MLL-fusion cells, CpG density contributed less strongly to prediction, and negatively attributed lineage-specific motifs such as RUNX and SP-family transcription factors highlighted cell-type specificity, suggesting that transcription-factor context along with CpG content, determines occupancy.
Models based on co-binding proteins and histone marks revealed two regimes: MLL co-localised with Menin, LEDGF, and initiation machinery at promoters, whereas MLL-AF4 was enriched along active transcription units with elongation-associated complexes, including PAF1, the super-elongation complex, BRD proteins, RNA polymerase II, and DOT1L with H3K79me2. These co-binding patterns indicate that chromatin context directs selectivity within CpG-rich regions.
Integrating DNA methylation with hydroxymethylation showed that unmethylated CpG provides a baseline requirement for binding, but differential cytosine modification states sharpen site selection. Across cell types, MLL-AF4 cells displayed elevated promoter 5mC and globally reduced 5hmC. Inclusion of 5hmC features improved prediction of MLL occupancy. Local 5hmC at the +2 nucleosome emerged as a salient predictor in a non-fusion context. In MLL-AF4 correlations with TET2 binding suggest that TET2-driven demethylation facilitates recruitment at specific elongation-linked sites.
Together, these results support a model in which CpG recognition defines baseline potential, while transcription-factor context and chromatin environment tune site selection. TET2-mediated modulation of methylation state adds a further regulatory layer that determines which CpG-rich regions are ultimately bound.
Actions
Access Document
- Files:
-
-
(Preview, Dissemination version, pdf, 13.4MB, Terms of use)
-
Authors
Contributors
+ Milne, T
- Institution:
- University of Oxford
- Division:
- MSD
- Department:
- NDORMS
- Role:
- Supervisor
- ORCID:
- 0000-0002-0413-4271
+ Smith, A
- Institution:
- University of Oxford
- Division:
- MSD
- Department:
- Radcliffe Department of Medicine
- Sub department:
- RDM-Nuffield Division of Clinical Laboratory Sciences
- Role:
- Supervisor
- ORCID:
- 0000-0001-6330-1407
+ Wellcome Trust
More from this funder
- Funder identifier:
- https://ror.org/029chgv08
- Grant:
- 226812/Z/22/Z
- Programme:
- Wellcome Trust PhD Studentship in Basic Science
- DOI:
- Type of award:
- DPhil
- Level of award:
- Doctoral
- Awarding institution:
- University of Oxford
- Language:
-
English
- Keywords:
- Subjects:
- Pubs id:
-
2407707
- Local pid:
-
pubs:2407707
- Deposit date:
-
2026-03-26
- ARK identifier:
Terms of use
- Copyright holder:
- Catherine Chahrour
- Copyright date:
- 2025
If you are the owner of this record, you can report an update to it here: Report update to this record