Colin Semple

These susceptible isles

October 7, 2024

Fascinating insights into the regional differentiation within the UK and Ireland – with Mike Halachev, Jim Wilson and others. Remarkably, even with the current – far from complete – genomic coverage of regional populations we see evidence for different rare variants within different regions. With conservative filtering we find that a fraction of these regionally enriched rare variants are likely to have adverse biomedical consequences in homozygous individuals. Now available on Nature Comms with commentary in the Economist – following a marathon review process of ~2 years!

The chaotic genome of ovarian cancer

January 22, 2024

For the past few years we’ve been working with Charlie Gourley, Patricia Roxburgh, Ailith Ewing and others to better understand high grade serous ovarian cancer (HGSOC) – and this has culminated in a manuscript now posted online. Ailith has also written a thread explaining this work. There aren’t enough good treatment options for women who develop these tumours, partly because we don’t yet understand how the tumours evolve during the course of the disease. We’ve assembled a large whole genome sequencing dataset, including 324 of these tumours, gaining a number of new insights into how they emerge and grow, and opening new avenues to better treatments. HGSOC is also a tumour type that shows extreme structural instability, resulting in chaotic new rearrangements of the human genome. We hope that our study also provides a blueprint for investigating other poorly studied cancers showing high structural diversity.

HGU funding boost

July 25, 2023

The MRC Human Genetics Unit (where we are located) has successfully secured funding for another 5 years from the UK MRC, totalling £46.3M, as reported by the BBC and others. This is the culmination of a ~18-month process spanning preparations for review, review of research proposals, a site visit by a review panel and subsequent negotiations over the final financial settlement. Our group lived to fight another day – but we’re glad it’s over!

Mutational bias in spermatogonia impacts human regulatory sites

June 28, 2021

Vera’s analysis of spermatogonial ATAC-seq data from Martin Taylor’s lab is now available (biorxiv; tweetorial) and shows that spermatogonial regulatory sites that are bound by particular factors (such as NRF1) are associated with higher mutation rates. Human populations show enrichment for singleton (i.e. relatively recent) deletion breakpoints at these sites. Surprisingly the same sites are also enriched for short insertions, and these insertions often duplicate the binding sites of the factors binding there, producing clusters of binding sites. As a side effect, other cells/tissues that share regulatory sites with spermatogonia (such as regions of the developing brain) are also impacted by these enrichments of mutations – suggesting that spermatogonial mutational bias may carry a cost by causing disruptions to human development.

Large structural variants impact tumour phenotype

March 23, 2021

For the past few years we’ve been working with Charlie Gourley’s lab in the CRUK Edinburgh Centre generating WGS data for high grade serous ovarian cancer (HGSOC) tumour samples – and the most recent product of this work, led by Dr Ailith Ewing, has recently surfaced in Clinical Cancer Research.

This work establishes that large (multi-megabase) alterations in chromosome structure – and especially large deletions – have a significant impact on tumour function. Ailith has shown that these large variants cause repair deficiencies in tumours that are likely to be targetable by chemotherapeutics – and has written a tweetorial explaining this. These variants should therefore be considered in addition to the small variants that are more commonly profiled in HGSOC patient samples, and intriguingly there is evidence for similar large variants impacting tumours across a range of other cancers.

Lesion segregation dominates early tumour genome evolution

June 24, 2020


For almost 3 years we’ve had the pleasure of participating in the Liver Cancer Evolution (LCE) consortium, along with the labs of Duncan Odom, Paul Flicek, Nuria Lopez-Bigas and Martin Taylor – and today sees the publication of the first consortium paper in Nature. Some time ago the Odom lab generated some unprecedented data, mapping genome and transcriptome changes during liver tumour evolution in several strains/species of mice, and we have all collaborated on the analysis of these data. The initial analyses revealed surprising patterns of mutational asymmetry in these model systems: huge multimegabase segments of each chromosome showed strong biases to particular base substitutions. These patterns emerge due to a failure to repair mutagenic DNA lesions over successive cell cycles, and similar patterns can appear in human cells following mutagenesis. Each new round of DNA replication on a lesion-containing strand can lead to the incorporation of a different mispaired base opposite the lesion site in the newly synthesized strand, generating cells in the evolving tumour with different mutations of the same base pair. Unrepaired lesion segregation is therefore an unexpected source of diversity during what is otherwise straightforward clonal evolution, and may provide fuel for adaptive evolution in early tumours.

Northern exposure

November 28, 2019


For the past couple of years we’ve been studying the unusual genetics of the Shetland islands with Jim Wilson’s group. Jim has a long history of studying the isolated populations of the Scottish Northern Isles, but we’ve just published the first study that is based upon whole genome sequencing (WGS), comparing Shetland (n=500) and mainland Scottish (n=1156) populations. The results are quite striking, showing an enrichment of genetic variants that are rare or ultra-rare (i.e. not yet seen elsewhere) such that around 10% of all Shetland variants “are unique to the VIKING cohort or are seen at frequencies at least ten fold higher than in more cosmopolitan control populations”. Many of these variants are predicted to alter gene function and they are particularly enriched in promoter regions, which control gene expression patterns. This raises the possibility that gene expression may evolve relatively rapidly in isolated human populations.
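
For the curious, the "unique or at least ten fold higher" criterion quoted above can be sketched in a few lines of Python. This is only a toy illustration, not the pipeline used in the paper: the function name, the variant names and the allele frequencies below are all made up for the example, and the real analysis involves far more careful filtering and choice of control populations.

```python
def classify_variant(cohort_af, control_af, fold=10.0):
    """Label a variant by comparing cohort and control allele frequencies.

    Hypothetical helper for illustration only; the thresholds mirror the
    quoted criterion (unique to the cohort, or >= ten fold more frequent).
    """
    if cohort_af <= 0:
        return "absent"      # not observed in the cohort at all
    if control_af == 0:
        return "unique"      # observed in the cohort, never in controls
    if cohort_af / control_af >= fold:
        return "enriched"    # at least `fold` times more frequent
    return "shared"          # similar frequency in both groups


# Invented allele frequencies: (cohort, control)
variants = {
    "var_a": (0.012, 0.0),
    "var_b": (0.020, 0.001),
    "var_c": (0.050, 0.040),
}

labels = {name: classify_variant(c, k) for name, (c, k) in variants.items()}
flagged = [name for name, label in labels.items() if label in ("unique", "enriched")]
print(labels)
print(f"{len(flagged)} of {len(variants)} toy variants are unique or enriched")
```

Variants absent from the controls are handled separately so that the fold change is never computed by dividing by zero.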

Modeling the breakome

February 11, 2019


Tracy’s work building models of DNA double strand break susceptibility finally emerges from review this week in Genome Biology. She shows that it is possible to make remarkably accurate models, predicting the frequency of breakage in a given region of the genome, using a variety of underlying chromatin features. The frequencies predicted by these models can then be compared (above) to the rates of breakage seen in human tumour data, to identify regions that may be important to tumourigenesis. This work bridges the fields of genome instability, chromatin structure and cancer genomics – which is pretty cool, until you attempt to find suitably eclectic reviewers! It’s also the first manuscript to come out of our ongoing collaboration with our friends in the Crosetto group at the Karolinska.

Anchors in the storm

July 31, 2018

Chromatin loop anchors seem to be a basic unit of the physical organisation of the human genome, providing stable architectural sites within the nucleus, and influencing gene expression. Vera’s work exploring the strange mutational landscape at loop anchors shows that these sites are also unusually fragile, showing high rates of DNA double strand breaks in vitro and elevated rates of breakage in a variety of tumours. Unexpectedly a substantial fraction of loop anchors also coincide precisely with human recombination hotspots (HS_LAPs below), establishing these sites as foci for change during mammalian evolution as well as during tumourigenesis.


Average human recombination rates within 500 kb of recombination hotspots (HSs), the subset of LAPs overlapping HSs (HS_LAPs) and all LAPs. Recombination rates were derived from the worldwide whole genome sequencing data of the 1000 Genomes Project.