Skip to main content

Project Summary

Start date:
1 October 2020
Duration:
7 years
Project Leader:
Funders:
BBSRC

Proteins are the workhorses of the cell: they facilitate chemical reactions, act as gene switches and have structural roles. For cells to work efficiently, proteins need to be produced in the right place, at the right time and in the right amount. They also need to be removed when no longer needed. There are many levels at which these processes are regulated and there are still many gaps in our knowledge.


This project seeks to use the model plant, Arabidopsis thaliana, to answer fundamental questions about the control of protein expression, including which mechanisms are important and how they interact in a complex multi-cellular organism.


We also aim to determine to what extent the protein content of a given cell, tissue or organ predicts observable traits (the phenotype) of the plant. To address these questions, we have designed an integrated programme of experiments and sophisticated mathematical analysis around a genetically variable population of Arabidopsis (known as the MAGIC population). This is a powerful genetic resource for mapping sections of DNA that correlate with variation in a trait (known as quantitative trait loci, QTL), to identify causal variants and dissect the regulation of genome expression.

Detail

This project aims to understand how protein abundance is controlled in plants and to determine the phenotypic consequences of proteomic variation, together with genotypic, structural, epigenotypic and transcriptomic variation. We propose an integrated programme of quantitative trait loci (QTL) analysis of an Arabidopsis multiparental advanced generation intercross (MAGIC) population. Firstly, we will determine all variation in the 19 MAGIC founders, and interactions between different 'omic layers, via a comprehensive set of assays. Long-read sequencing of 18 founders' genomes will be performed for comparison of structural variation relative to the 19th founder, the Col-0 reference. We will measure epigenetic marks of cytosine DNA methylation and chromatin accessibility by ATAC-seq. Transcript abundance and regulatory RNA species will be analysed by RNA-seq and protein translation and abundance quantified by Ribo-seq and proteomics, respectively. Next, a holistic experimental and computational analysis of 400 Arabidopsis MAGIC RILs (recombinant inbred lines) will be used to understand the regulatory networks controlling protein expression and dissect the relative contributions of genotype (including small-scale variation and large-scale structural rearrangements), RNA transcription, protein synthesis and protein degradation. We will use statistical and machine learning (ML) approaches to construct different types of molecular networks and identify causal mediators. Co-expression analysis will also identify novel physical complexes and sets of proteins that participate in common processes. Selected networks and complexes will be tested experimentally. Whole plant phenotyping of the MAGIC lines will be performed and used together with the molecular data to interrogate the predictive ability of different 'omic layers across a range of phenotypes. Finally, data and knowledge generated will be shared with the community through a user-friendly web resource.

Project Leader

Prof. Frederica Theodoulou

Science Team Leader

Team Members

Dr Xiaowei Li

Research Scientist

Dr Keywan Hassani-Pak

Head of Bioinformatics

Collaborators

Professor Richard Mott, University College London
Professor Kathryn Lilley, University of Cambridge