Umeå University's logo

umu.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
An improved chromosome-scale genome assembly and population genetics resource for populus tremula
Umeå University, Faculty of Science and Technology, Department of Plant Physiology. Umeå University, Faculty of Science and Technology, Umeå Plant Science Centre (UPSC).ORCID iD: 0000-0002-5249-604X
Umeå University, Faculty of Science and Technology, Umeå Plant Science Centre (UPSC). Umeå University, Faculty of Science and Technology, Department of Plant Physiology.ORCID iD: 0000-0002-9771-467x
National Engineering Laboratory for Tree Breeding; Key Laboratory of Genetics and Breeding in Forest Trees and Ornamental Plants, Ministry of Education; The Tree and Ornamental Plant Breeding and Biotechnology Laboratory of National Forestry and Grassland Administration, College of Biological Sciences and Technology, Beijing Forestry University, China.
Umeå University, Faculty of Science and Technology, Department of Plant Physiology. Umeå University, Faculty of Science and Technology, Umeå Plant Science Centre (UPSC).
Show others and affiliations
2024 (English)In: Physiologia Plantarum, ISSN 0031-9317, E-ISSN 1399-3054, Vol. 176, no 5, article id e14511Article in journal (Refereed) Published
Abstract [en]

Aspen (Populus tremula L.) is a keystone species and a model system for forest tree genomics. We present an updated resource comprising a chromosome-scale assem- bly, population genetics and genomics data. Using the resource, we explore the genetic basis of natural variation in leaf size and shape, traits with complex genetic architecture.

We generated the genome assembly using long-read sequencing, optical and high-density genetic maps. We conducted whole-genome resequencing of the Umeå Aspen (UmAsp) collection. Using the assembly and re-sequencing data from the UmAsp, Swedish Aspen (SwAsp) and Scottish Aspen (ScotAsp) collections we performed genome-wide association analyses (GWAS) using Single Nucleotide Polymorphisms (SNPs) for 26 leaf physiognomy phenotypes. We conducted Assay of Transposase Accessible Chromatin sequencing (ATAC-Seq), identified genomic regions of accessible chromatin, and subset SNPs to these regions, improving the GWAS detection rate. We identified candidate long non-coding RNAs in leaf samples, quantified their expression in an updated co-expression network, and used this to explore the functions of candidate genes identified from the GWAS.

A GWAS found SNP associations for seven traits. The associated SNPs were in or near genes annotated with developmental functions, which represent candidates for further study. Of particular interest was a !177-kbp region harbouring associations with several leaf phenotypes in ScotAsp.

We have incorporated the assembly, population genetics, genomics, and GWAS data into the PlantGenIE.org web resource, including updating existing genomics data to the new genome version, to enable easy exploration and visualisation. We provide all raw and processed data to facilitate reuse in future studies.

Place, publisher, year, edition, pages
John Wiley & Sons, 2024. Vol. 176, no 5, article id e14511
Keywords [en]
genome assembly, natural selection, co-expression, population genetics, Populus, aspen, GWAS, leaf physiognomy, leaf shape, leaf size, genetic architecture, ATAC-Seq, lncRNA
National Category
Bioinformatics and Computational Biology Genetics and Genomics
Identifiers
URN: urn:nbn:se:umu:diva-229976DOI: 10.1111/ppl.14511ISI: 001313686100001PubMedID: 39279509Scopus ID: 2-s2.0-85204093798OAI: oai:DiVA.org:umu-229976DiVA, id: diva2:1900462
Funder
Swedish Research Council, 2019-05476Swedish Research Council Formas, 2018-01644Vinnova, S111416L0710
Note

Supplementary figures and appendixes under Supporting information on article web page. 

Available from: 2024-09-23 Created: 2024-09-23 Last updated: 2025-12-12Bibliographically approved
In thesis
1. Tackling a genomic abyss: approaches to link long non-coding RNAs to potential biological function in Norway spruce and aspen
Open this publication in new window or tab >>Tackling a genomic abyss: approaches to link long non-coding RNAs to potential biological function in Norway spruce and aspen
2024 (English)Doctoral thesis, comprehensive summary (Other academic)
Alternative title[sv]
Att tackla en genomisk avgrund : tillvägagångssätt för att koppla långa icke-kodande RNA till potentiell biologisk funktion i gran och asp
Abstract [en]

Protein coding genes have been extensively studied in both plant and animal genomes, while non-coding portions of the genomes were considered not relevant for a long time. This was due to the fact that non-coding led immediately to not functional, until the discovery of let-7, the first conserved miRNA, in Caenorhabditis elegans. From here on, several studies on small RNAs (sRNAs) were performed, while long non-coding RNAs (lncRNAs) have risen to attention in the last two decades, also because of their usage as diagnostic biomarkers in cancer. Studies to assign function to RNAs have progressed more slowly in plants compared to the animal kingdom and there is still a lot to explore even in the protein coding space, above all if we consider huge genomes like Norway spruce and Scots pine, so the non-coding part of the genome still represents an abyss to discover. In my PhD I mostly focused on a subclass of non-coding RNAs in Norway spruce and aspen. Long non-coding RNAs are considered arbitrarily longer than 200 nucleotides (nt) and can have one small open reading frame (sORF, length < 300 nt) coding for a short peptide (not a complete protein). lncRNAs tend to be expressed at lower levels than genes, but with precise spatio-temporal patterns. They are mostly expressed in particular tissues, stages of a biological process and/or particular conditions, that are often related to biotic or abiotic stresses. They have low levels of sequence homology conservation, even in close related species. In particular, I studied the class of lncRNAs located in the intergenic space, the long intergenic non-coding RNAs (lincRNAs). 

In the first part of this thesis, I developed a pipeline to identify lincRNAs. This pipeline allows to identify in silico bona fide lincRNAs starting from an RNA-Sequencing dataset. It is an ensemble method, considering different tools and the characteristics of lincRNAs. 

In the second part of this thesis, I focused on functionally annotating lincRNAs. To achieve this challenge, I decided to use the guilt-by-association strategy. This method relies on a co-expression network containing both lincRNAs and protein coding genes. Through a functional enrichment of the protein coding genes, it is possible to transfer the same annotation to a lincRNA co-expressed in the same module. I have also tried to relate lincRNAs to a possible function in the de novo methylation of DNA via the RdDM pathway in Norway spruce.

In the last part of this thesis, I identified lincRNAs expressed during leaf development in aspen and produced CRISPR-Cas9 mutants lacking the sequence of two lincRNAs in order to provide a functional validation. 

In general, RNA-Sequencing has enabled and advanced the identification of lincRNAs, and this thesis demonstrates an implemented strategy to identify and assign putative functional information to lincRNAs, deepening the knowledge in the non-coding abyss.

Place, publisher, year, edition, pages
Umeå: Umeå University, 2024. p. 58
Keywords
Norway spruce, aspen, non-coding RNAs, long non-coding RNAs, RNA-Seq, transcriptome, functional annotation, co-expression network, guilt-by-association, functional validation, CRISPR-Cas9
National Category
Genetics and Genomics Bioinformatics and Computational Biology Plant Biotechnology
Identifiers
urn:nbn:se:umu:diva-229993 (URN)978-91-8070-491-5 (ISBN)978-91-8070-492-2 (ISBN)
Public defence
2024-10-24, Stora hörsalen, byggnad KBC, Umeå, 14:00 (English)
Opponent
Supervisors
Available from: 2024-10-03 Created: 2024-09-25 Last updated: 2025-02-05Bibliographically approved
2. A systems genetics approach to identify candidate genes driving salicinoid diversity in Populus tremula
Open this publication in new window or tab >>A systems genetics approach to identify candidate genes driving salicinoid diversity in Populus tremula
2025 (English)Doctoral thesis, comprehensive summary (Other academic)
Alternative title[sv]
Systemgenetik som verktyg för att identifiera generna som styr salicinoid-diversitet i Populus tremula
Abstract [en]

Trees have evolved an impressive array of strategies to cope with the challenges of having a long and sessile life. Not only must they withstand a fluctuating climate, but they also face instantaneous pressures from herbivores and other attackers. To protect themselves, plants can produce defence compounds, many of which are highly specialised and taxon specific. Within the Salicaceae family, a key group of such defence compounds are the salicinoid phenolic glycosides (SPGs). Many structural variants of SPGs have been identified, in which acyl groups (e.g., cinnamoyl, benzoyl, and acetyl) are common. Some of these SPGs can have toxic and deterrent effects against attackers, and a few are known for their medicinal properties in humans. However, the biological function of most SPGs in planta remains unclear, and the causal enzymes for the majority of SPGs are yet to be identified. 

The aim of this thesis was to uncover the genetic basis of SPG biosynthesis in European aspen (Populus tremula L.) and to determine the extent of ontogenic and organ-specific variation among individuals. To achieve this, SPG variation within a collection of natural aspen, the Swedish aspen (SwAsp) collection, was investigated using an integrative multi-omic approach. By analysing the metabolome and transcriptome of multiple leaf ages from aspen individuals with varying levels of cinnamoyl and acetylated SPGs, a set of candidate transferases and novel putative SPGs were identified. These analyses further suggested that young leaf tissue is a highly active site of SPG biosynthesis, compared with mature leaves.  

To extend this analysis, we performed genome-wide association studies on transcriptomic and metabolomic data from leaf buds to identify genomic regions associated with variation in SPG abundance and gene expression. These data were integrated into a systems genetics network, visualising the intricate relationship between candidate genes and the diversity of SPGs. Among the candidates, an acyltransferase was highly associated with both acetyl- and cinnamoyl-SPGs. Heterologous expression assays in Escherichia coli (E. coli) confirmed its acetylation activity. In line with these findings, overexpression of the gene in planta led to increased levels of acetyl-SPGs, suggesting acetylation activity of the enzyme.

In summary, these results have enhanced our understanding of SPG biosynthesis and provide a foundation for future studies aimed at elucidating the in planta function of the remaining candidate genes. 

Place, publisher, year, edition, pages
Umeå: Umeå University, 2025. p. 107
Keywords
aspen, Populus tremula, systems genetics, GWAS, eQTL, metabolomics, specialised metabolites, salicinoid phenolic glycosides, chemotype, liquid chromatography-mass spectrometry, transcriptomics, RNA-Seq
National Category
Bioinformatics and Computational Biology Genetics and Genomics Plant Biotechnology
Identifiers
urn:nbn:se:umu:diva-247549 (URN)978-91-8070-866-1 (ISBN)978-91-8070-867-8 (ISBN)
Public defence
2026-01-09, KBE301-Lilla hörsalen, KBC-huset, Umeå, 13:00 (English)
Opponent
Supervisors
Available from: 2025-12-19 Created: 2025-12-12 Last updated: 2025-12-12Bibliographically approved

Open Access in DiVA

fulltext(41855 kB)179 downloads
File information
File name FULLTEXT02.pdfFile size 41855 kBChecksum SHA-512
9bb729439ea1e009f42444c20e56a87754da36c574754840a88a690a73c3c345992224bfef2bcda570606c27eb6bd1b61a1c2ad8eebf947345584c19ef58aca7
Type fulltextMimetype application/pdf

Other links

Publisher's full textPubMedScopus

Authority records

Robinson, Kathryn M.Schiffthaler, BastianRydman, Sara M.Ahlgren Kalman, TeiturKumar, VikashCanovi, CamillaDelhomme, NicolasMähler, NiklasRichau, Kerstin HMannapperuma, ChanakaJansson, StefanStreet, Nathaniel

Search in DiVA

By author/editor
Robinson, Kathryn M.Schiffthaler, BastianRydman, Sara M.Ahlgren Kalman, TeiturKumar, VikashCanovi, CamillaDelhomme, NicolasMähler, NiklasRichau, Kerstin HMannapperuma, ChanakaJansson, StefanStreet, Nathaniel
By organisation
Department of Plant PhysiologyUmeå Plant Science Centre (UPSC)
In the same journal
Physiologia Plantarum
Bioinformatics and Computational BiologyGenetics and Genomics

Search outside of DiVA

GoogleGoogle Scholar
Total: 180 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

doi
pubmed
urn-nbn

Altmetric score

doi
pubmed
urn-nbn
Total: 435 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf