Umeå University's logo

umu.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
An improved chromosome-scale genome assembly and population genetics resource for populus tremula
Umeå University, Faculty of Science and Technology, Department of Plant Physiology. Umeå University, Faculty of Science and Technology, Umeå Plant Science Centre (UPSC).ORCID iD: 0000-0002-5249-604X
Umeå University, Faculty of Science and Technology, Umeå Plant Science Centre (UPSC). Umeå University, Faculty of Science and Technology, Department of Plant Physiology.ORCID iD: 0000-0002-9771-467x
National Engineering Laboratory for Tree Breeding; Key Laboratory of Genetics and Breeding in Forest Trees and Ornamental Plants, Ministry of Education; The Tree and Ornamental Plant Breeding and Biotechnology Laboratory of National Forestry and Grassland Administration, College of Biological Sciences and Technology, Beijing Forestry University, China.
Umeå University, Faculty of Science and Technology, Department of Plant Physiology. Umeå University, Faculty of Science and Technology, Umeå Plant Science Centre (UPSC).
Show others and affiliations
2024 (English)In: Physiologia Plantarum, ISSN 0031-9317, E-ISSN 1399-3054, Vol. 176, no 5, article id e14511Article in journal (Refereed) Published
Abstract [en]

Aspen (Populus tremula L.) is a keystone species and a model system for forest tree genomics. We present an updated resource comprising a chromosome-scale assem- bly, population genetics and genomics data. Using the resource, we explore the genetic basis of natural variation in leaf size and shape, traits with complex genetic architecture.

We generated the genome assembly using long-read sequencing, optical and high-density genetic maps. We conducted whole-genome resequencing of the Umeå Aspen (UmAsp) collection. Using the assembly and re-sequencing data from the UmAsp, Swedish Aspen (SwAsp) and Scottish Aspen (ScotAsp) collections we performed genome-wide association analyses (GWAS) using Single Nucleotide Polymorphisms (SNPs) for 26 leaf physiognomy phenotypes. We conducted Assay of Transposase Accessible Chromatin sequencing (ATAC-Seq), identified genomic regions of accessible chromatin, and subset SNPs to these regions, improving the GWAS detection rate. We identified candidate long non-coding RNAs in leaf samples, quantified their expression in an updated co-expression network, and used this to explore the functions of candidate genes identified from the GWAS.

A GWAS found SNP associations for seven traits. The associated SNPs were in or near genes annotated with developmental functions, which represent candidates for further study. Of particular interest was a !177-kbp region harbouring associations with several leaf phenotypes in ScotAsp.

We have incorporated the assembly, population genetics, genomics, and GWAS data into the PlantGenIE.org web resource, including updating existing genomics data to the new genome version, to enable easy exploration and visualisation. We provide all raw and processed data to facilitate reuse in future studies.

Place, publisher, year, edition, pages
John Wiley & Sons, 2024. Vol. 176, no 5, article id e14511
Keywords [en]
genome assembly, natural selection, co-expression, population genetics, Populus, aspen, GWAS, leaf physiognomy, leaf shape, leaf size, genetic architecture, ATAC-Seq, lncRNA
National Category
Bioinformatics and Computational Biology Genetics and Genomics
Identifiers
URN: urn:nbn:se:umu:diva-229976DOI: 10.1111/ppl.14511ISI: 001313686100001PubMedID: 39279509Scopus ID: 2-s2.0-85204093798OAI: oai:DiVA.org:umu-229976DiVA, id: diva2:1900462
Funder
Swedish Research Council, 2019-05476Swedish Research Council Formas, 2018-01644Vinnova, S111416L0710
Note

Supplementary figures and appendixes under Supporting information on article web page. 

Available from: 2024-09-23 Created: 2024-09-23 Last updated: 2025-04-24Bibliographically approved
In thesis
1. Tackling a genomic abyss: approaches to link long non-coding RNAs to potential biological function in Norway spruce and aspen
Open this publication in new window or tab >>Tackling a genomic abyss: approaches to link long non-coding RNAs to potential biological function in Norway spruce and aspen
2024 (English)Doctoral thesis, comprehensive summary (Other academic)
Alternative title[sv]
Att tackla en genomisk avgrund : tillvägagångssätt för att koppla långa icke-kodande RNA till potentiell biologisk funktion i gran och asp
Abstract [en]

Protein coding genes have been extensively studied in both plant and animal genomes, while non-coding portions of the genomes were considered not relevant for a long time. This was due to the fact that non-coding led immediately to not functional, until the discovery of let-7, the first conserved miRNA, in Caenorhabditis elegans. From here on, several studies on small RNAs (sRNAs) were performed, while long non-coding RNAs (lncRNAs) have risen to attention in the last two decades, also because of their usage as diagnostic biomarkers in cancer. Studies to assign function to RNAs have progressed more slowly in plants compared to the animal kingdom and there is still a lot to explore even in the protein coding space, above all if we consider huge genomes like Norway spruce and Scots pine, so the non-coding part of the genome still represents an abyss to discover. In my PhD I mostly focused on a subclass of non-coding RNAs in Norway spruce and aspen. Long non-coding RNAs are considered arbitrarily longer than 200 nucleotides (nt) and can have one small open reading frame (sORF, length < 300 nt) coding for a short peptide (not a complete protein). lncRNAs tend to be expressed at lower levels than genes, but with precise spatio-temporal patterns. They are mostly expressed in particular tissues, stages of a biological process and/or particular conditions, that are often related to biotic or abiotic stresses. They have low levels of sequence homology conservation, even in close related species. In particular, I studied the class of lncRNAs located in the intergenic space, the long intergenic non-coding RNAs (lincRNAs). 

In the first part of this thesis, I developed a pipeline to identify lincRNAs. This pipeline allows to identify in silico bona fide lincRNAs starting from an RNA-Sequencing dataset. It is an ensemble method, considering different tools and the characteristics of lincRNAs. 

In the second part of this thesis, I focused on functionally annotating lincRNAs. To achieve this challenge, I decided to use the guilt-by-association strategy. This method relies on a co-expression network containing both lincRNAs and protein coding genes. Through a functional enrichment of the protein coding genes, it is possible to transfer the same annotation to a lincRNA co-expressed in the same module. I have also tried to relate lincRNAs to a possible function in the de novo methylation of DNA via the RdDM pathway in Norway spruce.

In the last part of this thesis, I identified lincRNAs expressed during leaf development in aspen and produced CRISPR-Cas9 mutants lacking the sequence of two lincRNAs in order to provide a functional validation. 

In general, RNA-Sequencing has enabled and advanced the identification of lincRNAs, and this thesis demonstrates an implemented strategy to identify and assign putative functional information to lincRNAs, deepening the knowledge in the non-coding abyss.

Place, publisher, year, edition, pages
Umeå: Umeå University, 2024. p. 58
Keywords
Norway spruce, aspen, non-coding RNAs, long non-coding RNAs, RNA-Seq, transcriptome, functional annotation, co-expression network, guilt-by-association, functional validation, CRISPR-Cas9
National Category
Genetics and Genomics Bioinformatics and Computational Biology Plant Biotechnology
Identifiers
urn:nbn:se:umu:diva-229993 (URN)978-91-8070-491-5 (ISBN)978-91-8070-492-2 (ISBN)
Public defence
2024-10-24, Stora hörsalen, byggnad KBC, Umeå, 14:00 (English)
Opponent
Supervisors
Available from: 2024-10-03 Created: 2024-09-25 Last updated: 2025-02-05Bibliographically approved

Open Access in DiVA

fulltext(41855 kB)61 downloads
File information
File name FULLTEXT02.pdfFile size 41855 kBChecksum SHA-512
9bb729439ea1e009f42444c20e56a87754da36c574754840a88a690a73c3c345992224bfef2bcda570606c27eb6bd1b61a1c2ad8eebf947345584c19ef58aca7
Type fulltextMimetype application/pdf

Other links

Publisher's full textPubMedScopus

Authority records

Robinson, Kathryn M.Schiffthaler, BastianRydman, Sara M.Ahlgren Kalman, TeiturKumar, VikashCanovi, CamillaDelhomme, NicolasMähler, NiklasRichau, Kerstin HMannapperuma, ChanakaJansson, StefanStreet, Nathaniel

Search in DiVA

By author/editor
Robinson, Kathryn M.Schiffthaler, BastianRydman, Sara M.Ahlgren Kalman, TeiturKumar, VikashCanovi, CamillaDelhomme, NicolasMähler, NiklasRichau, Kerstin HMannapperuma, ChanakaJansson, StefanStreet, Nathaniel
By organisation
Department of Plant PhysiologyUmeå Plant Science Centre (UPSC)
In the same journal
Physiologia Plantarum
Bioinformatics and Computational BiologyGenetics and Genomics

Search outside of DiVA

GoogleGoogle Scholar
Total: 61 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

doi
pubmed
urn-nbn

Altmetric score

doi
pubmed
urn-nbn
Total: 222 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf