An improved chromosome-scale genome assembly and population genetics resource for Populus tremula
Umeå University, Faculty of Science and Technology, Department of Plant Physiology. Umeå University, Faculty of Science and Technology, Umeå Plant Science Centre (UPSC). ORCID iD: 0000-0002-5249-604X
Umeå University, Faculty of Science and Technology, Umeå Plant Science Centre (UPSC). Umeå University, Faculty of Science and Technology, Department of Plant Physiology. ORCID iD: 0000-0002-9771-467X
National Engineering Laboratory for Tree Breeding; Key Laboratory of Genetics and Breeding in Forest Trees and Ornamental Plants, Ministry of Education; The Tree and Ornamental Plant Breeding and Biotechnology Laboratory of National Forestry and Grassland Administration, College of Biological Sciences and Technology, Beijing Forestry University, China.
Umeå University, Faculty of Science and Technology, Department of Plant Physiology. Umeå University, Faculty of Science and Technology, Umeå Plant Science Centre (UPSC).
2024 (English). In: Physiologia Plantarum, ISSN 0031-9317, E-ISSN 1399-3054, Vol. 176, no. 5, article id e14511. Article in journal (Refereed). Published
Abstract [en]

Aspen (Populus tremula L.) is a keystone species and a model system for forest tree genomics. We present an updated resource comprising a chromosome-scale assembly, population genetics and genomics data. Using the resource, we explore the genetic basis of natural variation in leaf size and shape, traits with complex genetic architecture.

We generated the genome assembly using long-read sequencing, optical and high-density genetic maps. We conducted whole-genome resequencing of the Umeå Aspen (UmAsp) collection. Using the assembly and re-sequencing data from the UmAsp, Swedish Aspen (SwAsp) and Scottish Aspen (ScotAsp) collections we performed genome-wide association analyses (GWAS) using Single Nucleotide Polymorphisms (SNPs) for 26 leaf physiognomy phenotypes. We conducted Assay of Transposase Accessible Chromatin sequencing (ATAC-Seq), identified genomic regions of accessible chromatin, and subset SNPs to these regions, improving the GWAS detection rate. We identified candidate long non-coding RNAs in leaf samples, quantified their expression in an updated co-expression network, and used this to explore the functions of candidate genes identified from the GWAS.
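
The step of subsetting SNPs to accessible-chromatin regions can be illustrated with a minimal sketch. This is not the authors' pipeline: the function names, the tuple-based data layout and the 0-based, half-open interval convention are all assumptions made for illustration.

```python
from bisect import bisect_right
from collections import defaultdict


def build_index(regions):
    """Group (chrom, start, end) intervals by chromosome and merge overlaps.

    Intervals are assumed to be 0-based and half-open: [start, end).
    """
    by_chrom = defaultdict(list)
    for chrom, start, end in regions:
        by_chrom[chrom].append((start, end))

    index = {}
    for chrom, intervals in by_chrom.items():
        intervals.sort()
        merged = [list(intervals[0])]
        for start, end in intervals[1:]:
            if start <= merged[-1][1]:
                # Overlapping or adjacent: extend the previous interval.
                merged[-1][1] = max(merged[-1][1], end)
            else:
                merged.append([start, end])
        index[chrom] = ([s for s, _ in merged], [e for _, e in merged])
    return index


def in_accessible(index, chrom, pos):
    """Return True if position `pos` on `chrom` lies inside a merged interval."""
    if chrom not in index:
        return False
    starts, ends = index[chrom]
    i = bisect_right(starts, pos) - 1  # last interval starting at or before pos
    return i >= 0 and pos < ends[i]


def subset_snps(snps, regions):
    """Keep only the (chrom, pos) SNPs that fall in accessible regions."""
    index = build_index(regions)
    return [(chrom, pos) for chrom, pos in snps if in_accessible(index, chrom, pos)]
```

Merging the intervals first means each SNP lookup is a single binary search per chromosome, which keeps the filter cheap even for genome-wide SNP sets.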

A GWAS found SNP associations for seven traits. The associated SNPs were in or near genes annotated with developmental functions, which represent candidates for further study. Of particular interest was a ~177-kbp region harbouring associations with several leaf phenotypes in ScotAsp.

We have incorporated the assembly, population genetics, genomics, and GWAS data into the PlantGenIE.org web resource, including updating existing genomics data to the new genome version, to enable easy exploration and visualisation. We provide all raw and processed data to facilitate reuse in future studies.

Place, publisher, year, edition, pages
John Wiley & Sons, 2024. Vol. 176, no. 5, article id e14511
Keywords [en]
genome assembly, natural selection, co-expression, population genetics, Populus, aspen, GWAS, leaf physiognomy, leaf shape, leaf size, genetic architecture, ATAC-Seq, lncRNA
National Category
Bioinformatics and Computational Biology; Genetics and Genomics
Identifiers
URN: urn:nbn:se:umu:diva-229976; DOI: 10.1111/ppl.14511; ISI: 001313686100001; PubMed ID: 39279509; Scopus ID: 2-s2.0-85204093798; OAI: oai:DiVA.org:umu-229976; DiVA, id: diva2:1900462
Funder
Swedish Research Council (Vetenskapsrådet), 2019-05476; Swedish Research Council Formas, 2018-01644; Vinnova, S111416L0710
Note

Supplementary figures and appendixes under Supporting information on article web page. 

Available from: 2024-09-23. Created: 2024-09-23. Last updated: 2025-04-24. Bibliographically approved
In thesis
1. Tackling a genomic abyss: approaches to link long non-coding RNAs to potential biological function in Norway spruce and aspen
2024 (English). Doctoral thesis, comprehensive summary (Other academic)
Alternative title [sv]
Att tackla en genomisk avgrund : tillvägagångssätt för att koppla långa icke-kodande RNA till potentiell biologisk funktion i gran och asp
Abstract [en]

Protein-coding genes have been extensively studied in both plant and animal genomes, while the non-coding portions of genomes were long considered irrelevant. This was because non-coding was immediately equated with non-functional, until the discovery of let-7, the first conserved miRNA, in Caenorhabditis elegans. Since then, several studies of small RNAs (sRNAs) have been performed, while long non-coding RNAs (lncRNAs) have attracted attention in the last two decades, partly because of their use as diagnostic biomarkers in cancer. Studies to assign function to RNAs have progressed more slowly in plants than in animals, and there is still much to explore even in the protein-coding space, especially in huge genomes such as those of Norway spruce and Scots pine, so the non-coding part of the genome still represents an abyss to explore. In my PhD I mostly focused on a subclass of non-coding RNAs in Norway spruce and aspen. Long non-coding RNAs are defined, by an arbitrary threshold, as longer than 200 nucleotides (nt) and can have one small open reading frame (sORF, length < 300 nt) coding for a short peptide (not a complete protein). lncRNAs tend to be expressed at lower levels than protein-coding genes, but with precise spatio-temporal patterns. They are mostly expressed in particular tissues, at particular stages of a biological process and/or under particular conditions, which are often related to biotic or abiotic stresses. They have low levels of sequence conservation, even between closely related species. In particular, I studied the class of lncRNAs located in the intergenic space, the long intergenic non-coding RNAs (lincRNAs).

In the first part of this thesis, I developed a pipeline to identify lincRNAs. This pipeline identifies bona fide lincRNAs in silico, starting from an RNA-Sequencing dataset. It is an ensemble method that combines different tools and takes the characteristics of lincRNAs into account.

In the second part of this thesis, I focused on functionally annotating lincRNAs. To address this challenge, I used the guilt-by-association strategy. This method relies on a co-expression network containing both lincRNAs and protein-coding genes. Through functional enrichment of the protein-coding genes in a module, it is possible to transfer the enriched annotation to a lincRNA co-expressed in the same module. I also tried to relate lincRNAs to a possible function in the de novo methylation of DNA via the RdDM pathway in Norway spruce.
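
As an illustration of the guilt-by-association idea, the sketch below tests each co-expression module for over-represented annotation terms among its protein-coding genes and assigns the enriched terms to the lincRNAs in that module. It is a simplified sketch under stated assumptions, not the method used in the thesis: the function names, the input dictionaries, the one-sided hypergeometric test and the uncorrected `alpha` threshold are all hypothetical choices, and any gene without an annotation entry is treated as a lincRNA.

```python
from math import comb


def hypergeom_sf(k, M, n, N):
    """P(X >= k) when N genes are drawn from M, of which n carry the term."""
    upper = min(n, N)
    hits = sum(comb(n, i) * comb(M - n, N - i) for i in range(k, upper + 1))
    return hits / comb(M, N)


def annotate_lincrnas(modules, annotations, alpha=0.05):
    """Transfer module-enriched terms to the lincRNAs of each module.

    modules:     module id -> set of gene ids (coding genes and lincRNAs)
    annotations: protein-coding gene id -> set of annotation terms
    Assumption: ids missing from `annotations` are lincRNAs.
    No multiple-testing correction is applied in this sketch.
    """
    # Background: every annotated gene that appears in some module.
    background = {g for members in modules.values() for g in members if g in annotations}
    M = len(background)

    # How many background genes carry each term.
    term_counts = {}
    for gene in background:
        for term in annotations[gene]:
            term_counts[term] = term_counts.get(term, 0) + 1

    result = {}
    for members in modules.values():
        coding = {g for g in members if g in annotations}
        lincs = members - coding
        N = len(coding)

        # How many coding genes in this module carry each term.
        in_module = {}
        for gene in coding:
            for term in annotations[gene]:
                in_module[term] = in_module.get(term, 0) + 1

        enriched = {
            term
            for term, k in in_module.items()
            if hypergeom_sf(k, M, term_counts[term], N) <= alpha
        }
        for linc in lincs:
            result[linc] = set(enriched)
    return result
```

In practice a correction for multiple testing would normally be applied across terms and modules before any annotation is transferred; it is left out here to keep the sketch short.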

In the last part of this thesis, I identified lincRNAs expressed during leaf development in aspen and produced CRISPR-Cas9 mutants lacking the sequence of two lincRNAs in order to provide a functional validation. 

In general, RNA-Sequencing has enabled and advanced the identification of lincRNAs, and this thesis demonstrates an implemented strategy to identify and assign putative functional information to lincRNAs, deepening the knowledge in the non-coding abyss.

Place, publisher, year, edition, pages
Umeå: Umeå University, 2024. p. 58
Keywords
Norway spruce, aspen, non-coding RNAs, long non-coding RNAs, RNA-Seq, transcriptome, functional annotation, co-expression network, guilt-by-association, functional validation, CRISPR-Cas9
National Category
Genetics and Genomics; Bioinformatics and Computational Biology; Plant Biotechnology
Identifiers
urn:nbn:se:umu:diva-229993 (URN); 978-91-8070-491-5 (ISBN); 978-91-8070-492-2 (ISBN)
Public defence
2024-10-24, Stora hörsalen, KBC building, Umeå, 14:00 (English)
Opponent
Supervisors
Available from: 2024-10-03. Created: 2024-09-25. Last updated: 2025-02-05. Bibliographically approved

Open Access in DiVA

fulltext (41855 kB), 64 downloads
File information
File name: FULLTEXT02.pdf; File size: 41855 kB; Checksum: SHA-512
9bb729439ea1e009f42444c20e56a87754da36c574754840a88a690a73c3c345992224bfef2bcda570606c27eb6bd1b61a1c2ad8eebf947345584c19ef58aca7
Type: fulltext; MIME type: application/pdf

Authors

Robinson, Kathryn M.; Schiffthaler, Bastian; Rydman, Sara M.; Ahlgren Kalman, Teitur; Kumar, Vikash; Canovi, Camilla; Delhomme, Nicolas; Mähler, Niklas; Richau, Kerstin H; Mannapperuma, Chanaka; Jansson, Stefan; Street, Nathaniel