Umeå University's logo

umu.sePublications
Change search
Link to record
Permanent link

Direct link
Stenberg, Per
Alternative names
Publications (10 of 50) Show all publications
Bernenko, D., Lee, S. H., Stenberg, P. & Lizana, L. (2023). Mapping the semi-nested community structure of 3D chromosome contact networks. PloS Computational Biology, 19(7), Article ID e1011185.
Open this publication in new window or tab >>Mapping the semi-nested community structure of 3D chromosome contact networks
2023 (English)In: PloS Computational Biology, ISSN 1553-734X, E-ISSN 1553-7358, Vol. 19, no 7, article id e1011185Article in journal (Refereed) Published
Abstract [en]

Mammalian DNA folds into 3D structures that facilitate and regulate genetic processes such as transcription, DNA repair, and epigenetics. Several insights derive from chromosome capture methods, such as Hi-C, which allow researchers to construct contact maps depicting 3D interactions among all DNA segment pairs. These maps show a complex cross-scale organization spanning megabase-pair compartments to short-ranged DNA loops. To better understand the organizing principles, several groups analyzed Hi-C data assuming a Russian-doll-like nested hierarchy where DNA regions of similar sizes merge into larger and larger structures. Apart from being a simple and appealing description, this model explains, e.g., the omnipresent chequerboard pattern seen in Hi-C maps, known as A/B compartments, and foreshadows the co-localization of some functionally similar DNA regions. However, while successful, this model is incompatible with the two competing mechanisms that seem to shape a significant part of the chromosomes' 3D organization: loop extrusion and phase separation. This paper aims to map out the chromosome's actual folding hierarchy from empirical data. To this end, we take advantage of Hi-C experiments and treat the measured DNA-DNA interactions as a weighted network. From such a network, we extract 3D communities using the generalized Louvain algorithm. This algorithm has a resolution parameter that allows us to scan seamlessly through the community size spectrum, from A/B compartments to topologically associated domains (TADs). By constructing a hierarchical tree connecting these communities, we find that chromosomes are more complex than a perfect hierarchy. Analyzing how communities nest relative to a simple folding model, we found that chromosomes exhibit a significant portion of nested and non-nested community pairs alongside considerable randomness. In addition, by examining nesting and chromatin types, we discovered that nested parts are often associated with active chromatin. These results highlight that cross-scale relationships will be essential components in models aiming to reach a deep understanding of the causal mechanisms of chromosome folding.

Place, publisher, year, edition, pages
Public Library of Science (PLoS), 2023
National Category
Genetics and Genomics
Identifiers
urn:nbn:se:umu:diva-212472 (URN)10.1371/journal.pcbi.1011185 (DOI)001024553700001 ()37432974 (PubMedID)2-s2.0-85165521366 (Scopus ID)
Funder
Swedish Research Council, 2017-03848Swedish Research Council, 2021-04080Knut and Alice Wallenberg Foundation, 2014-0018
Available from: 2023-08-04 Created: 2023-08-04 Last updated: 2025-02-07Bibliographically approved
Lewerentz, J., Johansson, A.-M. & Stenberg, P. (2023). The path to immortalization of cells starts by managing stress through gene duplications. Experimental Cell Research, 422(1), Article ID 113431.
Open this publication in new window or tab >>The path to immortalization of cells starts by managing stress through gene duplications
2023 (English)In: Experimental Cell Research, ISSN 0014-4827, E-ISSN 1090-2422, Vol. 422, no 1, article id 113431Article in journal (Refereed) Published
Abstract [en]

The genomes of immortalized cell lines (and cancer cells) are characterized by multiple types of aberrations, ranging from single nucleotide polymorphisms (SNPs) to structural rearrangements that have accumulated over time. Consequently, it is difficult to estimate the relative impact of different aberrations, the order of events, and which gene functions were under selective pressure at the early stage towards cellular immortalization. Here, we have established novel cell cultures derived from Drosophila melanogaster embryos that were sampled at multiple time points over a one-year period. Using short-read DNA sequencing, we show that copy-number gain in preferentially stress-related genes were acquired in a dominant fraction of cells in 300-days old cultures. Furthermore, transposable elements were active in cells of all cultures. Only a few (<1%) SNPs could be followed over time, and these showed no trend to increase or decrease. We conclude that the early cellular responses of a novel culture comprise sequence duplication and transposable element activity. During immortalization, positive selection first occurs on genes that are related to stress response before shifting to genes that are related to growth.

Place, publisher, year, edition, pages
Elsevier, 2023
Keywords
Cancer, Cell line, Copy number, DNA sequencing, Evolution, Stress response
National Category
Cell and Molecular Biology
Identifiers
urn:nbn:se:umu:diva-201458 (URN)10.1016/j.yexcr.2022.113431 (DOI)000892211300004 ()36423660 (PubMedID)2-s2.0-85142680757 (Scopus ID)
Funder
Knut and Alice Wallenberg Foundation, 2014-0018
Available from: 2022-12-06 Created: 2022-12-06 Last updated: 2023-09-05Bibliographically approved
Brindefalk, B., Brolin, H., Säve-Söderbergh, M., Karlsson, E., Sundell, D., Wikström, P., . . . Ahlinder, J. (2022). Bacterial composition in Swedish raw drinking water reveals three major interacting ubiquitous metacommunities. MicrobiologyOpen, 11(5), Article ID e1320.
Open this publication in new window or tab >>Bacterial composition in Swedish raw drinking water reveals three major interacting ubiquitous metacommunities
Show others...
2022 (English)In: MicrobiologyOpen, E-ISSN 2045-8827, Vol. 11, no 5, article id e1320Article in journal (Refereed) Published
Abstract [en]

Background: Surface raw water used as a source for drinking water production is a critical resource, sensitive to contamination. We conducted a study on Swedish raw water sources, aiming to identify mutually co-occurring metacommunities of bacteria, and environmental factors driving such patterns.

Methods: The water sources were different regarding nutrient composition, water quality, and climate characteristics, and displayed various degrees of anthropogenic impact. Water inlet samples were collected at six drinking water treatment plants over 3 years, totaling 230 samples. The bacterial communities of DNA sequenced samples (n = 175), obtained by 16S metabarcoding, were analyzed using a joint model for taxa abundance.

Results: Two major groups of well-defined metacommunities of microorganisms were identified, in addition to a third, less distinct, and taxonomically more diverse group. These three metacommunities showed various associations to the measured environmental data. Predictions for the well-defined metacommunities revealed differing sets of favored metabolic pathways and life strategies. In one community, taxa with methanogenic metabolism were common, while a second community was dominated by taxa with carbohydrate and lipid-focused metabolism.

Conclusion: The identification of ubiquitous persistent co-occurring bacterial metacommunities in freshwater habitats could potentially facilitate microbial source tracking analysis of contamination issues in freshwater sources.

Place, publisher, year, edition, pages
John Wiley & Sons, 2022
Keywords
16S rRNA, anthropogenic effects, bacterial community analysis, biotic interactions, generalized linear latent variable model
National Category
Environmental Sciences Ecology
Identifiers
urn:nbn:se:umu:diva-201110 (URN)10.1002/mbo3.1320 (DOI)000859098400001 ()36314747 (PubMedID)2-s2.0-85141178520 (Scopus ID)
Funder
Swedish Civil Contingencies Agency
Available from: 2022-11-22 Created: 2022-11-22 Last updated: 2023-08-02Bibliographically approved
Bernenko, D., Lee, S. H., Stenberg, P. & Lizana, L. (2022). Mapping the semi-nested community structure of 3D chromosome contact networks.
Open this publication in new window or tab >>Mapping the semi-nested community structure of 3D chromosome contact networks
2022 (English)Manuscript (preprint) (Other academic)
Abstract [en]

Mammalian DNA folds into 3D structures that facilitate and regulate genetic processes such as transcription, DNA repair, and epigenetics. Several insights derive from chromosome capture methods, such as Hi-C, which allow researchers to construct contact maps depicting 3D interactions among all DNA segment pairs. These maps show a complex cross-scale organization spanning megabase-pair compartments to short-ranged DNA loops. To better understand the organizing principles, several groups analyzed Hi-C data assuming a Russian-doll-like nested hierarchy where DNA regions of similar sizes merge into larger and larger structures. Apart from being a simple and appealing description, this model explains, e.g., the omnipresent chequerboard pattern seen in Hi-C maps, known as A/B compartments, and foreshadows the co-localization of some functionally similar DNA regions. However, while successful, this model is incompatible with the two competing mechanisms that seem to shape a significant part of the chromosomes’ 3D organization: loop extrusion and phase separation. This paper aims to map out the chromosome’s actual folding hierarchy from empirical data. To this end, we take advantage of Hi-C experiments and treat the measured DNA-DNA interactions as a weighted network. From such a network, we extract 3D communities using the generalized Louvain algorithm. This algorithm has a resolution parameter that allows us to scan seamlessly through the community size spectrum, from A/B compartments to topologically associated domains (TADs). By constructing a hierarchical tree connecting these communities, we find that chromosomes are more complex than a perfect hierarchy. Analyzing how communities nest relative to a simple folding model, we found that chromosomes exhibit a significant portion of nested and non-nested community pairs alongside considerable randomness. In addition, by examining nesting and chromatin types, we discovered that nested parts are often associated with active chromatin. These results highlight that crossscale relationships will be essential components in models aiming to reach a deep understanding of the causal mechanisms of chromosome folding.

National Category
Other Computer and Information Science Other Physics Topics
Identifiers
urn:nbn:se:umu:diva-207315 (URN)
Available from: 2023-04-26 Created: 2023-04-26 Last updated: 2023-04-27
Lewerentz, J., Johansson, A.-M., Larsson, J. & Stenberg, P. (2022). Transposon activity, local duplications and propagation of structural variants across haplotypes drive the evolution of the Drosophila S2 cell line. BMC Genomics, 23(1), Article ID 276.
Open this publication in new window or tab >>Transposon activity, local duplications and propagation of structural variants across haplotypes drive the evolution of the Drosophila S2 cell line
2022 (English)In: BMC Genomics, E-ISSN 1471-2164, Vol. 23, no 1, article id 276Article in journal (Refereed) Published
Abstract [en]

Background: Immortalized cell lines are widely used model systems whose genomes are often highly rearranged and polyploid. However, their genome structure is seldom deciphered and is thus not accounted for during analyses. We therefore used linked short- and long-read sequencing to perform haplotype-level reconstruction of the genome of a Drosophila melanogaster cell line (S2-DRSC) with a complex genome structure.

Results: Using a custom implementation (that is designed to use ultra-long reads in complex genomes with nested rearrangements) to call structural variants (SVs), we found that the most common SV was repetitive sequence insertion or deletion (> 80% of SVs), with Gypsy retrotransposon insertions dominating. The second most common SV was local sequence duplication. SNPs and other SVs were rarer, but several large chromosomal translocations and mitochondrial genome insertions were observed. Haplotypes were highly similar at the nucleotide level but structurally very different. Insertion SVs existed at various haplotype frequencies and were unlinked on chromosomes, demonstrating that haplotypes have different structures and suggesting the existence of a mechanism that allows SVs to propagate across haplotypes. Finally, using public short-read data, we found that transposable element insertions and local duplications are common in other D. melanogaster cell lines.

Conclusions: The S2-DRSC cell line evolved through retrotransposon activity and vast local sequence duplications, that we hypothesize were the products of DNA re-replication events. Additionally, mutations can propagate across haplotypes (possibly explained by mitotic recombination), which enables fine-tuning of mutational impact and prevents accumulation of deleterious events, an inherent problem of clonal reproduction. We conclude that traditional linear homozygous genome representation conceals the complexity when dealing with rearranged and heterozygous clonal cells.

Place, publisher, year, edition, pages
BioMed Central, 2022
Keywords
Cell-line evolution, Haplotype structure, S2-DRSC, Structural rearrangements
National Category
Genetics and Genomics
Identifiers
urn:nbn:se:umu:diva-194268 (URN)10.1186/s12864-022-08472-1 (DOI)000779371300002 ()35392795 (PubMedID)2-s2.0-85127755482 (Scopus ID)
Funder
Knut and Alice Wallenberg Foundation, 2014–0018Swedish Cancer Society, 2017/342Swedish Cancer Society, 20 0779
Available from: 2022-04-29 Created: 2022-04-29 Last updated: 2025-02-07Bibliographically approved
Sundell, D., Öhrman, C., Svensson, D., Karlsson, E., Brindefalk, B., Myrtennäs, K., . . . Sjödin, A. (2021). FlexTaxD: flexible modification of taxonomy databases for improved sequence classification. Bioinformatics, 37(21), 3932-3933
Open this publication in new window or tab >>FlexTaxD: flexible modification of taxonomy databases for improved sequence classification
Show others...
2021 (English)In: Bioinformatics, ISSN 1367-4803, E-ISSN 1367-4811, Vol. 37, no 21, p. 3932-3933Article in journal (Refereed) Published
Abstract [en]

The Flexible Taxonomy Database framework provides a method for modification and merging official and custom taxonomic databases to create improved databases. Using such databases will increase accuracy and precision of existing methods to classify sequence reads.

Place, publisher, year, edition, pages
Oxford University Press, 2021
National Category
Bioinformatics and Computational Biology
Identifiers
urn:nbn:se:umu:diva-189091 (URN)10.1093/bioinformatics/btab621 (DOI)000733832700036 ()2-s2.0-85126297774 (Scopus ID)
Available from: 2021-11-04 Created: 2021-11-04 Last updated: 2025-02-07Bibliographically approved
Nyberg, M., Ambjörnsson, T., Stenberg, P. & Lizana, L. (2021). Modeling Protein Target Search in Human Chromosomes. Physical Review Research, 3(1), Article ID 013055.
Open this publication in new window or tab >>Modeling Protein Target Search in Human Chromosomes
2021 (English)In: Physical Review Research, E-ISSN 2643-1564, Vol. 3, no 1, article id 013055Article in journal (Refereed) Published
Abstract [en]

Several processes in the cell, such as gene regulation, start when key proteins recognize and bind to short DNA sequences. However, as these sequences can be hundreds of million times shorter than the genome, they are hard to find by simple diffusion: diffusion-limited association rates may underestimate in vitro measurements up to several orders of magnitude. Moreover, the rates increase if the DNA is coiled rather than straight. Here we model how this works in vivo in mammalian cells. We use chromatin-chromatin contact data from Hi-C experiments to map the protein target-search onto a network problem. The nodes represent DNA segments and the weight of the links are proportional to measured contact probabilities. We then put forward a diffusion-reaction equation for the density of searching protein that allows us to calculate the association rates across the genome analytically. For segments where the rates are high, we find that they are enriched with active gene starts and have high RNA expression levels. This paper suggests that the DNA's 3D conformation is important for protein search times in vivo and offers a method to interpret protein-binding profiles in eukaryotes that cannot be explained by the DNA sequence itself.

Place, publisher, year, edition, pages
American Physical Society, 2021
National Category
Other Physics Topics
Identifiers
urn:nbn:se:umu:diva-164023 (URN)10.1103/PhysRevResearch.3.013055 (DOI)000608675400003 ()2-s2.0-85101603294 (Scopus ID)
Note

Originally included in thesis in manuscript form with title "Modelling Protein Target-Search in Human Chromosomes".

Available from: 2019-10-14 Created: 2019-10-14 Last updated: 2023-03-24Bibliographically approved
Karlsson, E., Johansson, A.-M., Ahlinder, J., Lundkvist, M. J., Singh, N. J., Brodin, T., . . . Stenberg, P. (2020). Airborne microbial biodiversity and seasonality in Northern and Southern Sweden. PeerJ, 8, Article ID e8424.
Open this publication in new window or tab >>Airborne microbial biodiversity and seasonality in Northern and Southern Sweden
Show others...
2020 (English)In: PeerJ, E-ISSN 2167-8359, Vol. 8, article id e8424Article in journal (Refereed) Published
Abstract [en]

Microorganisms are essential constituents of ecosystems. To improve our understanding of how various factors shape microbial diversity and composition in nature it is important to study how microorganisms vary in space and time. Factors shaping microbial communities in ground level air have been surveyed in a limited number of studies, indicating that geographic location, season and local climate influence the microbial communities. However, few have surveyed more than one location, at high latitude or continuously over more than a year. We surveyed the airborne microbial communities over two full consecutive years in Kiruna, in the Arctic boreal zone, and Ljungbyhed, in the Southern nemoral zone of Sweden, by using a unique collection of archived air filters. We mapped both geographic and seasonal differences in bacterial and fungal communities and evaluated environmental factors that may contribute to these differences and found that location, season and weather influence the airborne communities. Location had stronger influence on the bacterial community composition compared to season, while location and season had equal influence on the fungal community composition. However, the airborne bacterial and fungal diversity showed overall the same trend over the seasons, regardless of location, with a peak during the warmer parts of the year, except for the fungal seasonal trend in Ljungbyhed, which fluctuated more within season. Interestingly, the diversity and evenness of the airborne communities were generally lower in Ljungbyhed. In addition, both bacterial and fungal communities varied significantly within and between locations, where orders like Rhizobiales, Rhodospirillales and Agaricales dominated in Kiruna, whereas Bacillales, Clostridiales and Sordariales dominated in Ljungbyhed. These differences are a likely reflection of the landscape surrounding the sampling sites where the landscape in Ljungbyhed is more homogenous and predominantly characterized by artificial and agricultural surroundings. Our results further indicate that local landscape, as well as seasonal variation, shapes microbial communities in air.

Place, publisher, year, edition, pages
PeerJ, 2020
Keywords
Airborne biodiversity, Microbial seasonality, High-throughput sequencing, Metabarcoding, eDNA
National Category
Biochemistry Molecular Biology Ecology
Identifiers
urn:nbn:se:umu:diva-168962 (URN)10.7717/peerj.8424 (DOI)000509466300009 ()32025374 (PubMedID)2-s2.0-85079064960 (Scopus ID)
Available from: 2020-04-01 Created: 2020-04-01 Last updated: 2025-02-20Bibliographically approved
Dwibedi, C. K., Larsson, P., Ahlinder, J., Lindgren, P., Myrtennäs, K., Granberg, M., . . . Johansson, A. (2020). Biological amplification of low frequency mutations unravels laboratory culture history of the bio-threat agent Francisella tularensis. Forensic Science International: Genetics, 45
Open this publication in new window or tab >>Biological amplification of low frequency mutations unravels laboratory culture history of the bio-threat agent Francisella tularensis
Show others...
2020 (English)In: Forensic Science International: Genetics, ISSN 1872-4973, E-ISSN 1878-0326, Vol. 45Article in journal (Refereed) Published
Abstract [en]

Challenges of investigating a suspected bio attack include establishing if microorganisms have been cultured to produce attack material and to identify their source. Addressing both issues, we have investigated genetic variations that emerge during laboratory culturing of the bacterial pathogen Francisella tularensis. Key aims were to identify genetic variations that are characteristic of laboratory culturing and explore the possibility of using biological amplification to identify genetic variation present at exceedingly low frequencies in a source sample. We used parallel serial passage experiments and high-throughput sequencing of F. tularensis to explore the genetic variation. We found that during early laboratory culture passages of F. tularensis, gene duplications emerged in the pathogen genome followed by single-nucleotide polymorphisms in genes for bacterial capsule synthesis. Based on a biological enrichment scheme and the use of high-throughput sequencing, we identified genetic variation that likely pre-existed in a source sample. The results support that capsule synthesis gene mutations are common during laboratory culture, and that a biological amplification strategy is useful for linking a F. tularensis sample to a specific laboratory variant among many highly similar variants.

Place, publisher, year, edition, pages
Elsevier, 2020
Keywords
Microbial forensics, Francisella tularensis, Molecular evolution, Genetic variation, Monomorphic bacteria, Source attribution
National Category
Medical Genetics and Genomics
Identifiers
urn:nbn:se:umu:diva-168168 (URN)10.1016/j.fsigen.2019.102230 (DOI)000508656600013 ()31924594 (PubMedID)2-s2.0-85077368647 (Scopus ID)
Note

Errata: Dwibedi C, Larsson P, Ahlinder J, Lindgren P, Myrtennäs K, Granberg M, et al., Corrigendum to "Biological amplification of low frequency mutations unravels laboratory culture history of the bio-threat agent Francisella tularensis". Forensic Sci. Int.: Genet. 45 (2020) 102230. DOI: 10.1016/j.fsigen.2024.103063.

Available from: 2020-03-17 Created: 2020-03-17 Last updated: 2025-02-10Bibliographically approved
Mahmud, A. K., Yang, D., Stenberg, P., Ioshikhes, I. & Nandi, S. (2020). Exploring a Drosophila Transcription Factor Interaction Network to Identify Cis-Regulatory Modules. Paper presented at 36th International Conference on Machine Learning (ICML), JUN 10-15, 2019, Long Beach, CA. Journal of Computational Biology, 27(8), 1313-1328
Open this publication in new window or tab >>Exploring a Drosophila Transcription Factor Interaction Network to Identify Cis-Regulatory Modules
Show others...
2020 (English)In: Journal of Computational Biology, ISSN 1066-5277, E-ISSN 1557-8666, Vol. 27, no 8, p. 1313-1328Article in journal (Refereed) Published
Abstract [en]

Multiple transcription factors (TFs) bind to specific sites in the genome and interact among themselves to form the cis-regulatory modules (CRMs). They are essential in modulating the expression of genes, and it is important to study this interplay to understand gene regulation. In the present study, we integrated experimentally identified TF binding sites collected from published studies with computationally predicted TF binding sites to identifyDrosophilaCRMs. Along with the detection of the previously known CRMs, this approach identified novel protein combinations. We determined high-occupancy target sites, where a large number of TFs bind. Investigating these sites revealed that Giant, Dichaete, and Knirp are highly enriched in these locations. A common TAG team motif was observed at these sites, which might play a role in recruiting other TFs. While comparing the binding sites at distal and proximal promoters, we found that certain regulatory TFs, such as Zelda, were highly enriched in enhancers. Our study has shown that, from the information available concerning the TF binding sites, the real CRMs could be predicted accurately and efficiently. Although we only may claim co-occurrence of these proteins in this study, it may actually point to their interaction (as known interaction proteins typically co-occur together). Such an integrative approach can, therefore, help us to provide a better understanding of the interplay among the factors, even though further experimental verification is required.

Place, publisher, year, edition, pages
Mary Ann Liebert, 2020
Keywords
cis-regulatory modules, Drosophila transcription factors, high-occupancy target sites, transcription factors association
National Category
Biochemistry Molecular Biology
Identifiers
urn:nbn:se:umu:diva-174467 (URN)10.1089/cmb.2018.0160 (DOI)000556715100013 ()31855461 (PubMedID)2-s2.0-85089301180 (Scopus ID)
Conference
36th International Conference on Machine Learning (ICML), JUN 10-15, 2019, Long Beach, CA
Available from: 2020-08-26 Created: 2020-08-26 Last updated: 2025-02-20Bibliographically approved
Organisations

Search in DiVA

Show all publications