Umeå University's logo

umu.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Complete genome sequence and annotation of the laboratory reference strain Shigella flexneri serotype 5a M90T and genome-wide transcriptional start site determination
Umeå University, Faculty of Medicine, Molecular Infection Medicine Sweden (MIMS). Umeå University, Faculty of Medicine, Umeå Centre for Microbial Research (UCMR). Umeå University, Faculty of Medicine, Department of Molecular Biology (Faculty of Medicine). Department of Molecular Biology. (Andrea Puhar)ORCID iD: 0000-0002-0984-6286
Umeå University, Faculty of Medicine, Molecular Infection Medicine Sweden (MIMS). Umeå University, Faculty of Medicine, Umeå Centre for Microbial Research (UCMR). Umeå University, Faculty of Medicine, Department of Molecular Biology (Faculty of Medicine). Department of Molecular Biology. (Andrea Puhar)ORCID iD: 0000-0002-4643-9831
Umeå University, Faculty of Medicine, Molecular Infection Medicine Sweden (MIMS). Umeå University, Faculty of Medicine, Umeå Centre for Microbial Research (UCMR). Umeå University, Faculty of Medicine, Department of Molecular Biology (Faculty of Medicine). Department of Molecular Biology. (Andrea Puhar)ORCID iD: 0000-0002-9915-002x
2020 (English)In: BMC Genomics, E-ISSN 1471-2164, Vol. 21, no 1, article id 285Article in journal (Refereed) Published
Abstract [en]

Background: Shigella is a Gram-negative facultative intracellular bacterium that causes bacillary dysentery in humans. Shigella invades cells of the colonic mucosa owing to its virulence plasmid-encoded Type 3 Secretion System (T3SS), and multiplies in the target cell cytosol. Although the laboratory reference strain S. flexneri serotype 5a M90T has been extensively used to understand the molecular mechanisms of pathogenesis, its complete genome sequence is not available, thereby greatly limiting studies employing high-throughput sequencing and systems biology approaches.

Results: We have sequenced, assembled, annotated and manually curated the full genome of S. flexneri 5a M90T. This yielded two complete circular contigs, the chromosome and the virulence plasmid (pWR100). To obtain the genome sequence, we have employed long-read PacBio DNA sequencing followed by polishing with Illumina RNA-seq data. This provides a new hybrid strategy to prepare gapless, highly accurate genome sequences, which also cover AT-rich tracks or repetitive sequences that are transcribed. Furthermore, we have performed genome-wide analysis of transcriptional start sites (TSS) and determined the length of 5′ untranslated regions (5′-UTRs) at typical culture conditions for the inoculum of in vitro infection experiments. We identified 6723 primary TSS (pTSS) and 7328 secondary TSS (sTSS). The S. flexneri 5a M90T annotated genome sequence and the transcriptional start sites are integrated into RegulonDB (http://regulondb.ccg.unam.mx) and RSAT (http://embnet.ccg.unam.mx/rsat/) databases to use their analysis tools in the S. flexneri 5a M90T genome.

Conclusions: We provide the first complete genome for S. flexneri serotype 5a, specifically the laboratory reference strain M90T. Our work opens the possibility of employing S. flexneri M90T in high-quality systems biology studies such as transcriptomic and differential expression analyses or in genome evolution studies. Moreover, the catalogue of TSS that we report here can be used in molecular pathogenesis studies as a resource to know which genes are transcribed before infection of host cells. The genome sequence, together with the analysis of transcriptional start sites, is also a valuable tool for precise genetic manipulation of S. flexneri 5a M90T. Further, we present a new hybrid strategy to prepare gapless, highly accurate genome sequences. Unlike currently used hybrid strategies combining long- and short-read DNA sequencing technologies to maximize accuracy, our workflow using long-read DNA sequencing and short-read RNA sequencing provides the added value of using non-redundant technologies, which yield distinct, exploitable datasets.

Place, publisher, year, edition, pages
Springer Nature, 2020. Vol. 21, no 1, article id 285
Keywords [en]
Shigella flexneri serotype 5a M90T, Genome, Transcriptional start sites, TSS, Chromosome, Virulence plasmid, pWR100, Pseudogene, Insertion sequence, RegulonDB, RSAT
National Category
Microbiology in the medical area
Research subject
Infectious Diseases; Microbiology; Genetics
Identifiers
URN: urn:nbn:se:umu:diva-169687DOI: 10.1186/s12864-020-6565-5ISI: 000525518700002PubMedID: 32252626Scopus ID: 2-s2.0-85083071984OAI: oai:DiVA.org:umu-169687DiVA, id: diva2:1423829
Funder
Knut and Alice Wallenberg Foundation, KAW 2015.0225The Kempe Foundations, JCK-1528Available from: 2020-04-15 Created: 2020-04-15 Last updated: 2024-01-17Bibliographically approved

Open Access in DiVA

fulltext(2566 kB)502 downloads
File information
File name FULLTEXT01.pdfFile size 2566 kBChecksum SHA-512
bb60f5ce77005caddc0cad6641b055ee0d1be96739aa4da77daebb5be73a10961128815da66d69a96ee556a8250b5c61cc644230c41cdbaeffb9f5e10270ca8a
Type fulltextMimetype application/pdf

Other links

Publisher's full textPubMedScopus

Authority records

Cervantes-Rivera, RamónTronnet, SophiePuhar, Andrea

Search in DiVA

By author/editor
Cervantes-Rivera, RamónTronnet, SophiePuhar, Andrea
By organisation
Molecular Infection Medicine Sweden (MIMS)Umeå Centre for Microbial Research (UCMR)Department of Molecular Biology (Faculty of Medicine)
In the same journal
BMC Genomics
Microbiology in the medical area

Search outside of DiVA

GoogleGoogle Scholar
Total: 503 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

doi
pubmed
urn-nbn

Altmetric score

doi
pubmed
urn-nbn
Total: 602 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf