umu.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
On the semantics of regular expression parsing in the wild
Umeå University, Faculty of Science and Technology, Department of Computing Science.
2017 (English)In: Theoretical Computer Science, ISSN 0304-3975, E-ISSN 1879-2294, Vol. 679, 69-82 p.Article in journal (Refereed) Published
Abstract [en]

We introduce prioritized transducers to formalize capturing groups in regular expression matching in a way that permits straightforward modeling of capturing in Java's 1 regular expression library. The broader questions of parsing semantics and performance are also considered. In addition, the complexity of deciding equivalence of regular expressions with capturing groups is investigated.

Place, publisher, year, edition, pages
Elsevier, 2017. Vol. 679, 69-82 p.
Keyword [en]
Regular expression matchers, Capturing groups, Prioritized transducers
National Category
Computer Science
Identifiers
URN: urn:nbn:se:umu:diva-137402DOI: 10.1016/j.tcs.2016.09.006ISI: 000403125700006OAI: oai:DiVA.org:umu-137402DiVA: diva2:1119892
Available from: 2017-07-05 Created: 2017-07-05 Last updated: 2017-07-05Bibliographically approved

Open Access in DiVA

No full text

Other links

Publisher's full text

Search in DiVA

By author/editor
Berglund, Martin
By organisation
Department of Computing Science
In the same journal
Theoretical Computer Science
Computer Science

Search outside of DiVA

GoogleGoogle Scholar

doi
urn-nbn

Altmetric score

doi
urn-nbn
Total: 15 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf