On the Semantics of Atomic Subgroups in Practical Regular Expressions
2017 (English)In: Implementation and Application of Automata: 22nd International Conference, CIAA 2017, Marne-la-Vallée, France, June 27-30, 2017, Proceedings / [ed] Arnaud Carayol; Cyril Nicaud, Springer, 2017, p. 14-26Conference paper, Published paper (Refereed)
Abstract [en]
Most regular expression matching engines have operators and features to enhance the succinctness of classical regular expressions, such as interval quantifiers and regular lookahead. In addition, matching engines in for example Perl, Java, Ruby and .NET, also provide operators, such as atomic operators, that constrain the backtracking behavior of the engine. The most common use is to prevent needless backtracking, but the operators will often also change the language accepted. As such it is essential to develop a theoretical sound basis for the matching semantics of regular expressions with atomic operators. We here establish that atomic operators preserve regularity, but are exponentially more succinct for some languages. Further we investigate the state complexity of deterministic and non-deterministic finite automata accepting the language corresponding to a regular expression with atomic operators, and show that emptiness testing is PSPACE-complete.
Place, publisher, year, edition, pages
Springer, 2017. p. 14-26
Series
Lecture Notes in Computer Science, ISSN 0302-9743, E-ISSN 1611-3349 ; 10329
Keywords [en]
Regular Expression, Regular Language, Finite Automaton, Input String, Matching Semantic
National Category
Computer Sciences
Identifiers
URN: urn:nbn:se:umu:diva-192858DOI: 10.1007/978-3-319-60134-2_2Scopus ID: 2-s2.0-85021987864ISBN: 978-3-319-60133-5 (print)ISBN: 978-3-319-60134-2 (electronic)OAI: oai:DiVA.org:umu-192858DiVA, id: diva2:1641651
Conference
CIAA 2017, 22nd International Conference on Implementation and Application of Automata, Marne-la-Vallée, France, June 27-30, 2017
Note
Also part of the Theoretical Computer Science and General Issues book sub series (LNTCS, volume 10329)
2022-03-022022-03-022022-03-03Bibliographically approved