0% found this document useful (0 votes)

14 views45 pages

CMP3008 LN4 RegularExpressions

The document provides information about regular expressions and finite automata. It defines regular expressions, discusses their precedence of operators and examples. It also covers the equivalence between regular expressions and finite automata. Specifically, it explains how to convert a deterministic finite automaton (DFA) into a generalized nondeterministic finite automaton (GNFA) and then convert the GNFA into a regular expression.

Uploaded by

Ammar Jagadhita

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

14 views45 pages

CMP3008 LN4 RegularExpressions

Uploaded by

Ammar Jagadhita

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 45

CMP3008

Formal Languages
and Automata Theory
Lecture Notes 4
Regular Expressions
Sources
https://eecs.wsu.edu/~ananth/CptS317/Lectures/index.htm
"Introduction to automata theory, languages and
computation" by JE Hopcroft, R Motwani and JD Ullman.
" An Introduction to Formal Languages and Automata Theory" by
Peter Linz
1
Content
• Regular Expressions
• Precedence of Operators
• Formal Definition
• Examples
• Further Properties
• Equivalence with Finite Automata
• Generalized Nondeterministic Finite Automata

2
Regular Expressions

• The value of the arithmetic expression is the number 32. The value of a regular expression is a language.
• In this case, the value is the language consisting of all strings starting with a 0 or a 1 followed by any
number of 0s.
• The symbols 0 and 1 are shorthand for the sets {0} and {1}. So (0 ∪ 1) means ({0} ∪ {1}). The value of this
part is the language {0,1}.
• The part 0* means {0}* and its value is the language consisting of all strings containing any number of 0s.
• (0 ∪ 1)0* is shorthand for (0 ∪ 1) ◦ 0*
Regular Expressions vs. Finite Automata

• Offers a declarative way to express the pattern of any string we want to accept
• E.g., 01*+ 10*

• Automata => more machine-like

< input: string , output: [accept/reject] >
• Regular expressions => more program syntax-like

• Unix environments heavily use regular expressions

• E.g., bash shell, grep, vi & other editors, sed
• Perl scripting – good for string processing
• Lexical analyzers such as Lex or Flex

4
Regular Expressions

Regular = Finite Automata

expressions (DFA, NFA, -NFA)
Syntactical
expressions Automata/machines

Regular
Languages

Formal language
classes

5
Language Operators
• Union of two languages:
• L U M = all strings that are either in L or M
• Note: A union of two languages produces a third language

• Concatenation of two languages:

• L . M = all strings that are of the form xy
s.t., x  L and y  M
• The dot operator is usually omitted
• i.e., LM is same as L.M

6
Kleene Closure (the * operator)
“i” here refers to how many strings to concatenate from the parent
language L to produce strings in the language Li
• Kleene Closure of a given language L:
• L0= {}
• L1= {w | for some w  L}
• L2= { w1w2 | w1  L, w2  L (duplicates allowed)}
• Li= { w1w2…wi | all w’s chosen are  L (duplicates allowed)}
• (Note: the choice of each wi is independent)
• L* = Ui≥0 Li (arbitrary number of concatenations)
Example:
• Let L = { 1, 00}
• L0= {}
• L1= {1,00}
• L2= {11,100,001,0000}
• L3= {111,1100,1001,10000,000000,00001,00100,0011}
• L* = L0 U L1 U L2 U …

7
Example: how to use these regular expression properties and
language operators?
• L = { w | w is a binary string which does Regular expression for the four cases:
not contain two consecutive 0s or two Case A: (01)*
consecutive 1s anywhere) Case B: (10)*
• E.g., w = 01010101 is in L, Case C: 0(10)*
while w = 10010 is not in L Case D: 1(01)*
• Goal: Build a regular expression for L Since L is the union of all 4 cases:
• Four cases for w:
• Case A: w starts with 0 and |w| is even Reg Exp for L = (01)* + (10)* + 0(10)* + 1(01)*
• Case B: w starts with 1 and |w| is even If we introduce  then the regular expression can be
• Case C: w starts with 0 and |w| is odd simplified to:
• Case D: w starts with 1 and |w| is odd
Reg Exp for L = ( +1)(01)*( +0)

8
Examples

all possible strings of 0s and 1s. If Σ = {0,1}, we can write Σ as shorthand for the
regular expression (0 ∪ 1).

all strings that start with a 0 or end with a 1

Precedence
• In regular expressions,
• the star operation is done first,
• followed by concatenation,
• and finally union, unless parentheses change the usual order.
Precedence of Operators
• Highest to lowest
• * operator (star)
.
• (concatenation)
• + operator

• Example:
• 01* + 1 = ( 0 . ((1)*) ) + 1

11
Algebraic Laws of Regular Expressions
• Commutative:
• E+F = F+E
• Associative:
• (E+F)+G = E+(F+G)
• (EF)G = E(FG)
• Identity:
• E+Φ = E
• E=E=E
• Annihilator:
• ΦE = EΦ = Φ

12
Algebraic Laws…
• Distributive:
• E(F+G) = EF + EG
• (F+G)E = FE+GE
• Idempotent: E + E = E
• Involving Kleene closures:
• (E*)* = E*
• Φ* =
• * =
• E+ =EE*
• E? =  +E

13
Formal Definition
Important Note
• Don’t confuse the regular expressions ε and ∅.
• The expression ε represents the language containing a single
string—namely, the empty string—whereas ∅ represents the
language that doesn’t contain any strings.

15
+ notation
• For convenience, we let R+ be shorthand for RR*. In other words,
whereas R* has all strings that are 0 or more concatenations of
strings from R, the language R+ has all strings that are 1 or more
concatenations of strings from R.
• So R+ ∪ ε = R*.
• In addition, we let Rk be shorthand for the concatenation of k R’s with
each other.
More Examples
More Examples
More Examples
Further Properties
Further Properties
An example from programming languages
• How can you define a numerical constant?
• Examples: 72, 13.4, 0.1, -0.3, +.02, -7.
• Not a numerical constant: -., +.-, -9+
• Σ = {0, 1, 2, 3, 4, 5, 6, 7, 8, 9, +, -, .}
• D = {0, 1, 2, 3, 4, 5, 6, 7, 8, 9}
Equivalence with Finite Automata
Equivalence with Finite Automata
Equivalence with Finite Automata
Equivalence with Finite Automata
Equivalence with Finite Automata
Equivalence with Finite Automata
Proof
• If a language A is regular, a regular expression describes it.
• Because A is regular, it is accepted by a DFA.
• We break this procedure into two parts, using a new type of finite
automaton called a generalized nondeterministic finite automaton,
GNFA.
• First we show how to convert DFAs into GNFAs,
• and then GNFAs into regular expressions.

35
Generalized Nondeterministic Finite Automata
• Generalized nondeterministic finite automata (GNFA) are simply
nondeterministic finite automata wherein the transition arrows may
have any regular expressions as labels, instead of only members of
the alphabet or ε.
• The GNFA moves along a transition arrow connecting two states by
reading a block of symbols from the input, which themselves
constitute a string described by the regular expression on that arrow.
GNFA Example
GNFAs
• For convenience, we require that GNFAs always have a special form
that meets the following conditions.
• The start state has transition arrows going to every other state but no arrows
coming in from any other state.
• There is only a single accept state, and it has arrows coming in from every
other state but no arrows going to any other state. Furthermore, the accept
state is not the same as the start state.
• Except for the start and accept states, one arrow goes from every state to
every other state and also from each state to itself.
GNFAs
We can easily convert a DFA into a GNFA in the special form.
• We simply add a new start state with an ε arrow to the old start state and a
new accept state with ε arrows from the old accept states.
• If any arrows have multiple labels (or if there are multiple arrows going
between the same two states in the same direction), we replace each with
a single arrow whose label is the union of the previous labels.
• Finally, we add arrows labeled ∅ between states that had no arrows. This
last step won’t change the language recognized because a transition
labeled with ∅ can never be used.
• From here on we assume that all GNFAs are in the special form.
GNFAs
• After converting DFA into an equivalent GNFA, we need to convert
GNFA into a regular expression. For this, we will remove the states in
GNFA until we have two states (start and accept states).
• The crucial step is constructing an equivalent GNFA with one fewer
state when k > 2.
• We do so by selecting a state, ripping it out of the machine, and
repairing the remainder so that the same language is still recognized.
• Any state will do, provided that it is not the start or accept state.
• We are guaranteed that such a state will exist because k > 2.
• Let’s call the removed state qrip.
GNFAs
• After removing qrip we repair the machine by altering the regular
expressions that label each of the remaining arrows.
• The new labels compensate for the absence of qrip by adding back the
lost computations.
• The new label going from a state qi to a state qj is a regular expression
that describes all strings that would take the machine from qi to qj
either directly or via qrip.
Constructing an equivalent GNFA with one fewer
state
Conversion from GNFA to Regular Expression
• The stages in converting a DFA with three states to an equivalent
regular expression are shown in the following figure.
An Example
Conversion
Another
example
conversion

CH 3 - Regular Languages Amd Regular Grammars
No ratings yet
CH 3 - Regular Languages Amd Regular Grammars
67 pages
Chapter 3 Regular Expression
No ratings yet
Chapter 3 Regular Expression
25 pages
Chapter 2 REGULAR EXPRESSION
No ratings yet
Chapter 2 REGULAR EXPRESSION
26 pages
Chapter 2 RegularExpressions (3)
No ratings yet
Chapter 2 RegularExpressions (3)
95 pages
Chapter 03 - Regular Expression and Language
No ratings yet
Chapter 03 - Regular Expression and Language
42 pages
Unit 4: Regular Expressions
No ratings yet
Unit 4: Regular Expressions
52 pages
Week4-5
No ratings yet
Week4-5
43 pages
tcs1 Slides 50 60
No ratings yet
tcs1 Slides 50 60
92 pages
Module 1&2
No ratings yet
Module 1&2
98 pages
computability-05
No ratings yet
computability-05
28 pages
chapter two
No ratings yet
chapter two
59 pages
CS242_Module 3
No ratings yet
CS242_Module 3
45 pages
Lec 03
No ratings yet
Lec 03
45 pages
Lecture05 RegularExpression&FA
No ratings yet
Lecture05 RegularExpression&FA
44 pages
CS372 Formal Languages & The Theory of Computation
No ratings yet
CS372 Formal Languages & The Theory of Computation
29 pages
3B-Formal Languages
No ratings yet
3B-Formal Languages
24 pages
ToA - Lecture 05 06 - Regular Expressions Finite Automata
No ratings yet
ToA - Lecture 05 06 - Regular Expressions Finite Automata
26 pages
class3
No ratings yet
class3
52 pages
Chapter 3 REGULAR EXPRESSION
No ratings yet
Chapter 3 REGULAR EXPRESSION
26 pages
Module 2flat
No ratings yet
Module 2flat
26 pages
Spring 2024 Compiler Constructoin A Lab 3-2
No ratings yet
Spring 2024 Compiler Constructoin A Lab 3-2
16 pages
3 Models of Computation - NFA Equiv. DFA & Regular Expressions
No ratings yet
3 Models of Computation - NFA Equiv. DFA & Regular Expressions
25 pages
Regular Expressions: Reading: Chapter 3
No ratings yet
Regular Expressions: Reading: Chapter 3
39 pages
Regular Expressions Full Notes Cse
No ratings yet
Regular Expressions Full Notes Cse
16 pages
6. Section 3.1
No ratings yet
6. Section 3.1
44 pages
Unit Ii
No ratings yet
Unit Ii
25 pages
Regular Expressions and Languages
No ratings yet
Regular Expressions and Languages
20 pages
Finit Representations Language
No ratings yet
Finit Representations Language
5 pages
3 RegularExpressions
No ratings yet
3 RegularExpressions
25 pages
TOC Unit2
No ratings yet
TOC Unit2
87 pages
FLAT Lec - 3
No ratings yet
FLAT Lec - 3
34 pages
CT2
No ratings yet
CT2
21 pages
Regular Expressions G P: Reading: Chapter 3
No ratings yet
Regular Expressions G P: Reading: Chapter 3
16 pages
Automata Lectuee3
No ratings yet
Automata Lectuee3
27 pages
Daily House Keeping Checklist
No ratings yet
Daily House Keeping Checklist
290 pages
4-Reg Exp
No ratings yet
4-Reg Exp
33 pages
Chapter 3 - Regular Expressions
No ratings yet
Chapter 3 - Regular Expressions
49 pages
Regular Expressions: Reading: Chapter 3
No ratings yet
Regular Expressions: Reading: Chapter 3
16 pages
toc u2ppt
No ratings yet
toc u2ppt
41 pages
Regular Expressions: Reading: Chapter 3
No ratings yet
Regular Expressions: Reading: Chapter 3
16 pages
Regular Expression
No ratings yet
Regular Expression
14 pages
Lecture 6 Regular Expressions
No ratings yet
Lecture 6 Regular Expressions
28 pages
Regular Expressiontzzz
No ratings yet
Regular Expressiontzzz
46 pages
Regular Expressions
No ratings yet
Regular Expressions
60 pages
ATCD Material
No ratings yet
ATCD Material
50 pages
Regular Expressions and Languages
No ratings yet
Regular Expressions and Languages
16 pages
Tcom005n PDF
No ratings yet
Tcom005n PDF
41 pages
اللغات الرسمية والأالات نظري 3
No ratings yet
اللغات الرسمية والأالات نظري 3
47 pages
Automata - Chap3+regularexpressionlanguages - 2
No ratings yet
Automata - Chap3+regularexpressionlanguages - 2
61 pages
Unit 3 - Regular Expression
No ratings yet
Unit 3 - Regular Expression
45 pages
cs212 Lect02 63 Inter
No ratings yet
cs212 Lect02 63 Inter
39 pages
1.3 Regular Expression
No ratings yet
1.3 Regular Expression
47 pages
Dfa
No ratings yet
Dfa
43 pages
Toc Unit 2
No ratings yet
Toc Unit 2
29 pages
Chapter 3 REGULAR EXPRESSION
No ratings yet
Chapter 3 REGULAR EXPRESSION
28 pages
Chapter 3 - Regular Expression
No ratings yet
Chapter 3 - Regular Expression
16 pages
Geometric Isomerism
No ratings yet
Geometric Isomerism
68 pages
chapter 3
No ratings yet
chapter 3
10 pages
Regular Expression: Operations On Regular Language
No ratings yet
Regular Expression: Operations On Regular Language
33 pages
WT Tutorials Feb2020
No ratings yet
WT Tutorials Feb2020
32 pages
MCQ and Case Based Questions
100% (1)
MCQ and Case Based Questions
31 pages
Chapter 2 RegularExpressions
No ratings yet
Chapter 2 RegularExpressions
95 pages
WWE - Lab Manual - 3171306
No ratings yet
WWE - Lab Manual - 3171306
16 pages
Thesis End HK PDF
No ratings yet
Thesis End HK PDF
142 pages
NSE7 OTS Correct Answers Only
No ratings yet
NSE7 OTS Correct Answers Only
12 pages
Motorola MC3090 G Product Manual
No ratings yet
Motorola MC3090 G Product Manual
146 pages
SQL With R
100% (1)
SQL With R
12 pages
Module Parts Adopting The 5 E's Instructional Design
No ratings yet
Module Parts Adopting The 5 E's Instructional Design
18 pages
GE 2007 100110-001 - AA0 XP4 Crossing Application Typical Diagrams
No ratings yet
GE 2007 100110-001 - AA0 XP4 Crossing Application Typical Diagrams
88 pages
CMP2003 LectureNotes Week3 4
No ratings yet
CMP2003 LectureNotes Week3 4
83 pages
Sorting Part 2
No ratings yet
Sorting Part 2
69 pages
Week1 Slides 20221004
No ratings yet
Week1 Slides 20221004
62 pages
Recursion
No ratings yet
Recursion
50 pages
PRESENTATION Guder
No ratings yet
PRESENTATION Guder
27 pages
2ms Extra Activities Book by Mrs BENGHALIA
No ratings yet
2ms Extra Activities Book by Mrs BENGHALIA
46 pages
Introduction To R
No ratings yet
Introduction To R
33 pages
Module - 3 - Electric Vehicles EV's & Hybrid Electric Vehicle
No ratings yet
Module - 3 - Electric Vehicles EV's & Hybrid Electric Vehicle
14 pages
Pe Price Dt. 01.03.2019
No ratings yet
Pe Price Dt. 01.03.2019
89 pages
SENSEI - An Architecture For The Internet
No ratings yet
SENSEI - An Architecture For The Internet
76 pages
What To Eat and When - Stanley K Clark
100% (2)
What To Eat and When - Stanley K Clark
66 pages
CMP2003 Lecturenotes Week9
No ratings yet
CMP2003 Lecturenotes Week9
25 pages
Enviro Dual Compact Pack-Off
100% (1)
Enviro Dual Compact Pack-Off
1 page
Face Recognition PDF
No ratings yet
Face Recognition PDF
41 pages
CMP3008 LN1 CourseOverview Introduction
No ratings yet
CMP3008 LN1 CourseOverview Introduction
49 pages
CMP3008 LN3 NonDeterminism
No ratings yet
CMP3008 LN3 NonDeterminism
40 pages
Twin and Full Size Platform Bed Project Diagram
No ratings yet
Twin and Full Size Platform Bed Project Diagram
8 pages
BH FFM13 PPT ch01
No ratings yet
BH FFM13 PPT ch01
11 pages
Escaner
No ratings yet
Escaner
2 pages
Module in Assessment in Learning 2 PR
No ratings yet
Module in Assessment in Learning 2 PR
8 pages
Ridhi Resume
No ratings yet
Ridhi Resume
2 pages
Golden Moment To Become Transf - Leader
No ratings yet
Golden Moment To Become Transf - Leader
9 pages
Materials Today: Proceedings: S.M. Sutharsan, M. Mohan Prasad, S. Vijay
No ratings yet
Materials Today: Proceedings: S.M. Sutharsan, M. Mohan Prasad, S. Vijay
5 pages
Year 4 Civic LP June Love
No ratings yet
Year 4 Civic LP June Love
4 pages
Oxford University Press, Design History Society Journal of Design History
No ratings yet
Oxford University Press, Design History Society Journal of Design History
9 pages
Sample Item Analysis
No ratings yet
Sample Item Analysis
2 pages
Synthesis: Century Skills Namely: Critical Thinking
No ratings yet
Synthesis: Century Skills Namely: Critical Thinking
2 pages
Icmai: Now Also Available Through Online Mode
No ratings yet
Icmai: Now Also Available Through Online Mode
1 page
Learn C++
From Everand
Learn C++
Durgesh
4.5/5 (9)
A Short Course in Automorphic Functions
From Everand
A Short Course in Automorphic Functions
Joseph Lehner
No ratings yet

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.

CMP3008 LN4 RegularExpressions

Uploaded by

CMP3008 LN4 RegularExpressions

Uploaded by

CMP3008

• Automata => more machine-like

• Unix environments heavily use regular expressions

Regular = Finite Automata

• Concatenation of two languages:

all strings that start with a 0 or end with a 1

You might also like

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.