Program-Analysis-ThuTrangNguyen-Day-2

Program Analysis Nguyễn Thu Trang

iSE, UET

1
How do you verify the correctness of
a software program?

2
Testing is one of the most
common methods:
• Bug detection
• Correctness verification

3
Why do we
need testing?
• All software has bugs
• Bugs are hard to find
• Bugs cause serious harm

4
Is there any other method for
software quality assurance without
running the program?

5
What is static (program) analysis?

6
What is static analysis?
• Static analysis is a method of (automatically) examining the source code without having to execute the program.
• Goals:
  • Detect (potential) defects
  • Find security vulnerabilities
  • Find code quality issues
  • Optimize performance
• When: early in the development process

7
Static vs. Dynamic analysis

Static analysis
• Does not require code execution
• Analyzes all code (regardless of whether it’s executed or not)
• Over-approximates program behaviors
• Detects potential defects, vulnerabilities, code quality issues
• May have many false positives (and also false negatives)
• Examples: compilers, lint-like tools

Dynamic analysis
• Requires code execution
• Only analyzes the executed code
• Under-approximates program behaviors
• Detects runtime-specific issues
• Misses errors in unexecuted paths (false negatives)
• Examples: Monkey testing tool, Selenium

8
Example
This is JavaScript code. What are the possible outputs?

9
Example
• Over-approximation: “yes”, “no”, “maybe”
• Consider all paths (that are feasible based on limited knowledge)
• Math.random() actually returns a value in [0, 1)
  → Out = “maybe” is infeasible
10
Example
• Under-approximation: “yes”
• Execute the program once

11
Example
• Sound and complete: “yes”, “no”
• For this example: can explore both feasible paths
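
The three answers above can be reproduced with a small sketch. The slide's JavaScript listing is not preserved in this text, so the program below is an assumed stand-in, written in Python, with two branches on a random value r. The function name `run` and the branch conditions `r < 0.5` and `r == 1` are hypothetical.

```python
import random
from itertools import product

def run(first_branch_taken, second_branch_taken):
    """Assumed stand-in for the slide's program; returns the printed output."""
    out = "yes"
    if first_branch_taken:        # hypothetical condition: r < 0.5
        out = "no"
    if second_branch_taken:       # hypothetical condition: r == 1
        out = "maybe"
    return out

# Over-approximation: treat every branch outcome as possible.
over = {run(b1, b2) for b1, b2 in product([False, True], repeat=2)}

# Under-approximation: execute once and record what was observed.
r = random.random()               # a value in [0, 1)
under = {run(r < 0.5, r == 1)}

# Sound and complete: use the fact that r == 1 can never hold.
exact = {run(b1, False) for b1 in [False, True]}
```

In this sketch the single run is always contained in the exact set, which in turn is contained in the over-approximation.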

12
Another example
What are the possible outputs?

13
Another example
• Over-approximation: any value
• Consider all paths (that are feasible based on limited knowledge about random())

14
Another example
• Under-approximation: some value in [0, 2)
• Execute the program once

15
Another example
• Sound and complete?
  • Exploring all possible outputs: practically impossible
  • This is the case for most real-world programs

16
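Under vs. Over-approximation
• Program: P
• For input i ∈ I, we observe a behavior P(i)

[Figure: nested sets of behaviors. The under-approximation (e.g., testing, dynamic analysis) contains the observed behaviors P(i1), P(i2), P(i3). It lies inside the set of all possible behaviors (what we want, ideally), which lies inside the over-approximation (most static analysis). Possible behaviors outside the under-approximation are false negatives; behaviors that are only in the over-approximation are false positives.]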

17
Program representations
Many ways to represent (part of) a program:
• Sequence of characters
• Sequence of tokens
• Abstract syntax tree (AST)
• Control flow graph
• Program dependence graph
• Call graph
• Intermediate representation
• Etc.

18
Sequence of characters
• Original code written by the programmer
• Human-readable form of the program

19
Abstract syntax tree (AST)
• Tree representation of source code
• “Abstract” because some details of the syntax are omitted
  • E.g., { in Java
• Nodes: constructs in the source code
• Edges: parent-child relationships
• Tools: Esprima, Joern
• Used for syntax analysis
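
To make the node and edge structure concrete, here is a minimal sketch. It does not use the tools named on the slide. As an assumption for illustration, it uses Python's standard `ast` module and parses the statement `x = a + b`.

```python
import ast

source = "x = a + b"
tree = ast.parse(source)

assign = tree.body[0]         # Assign node for the whole statement
target = assign.targets[0]    # Name node for x (child of Assign)
value = assign.value          # BinOp node for a + b (child of Assign)

# Textual view of the tree: nodes are constructs, nesting shows parent-child edges.
print(ast.dump(tree))
```

Details such as whitespace do not appear in the dump, which is what makes the tree "abstract".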

20
Abstract syntax
tree (AST)

21
Control flow graph (CFG)
• Models the flow of control through a program
• Directed graph (N, E) with:
  • Nodes N: basic blocks = sequences of operations executed together
  • Edges E: possible transfers of control
• Typically at the method level
• Used for analyzing possible paths of a program

22
Control flow
graph

23
More about
CFG

24
Program dependence graph (PDG)
• A directed graph (N, E) that represents the control/data dependencies between program components
  • Nodes N: basic blocks = sequences of operations executed together
  • Edges E: possible (data/control) dependence relationships
• Typically at the method level
• Used for optimization, parallelization, vulnerability detection

25
Program
dependence
graph (PDG)

26
Types of program analysis

• Lexical analysis: analyze the basic tokens of the source code
• Syntactic analysis: ensure the source code follows correct syntax
• Data flow analysis: analyze how data is defined and used throughout the source code
• Control flow analysis: analyze the flow of the program
• Type checking: ensure the correct and consistent usage of types

27
Data flow analysis
One popular way of formulating a static analysis

28
Real-world use cases
Many IDE features are based on data flow analysis
• E.g.
  • Reaching definitions
  • Unused variables

29
Data flow analysis
• Propagate analysis information along the edges of a control flow graph
• Goal: Compute the analysis state at each program point
• For each statement, define how it affects the analysis state
• For loops: Iterate until a fixed point is reached

30
Data flow analysis
• Available expression analysis
• Very busy expression analysis
• Reaching definitions analysis
• Live variables analysis

31
Available expression analysis
• Goal: for each program point, compute which expressions must have already been computed, and not later modified.
• Useful, e.g., to avoid re-computing an expression
• Used as part of compiler optimization

32
Example
Available every time execution reaches this point

33
Transfer functions
• Transfer function of a statement: how the statement affects the analysis state
• Here: analysis state = available expressions
• Two functions:
  • gen: Available expressions generated by a statement
  • kill: Available expressions killed by a statement

34
gen function
Function $gen : Stmt \to P(Expr)$
• A statement generates an available expression e if:
  • It evaluates e and
  • It does not later write any variable used in e
• Otherwise, function returns empty set
Example:
var x = a + b; generates a + b

35
kill function
Function $kill : Stmt \to P(Expr)$
• A statement kills an available expression e if:
  • It modifies any of the variables used in e
• Otherwise, function returns empty set
Example:
a = 23; kills a * b
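
The two definitions above can be sketched as small functions. This is a simplified illustration under stated assumptions. A statement is modeled as a hypothetical pair `(target, expr)` with the expression kept as a string, and variables are found with a regular expression. An expression counts as non-trivial if it contains an arithmetic operator.

```python
import re

def variables(expr):
    """Return the identifiers that occur in an expression string."""
    return set(re.findall(r"[A-Za-z_]\w*", expr))

def is_non_trivial(expr):
    """Assumption: an expression is non-trivial if it contains an operator."""
    return any(op in expr for op in "+-*/")

def gen(stmt):
    """Expressions made available by stmt = (target, expr); target may be None."""
    target, expr = stmt
    if is_non_trivial(expr) and target not in variables(expr):
        return {expr}
    return set()

def kill(stmt, domain):
    """Expressions in domain that stmt invalidates by writing its target."""
    target, _ = stmt
    return {e for e in domain if target in variables(e)}

# Illustrative domain of non-trivial expressions.
domain = {"a + b", "a * b", "a + 1"}
```

For example, `gen(("x", "a + b"))` yields `{"a + b"}`, while `kill(("a", "23"), domain)` yields every expression in `domain` that mentions `a`, including `a * b`.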

36
Example
Draw the control flow
graph of this code snippet

37
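Example
Control flow graph:
• entry → x = a + b → y = a * b → y > a + b
• y > a + b, true branch (T): a = a + 1 → x = a + b → back to y > a + b
• y > a + b, false branch (F): exit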

38
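Example
Control flow graph:
• entry → (1) x = a + b → (2) y = a * b → (3) y > a + b
• (3) true branch (T): (4) a = a + 1 → (5) x = a + b → back to (3)
• (3) false branch (F): exit

Non-trivial expressions:
• a + b
• a * b
• a + 1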

39
Example
Control flow graph:
• entry → (1) x = a + b → (2) y = a * b → (3) y > a + b
• (3) true branch (T): (4) a = a + 1 → (5) x = a + b → back to (3)
• (3) false branch (F): exit

Non-trivial expressions: a + b, a * b, a + 1

Statement s   gen(s)   kill(s)
1
2
3
4
5

40
Example
Control flow graph:
• entry → (1) x = a + b → (2) y = a * b → (3) y > a + b
• (3) true branch (T): (4) a = a + 1 → (5) x = a + b → back to (3)
• (3) false branch (F): exit

Non-trivial expressions: a + b, a * b, a + 1

Statement s   gen(s)     kill(s)
1             {a + b}    ∅
2             {a * b}    ∅
3             {a + b}    ∅
4             ∅          {a + b, a * b, a + 1}
5             {a + b}    ∅

41
Propagating available expressions
• Initially, no available expressions
• Forward analysis: Propagate available expressions in the direction of control flow
• For each statement s, outgoing available expressions are: incoming available expressions minus kill(s) plus gen(s)
• When control flow splits, propagate available expressions both ways
• When control flows merge, intersect the incoming available expressions

42
Data flow equations
Control flow graph:
• entry → (1) x = a + b → (2) y = a * b → (3) y > a + b
• (3) true branch (T): (4) a = a + 1 → (5) x = a + b → back to (3)
• (3) false branch (F): exit

• $AE_{entry}(s)$: available expressions at the entry of s
• $AE_{exit}(s)$: available expressions at the exit of s

• $AE_{entry}(1) = \emptyset$
• $AE_{entry}(2) = AE_{exit}(1)$
• $AE_{entry}(3) = AE_{exit}(2) \cap AE_{exit}(5)$
• $AE_{entry}(4) = AE_{exit}(3)$
• $AE_{entry}(5) = AE_{exit}(4)$
• $AE_{exit}(1) = AE_{entry}(1) \cup \{a + b\}$
• $AE_{exit}(2) = AE_{entry}(2) \cup \{a * b\}$
• $AE_{exit}(3) = AE_{entry}(3) \cup \{a + b\}$
• $AE_{exit}(4) = AE_{entry}(4) \setminus \{a + b, a * b, a + 1\}$
• $AE_{exit}(5) = AE_{entry}(5) \cup \{a + b\}$

43
Solution of the equation
• $AE_{entry}(1) = \emptyset$
• $AE_{entry}(2) = AE_{exit}(1)$
• $AE_{entry}(3) = AE_{exit}(2) \cap AE_{exit}(5)$
• $AE_{entry}(4) = AE_{exit}(3)$
• $AE_{entry}(5) = AE_{exit}(4)$
• $AE_{exit}(1) = AE_{entry}(1) \cup \{a + b\}$
• $AE_{exit}(2) = AE_{entry}(2) \cup \{a * b\}$
• $AE_{exit}(3) = AE_{entry}(3) \cup \{a + b\}$
• $AE_{exit}(4) = AE_{entry}(4) \setminus \{a + b, a * b, a + 1\}$
• $AE_{exit}(5) = AE_{entry}(5) \cup \{a + b\}$

S   AE_entry(S)   AE_exit(S)
1   ∅             {a + b}
2   {a + b}       {a + b, a * b}
3   {a + b}       {a + b}
4   {a + b}       ∅
5   ∅             {a + b}
44
Solution of the equation

S   AE_entry(S)   AE_exit(S)
1   ∅             {a + b}
2   {a + b}       {a + b, a * b}
3   {a + b}       {a + b}
4   {a + b}       ∅
5   ∅             {a + b}

At the entry of statement 3, expression a + b has already been computed
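
The solution can be checked mechanically by iterating the equations until nothing changes. The sketch below is one simple way to do that, not a production solver. The gen/kill sets and the predecessor lists are transcribed from the example, and the names `GEN`, `KILL`, `PREDS` and `available_expressions` are chosen for illustration.

```python
# gen/kill sets for statements (1) to (5) of the example.
GEN = {1: {"a + b"}, 2: {"a * b"}, 3: {"a + b"}, 4: set(), 5: {"a + b"}}
KILL = {1: set(), 2: set(), 3: set(), 4: {"a + b", "a * b", "a + 1"}, 5: set()}
# CFG predecessors; statement 1 directly follows the entry node.
PREDS = {1: [], 2: [1], 3: [2, 5], 4: [3], 5: [4]}

def available_expressions():
    entry = {s: set() for s in GEN}    # initially, no available expressions
    exit_ = {s: set() for s in GEN}
    changed = True
    while changed:                     # iterate until a fixed point is reached
        changed = False
        for s in sorted(GEN):
            preds = PREDS[s]
            # Meet: intersect what flows in from all predecessors.
            new_entry = set.intersection(*(exit_[p] for p in preds)) if preds else set()
            # Transfer: incoming minus kill(s) plus gen(s).
            new_exit = (new_entry - KILL[s]) | GEN[s]
            if new_entry != entry[s] or new_exit != exit_[s]:
                entry[s], exit_[s] = new_entry, new_exit
                changed = True
    return entry, exit_

entry, exit_ = available_expressions()
```

Because $AE_{entry}(2) = AE_{exit}(1)$, the iteration yields {a + b} at the entry of statement 2, and {a + b} at the entry of statement 3 after the loop edge from statement 5 has been taken into account.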

45
Quiz

is x – y an available
expression when entering
the statement 7?

46
Defining a data flow analysis
Any data flow analysis is defined by six properties:
• Domain
• Direction
• Transfer function
• Meet operator
• Boundary condition
• Initial values

47
Domain
• Analysis associates some information with every program point
  • “Information” means elements of a set
• Domain of the analysis: All possible elements the set may have
• E.g., for available expressions analysis: Domain is the set of non-trivial expressions

48
Direction
• Analysis propagates information along the control flow graph:
  • Forward analysis: normal flow of control
  • Backward analysis: invert all edges
    • Reasons about executions in reverse
• E.g., available expression analysis: Forward

49
Transfer function
• Defines how a statement affects the propagated information
• $DF_{exit}(s)$ = some function of $DF_{entry}(s)$
• E.g., for available expression analysis:
  $AE_{exit}(s) = (AE_{entry}(s) \setminus kill(s)) \cup gen(s)$

50
Meet operator
• What if two statements $s_1$, $s_2$ flow to a statement s?
  • Forward analysis: Execution branches merge
  • Backward analysis: branching point
• Meet operator defines how to combine the incoming information
  • Union: $DF_{entry}(s) = DF_{exit}(s_1) \cup DF_{exit}(s_2)$
  • Intersection: $DF_{entry}(s) = DF_{exit}(s_1) \cap DF_{exit}(s_2)$

51
Boundary condition
• What information to start with at the first CFG node?
  • Forward analysis: first node is entry node
  • Backward analysis: first node is exit node
• Common choices:
  • Empty set
  • Entire domain

52
Initial values
• What is the information to start with at intermediate nodes?
• Common choices:
  • Empty set
  • Entire domain

53
Defining a data flow analysis
Any data flow analysis is defined by six properties. Available expression analysis is defined as:
• Domain: non-trivial expressions
• Direction: forward
• Transfer function: $AE_{exit}(s) = (AE_{entry}(s) \setminus kill(s)) \cup gen(s)$
• Meet operator: intersection (∩)
• Boundary condition: $AE_{entry}(entryNode) = \emptyset$
• Initial values: ∅
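
One way to see why these six properties are sufficient is to write a solver that takes nothing else as input. The following is a sketch under stated assumptions, not a reference implementation. The class `DataFlowAnalysis`, the function `solve`, and the representation of the CFG as successor lists keyed by statement number are all hypothetical. The `domain` field is carried for documentation only and is not used by the iteration.

```python
from dataclasses import dataclass
from functools import reduce
from typing import Callable, Dict, FrozenSet, List

Fact = FrozenSet[str]

@dataclass
class DataFlowAnalysis:
    domain: Fact                            # all elements a state may contain
    forward: bool                           # direction: True = forward
    transfer: Callable[[int, Fact], Fact]   # effect of a statement on a state
    meet: Callable[[Fact, Fact], Fact]      # combines incoming states
    boundary: Fact                          # state at the first CFG node
    initial: Fact                           # starting state at all other nodes

def solve(analysis: DataFlowAnalysis, succs: Dict[int, List[int]], first: int):
    """Iterate to a fixed point; returns (incoming, outgoing) state per node."""
    edges = [(a, b) for a in succs for b in succs[a]]
    if not analysis.forward:                # backward analysis: invert all edges
        edges = [(b, a) for a, b in edges]
    incoming = {n: analysis.initial for n in succs}
    outgoing = {n: analysis.initial for n in succs}
    changed = True
    while changed:
        changed = False
        for n in succs:
            states = [outgoing[a] for a, b in edges if b == n]
            if n == first:
                states.append(analysis.boundary)
            new_in = reduce(analysis.meet, states) if states else analysis.initial
            new_out = analysis.transfer(n, new_in)
            if (new_in, new_out) != (incoming[n], outgoing[n]):
                incoming[n], outgoing[n] = new_in, new_out
                changed = True
    return incoming, outgoing

# Available expressions for the five-statement example, using its gen/kill sets.
GEN = {1: {"a + b"}, 2: {"a * b"}, 3: {"a + b"}, 4: set(), 5: {"a + b"}}
KILL = {1: set(), 2: set(), 3: set(), 4: {"a + b", "a * b", "a + 1"}, 5: set()}
available = DataFlowAnalysis(
    domain=frozenset({"a + b", "a * b", "a + 1"}),
    forward=True,
    transfer=lambda s, state: (state - KILL[s]) | GEN[s],
    meet=lambda x, y: x & y,
    boundary=frozenset(),
    initial=frozenset(),
)
succs = {1: [2], 2: [3], 3: [4], 4: [5], 5: [3]}   # the edge from 3 to exit is omitted
incoming, outgoing = solve(available, succs, first=1)
```

Switching to a different analysis means changing only the six fields, for example `meet=lambda x, y: x | y` for an analysis that uses union.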

54
Reaching definitions analysis
• Goal: for each program point, compute which assignments may have been made and may not have been overwritten
• Useful in various program analyses:
  • Detect uninitialized variables
  • Optimize register allocation
  • E.g., to compute a data flow graph

55
Example
Definition (x) reaches the entry of this statement

56
Example
All definitions reach the entry of this statement

57
Defining the Analysis
• Domain: Definitions (assignments) in the code
  • Set of pairs (v, s) of variables and statements
  • (v, s) means a definition of v at s
• Direction: forward
• Meet operator: Union
  • Because we care about definitions that may reach a program point

58
Defining the Analysis (2)
• Transfer function:
  $RD_{exit}(s) = (RD_{entry}(s) \setminus kill(s)) \cup gen(s)$
• Function gen(s)
  • If s is an assignment to v: (v, s)
  • Otherwise: empty set
• Function kill(s)
  • If s is an assignment to v: (v, s') for all s' (s' ≠ s) that define v
  • Otherwise: empty set

59
Defining the Analysis (3)
• Boundary condition: Entry node starts with all variables undefined
  • Special “statement” for undefined variables: ?
  • $RD_{entry}(entryNode) = \{(v, ?) \mid v \in Vars\}$
• Initially, all nodes have no reaching definitions

60
Example
Draw CFG for this code
snippet

61
Example
Control flow graph:
• entry → (1) x = 5 → (2) y = 1 → (3) x > 1
• (3) true branch (T): (4) y = x * y → (5) x = x - 1 → back to (3)
• (3) false branch (F): exit

62
Example
Control flow graph:
• entry → (1) x = 5 → (2) y = 1 → (3) x > 1
• (3) true branch (T): (4) y = x * y → (5) x = x - 1 → back to (3)
• (3) false branch (F): exit

Domain: (x, ?), (x, 1), (x, 5), (y, ?), (y, 2), (y, 4)

s   gen(s)   kill(s)
1
2
3
4
5
63
Example
Control flow graph:
• entry → (1) x = 5 → (2) y = 1 → (3) x > 1
• (3) true branch (T): (4) y = x * y → (5) x = x - 1 → back to (3)
• (3) false branch (F): exit

Domain: (x, ?), (x, 1), (x, 5), (y, ?), (y, 2), (y, 4)

s   gen(s)      kill(s)
1   {(x, 1)}    {(x, 5), (x, ?)}
2   {(y, 2)}    {(y, 4), (y, ?)}
3   ∅           ∅
4   {(y, 4)}    {(y, 2), (y, ?)}
5   {(x, 5)}    {(x, 1), (x, ?)}
64
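Data flow equations
Control flow graph:
• entry → (1) x = 5 → (2) y = 1 → (3) x > 1
• (3) true branch (T): (4) y = x * y → (5) x = x - 1 → back to (3)
• (3) false branch (F): exit

• $RD_{entry}(1) = \{(x, ?), (y, ?)\}$
• $RD_{entry}(2) = RD_{exit}(1)$
• $RD_{entry}(3) = RD_{exit}(2) \cup RD_{exit}(5)$
• $RD_{entry}(4) = RD_{exit}(3)$
• $RD_{entry}(5) = RD_{exit}(4)$
• $RD_{exit}(1) = (RD_{entry}(1) \setminus \{(x, 1), (x, 5), (x, ?)\}) \cup \{(x, 1)\}$
• $RD_{exit}(2) = (RD_{entry}(2) \setminus \{(y, 2), (y, 4), (y, ?)\}) \cup \{(y, 2)\}$
• $RD_{exit}(3) = RD_{entry}(3)$
• $RD_{exit}(4) = (RD_{entry}(4) \setminus \{(y, 2), (y, 4), (y, ?)\}) \cup \{(y, 4)\}$
• $RD_{exit}(5) = (RD_{entry}(5) \setminus \{(x, 1), (x, 5), (x, ?)\}) \cup \{(x, 5)\}$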

65
Solution of the equation
• $RD_{entry}(1) = \{(x, ?), (y, ?)\}$
• $RD_{entry}(2) = RD_{exit}(1)$
• $RD_{entry}(3) = RD_{exit}(2) \cup RD_{exit}(5)$
• $RD_{entry}(4) = RD_{exit}(3)$
• $RD_{entry}(5) = RD_{exit}(4)$
• $RD_{exit}(1) = (RD_{entry}(1) \setminus \{(x, 1), (x, 5), (x, ?)\}) \cup \{(x, 1)\}$
• $RD_{exit}(2) = (RD_{entry}(2) \setminus \{(y, 2), (y, 4), (y, ?)\}) \cup \{(y, 2)\}$
• $RD_{exit}(3) = RD_{entry}(3)$
• $RD_{exit}(4) = (RD_{entry}(4) \setminus \{(y, 2), (y, 4), (y, ?)\}) \cup \{(y, 4)\}$
• $RD_{exit}(5) = (RD_{entry}(5) \setminus \{(x, 1), (x, 5), (x, ?)\}) \cup \{(x, 5)\}$

S   RD_entry(S)                         RD_exit(S)
1   {(x, ?), (y, ?)}                    {(x, 1), (y, ?)}
2   {(x, 1), (y, ?)}                    {(x, 1), (y, 2)}
3   {(x, 1), (y, 2), (x, 5), (y, 4)}    {(x, 1), (y, 2), (x, 5), (y, 4)}
4   {(x, 1), (y, 2), (x, 5), (y, 4)}    {(x, 1), (x, 5), (y, 4)}
5   {(x, 1), (x, 5), (y, 4)}            {(x, 5), (y, 4)}
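
The reaching-definitions solution can be reproduced with a short fixed-point loop. This is a sketch for the five-statement example only. Definitions are modeled as `(variable, statement)` tuples with the string `"?"` standing for "undefined", and the names `GEN`, `DEFS`, `PREDS`, `BOUNDARY`, `kill` and `reaching_definitions` are chosen for illustration.

```python
# gen sets for statements (1) to (5); "?" marks the undefined pseudo-statement.
GEN = {1: {("x", 1)}, 2: {("y", 2)}, 3: set(), 4: {("y", 4)}, 5: {("x", 5)}}
DEFS = {("x", "?"), ("x", 1), ("x", 5), ("y", "?"), ("y", 2), ("y", 4)}
PREDS = {1: [], 2: [1], 3: [2, 5], 4: [3], 5: [4]}
BOUNDARY = {("x", "?"), ("y", "?")}    # all variables undefined at the entry

def kill(s):
    """All other definitions of the variable that statement s assigns."""
    assigned = {v for v, _ in GEN[s]}
    return {(v, t) for v, t in DEFS if v in assigned and t != s}

def reaching_definitions():
    entry = {s: set() for s in GEN}    # initially, no reaching definitions
    exit_ = {s: set() for s in GEN}
    changed = True
    while changed:
        changed = False
        for s in sorted(GEN):
            if s == 1:
                new_entry = set(BOUNDARY)
            else:
                # Meet: union over all predecessors.
                new_entry = set().union(*(exit_[p] for p in PREDS[s]))
            new_exit = (new_entry - kill(s)) | GEN[s]
            if new_entry != entry[s] or new_exit != exit_[s]:
                entry[s], exit_[s] = new_entry, new_exit
                changed = True
    return entry, exit_

entry, exit_ = reaching_definitions()
```

The loop needs a second pass because the definitions generated in the loop body, (x, 5) and (y, 4), only reach statement 3 through the back edge from statement 5.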

66
Live variables analysis
• Goal: for each statement, find variables that may be “live” at the exit from the statement
  • “live”: the variable is used before being redefined
• Useful, e.g., for identifying dead code
  • Bug detection: dead assignments are typically unexpected
  • Optimization: remove dead code

67
Example
x is not live after this statement

68
Example
Both x and y are live after this statement

69
Defining the Analysis
• Domain: all variables occurring in the code
• Direction: Backward
• Meet operator: Union
  • Because we care about whether a variable may be used

70
Defining the Analysis
• Transfer function:
  $LV_{entry}(s) = (LV_{exit}(s) \setminus kill(s)) \cup gen(s)$
  • Backward analysis: Returns set of variables that are live at entry of statement
• Function gen(s)
  • All variables v that are used in s
  • Otherwise: empty set
• Function kill(s)
  • If s assigns to v, then it kills v
  • Otherwise: empty set

71
Defining the analysis
• Boundary condition: Final node starts with no live variables
  $LV_{exit}(finalNode) = \emptyset$
• Initially, all nodes have no live variables
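
As a sketch of the backward direction, the definition above can be run on a small program. No code listing for live variables survives in this text, so the example reuses, as an assumption, the five-statement loop from the reaching-definitions example: (1) x = 5, (2) y = 1, (3) x > 1, (4) y = x * y, (5) x = x - 1. The names `USE`, `DEF`, `SUCCS` and `live_variables` are illustrative.

```python
# gen(s): variables used in s; kill(s): variable assigned by s.
USE = {1: set(), 2: set(), 3: {"x"}, 4: {"x", "y"}, 5: {"x"}}
DEF = {1: {"x"}, 2: {"y"}, 3: set(), 4: {"y"}, 5: {"x"}}
# Successors; the false branch of (3) leads to the final node, where nothing is live.
SUCCS = {1: [2], 2: [3], 3: [4], 4: [5], 5: [3]}

def live_variables():
    entry = {s: set() for s in USE}    # initially, no live variables
    exit_ = {s: set() for s in USE}
    changed = True
    while changed:
        changed = False
        for s in sorted(USE, reverse=True):       # visit statements backward
            # Meet: union over all successors.
            new_exit = set().union(*(entry[t] for t in SUCCS[s]))
            # Transfer: outgoing minus kill(s) plus gen(s).
            new_entry = (new_exit - DEF[s]) | USE[s]
            if new_entry != entry[s] or new_exit != exit_[s]:
                entry[s], exit_[s] = new_entry, new_exit
                changed = True
    return entry, exit_

entry, exit_ = live_variables()
```

Under these assumptions no variable is live before statement 1, only x is live after it, and both x and y are live throughout the loop.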

72
Quiz
Compute the live variables
before and after every
statement

73
Intra- vs. Inter-procedural
• Intra-procedural analysis:
  • Reason about a function in isolation
• Inter-procedural analysis:
  • Reason about multiple functions
  • Calls and returns
• Data flow analyses considered so far: intra-procedural

74
Inter-procedural control flow
• One control flow graph per function
• Connect call sites to entry node of callee
• Connect exit node back to call site

75
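[Figure: two control flow graphs connected by call and return edges]
• Caller: entry → x = 1 → z = bar(5) → z = bar(3) → exit
• Callee bar: entry → console.log(y) → return y + 1 → exit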
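
The construction of call and return edges can be sketched as follows. This is an illustration under assumptions. Each CFG is a dictionary of successor lists keyed by the statement text, the caller is given the hypothetical name `main`, and call sites are detected by a crude substring match on `bar(`.

```python
# Per-function CFGs as successor lists; node names are the statement texts.
caller = {"entry": ["x = 1"], "x = 1": ["z = bar(5)"],
          "z = bar(5)": ["z = bar(3)"], "z = bar(3)": ["exit"], "exit": []}
bar = {"entry": ["console.log(y)"], "console.log(y)": ["return y + 1"],
       "return y + 1": ["exit"], "exit": []}

def link(caller_name, caller_cfg, callee_name, callee_cfg):
    """Return the edges of the combined graph over (function, node) pairs."""
    edges = []
    # Keep every intra-procedural edge, qualified by its function name.
    for name, cfg in ((caller_name, caller_cfg), (callee_name, callee_cfg)):
        edges += [((name, a), (name, b)) for a in cfg for b in cfg[a]]
    for node in caller_cfg:
        if callee_name + "(" in node:   # crude call-site detection (assumption)
            # Call edge: call site to entry node of callee.
            edges.append(((caller_name, node), (callee_name, "entry")))
            # Return edge: exit node of callee back to the call site.
            edges.append(((callee_name, "exit"), (caller_name, node)))
    return edges

edges = link("main", caller, "bar", bar)
```

Both call sites share the single CFG of `bar`, so information entering from `bar(5)` can flow back out to the `bar(3)` call site unless the analysis keeps calling contexts apart.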

76
Propagating information
• Arguments passed into call
  • Propagate to formal parameters of callee
• Return value
  • Propagate back to caller
• Local variables
  • Do not propagate into callee
  • Instead, when the call returns, continue with the state just before the call

77
Application of Static Analysis
Vulnerability Detection through Static Analysis

78
What is a security vulnerability?
A security vulnerability is a flaw/weakness in a system that can be exploited by attackers to compromise confidentiality (stealing sensitive data), integrity (altering data), or availability (disrupting services), or to conduct other malicious activities.

79
Vulnerability vs. Bug

Bug
• A coding or design error
• Causes the system to behave unexpectedly → incorrect results, crashes, functional failures
• Doesn’t always pose a direct security risk
• Not all bugs are vulnerabilities

Vulnerability
• Weakness in software security
• Can be exploited by an attacker
• Can lead to severe consequences like unauthorized access, data theft, or other malicious activities
• Fixing a vulnerability is a priority for maintaining security
• Vulnerabilities are security bugs

80
Vulnerability types
Out of bounds
Use after free
SQL injection
XSS
Null pointer dereference
Integer overflow or wraparound
Improper input validation
Use of hard-coded credentials
81
Vulnerabilities
by year

Source: https://www.cvedetails.com
82
Example
Is there any problem
within this program?

83
Example

84
Use-after-
free

85
Use-after-free
• Occurs when a program continues to use a pointer to memory that has already been freed or deallocated
• This can lead to undefined behavior, crashes, or security vulnerabilities
• Prevention of use-after-free
  • Set pointer to null after freeing memory

86
Example
Is there any problem
within this program?

87
Example

88
Double free
• A double free vulnerability occurs when a program attempts to free (or deallocate) the same block of memory more than once.
• Consequences of double free:
  • Memory corruption
  • Crash or unstable behavior
  • Exploitation by attackers
• Prevention of double free
  • Set pointers to null after freeing
89
Example
Is there any problem
within this program?

90
Example

91
Null pointer
dereference

92
Null pointer dereference
• A null pointer dereference occurs when a program tries to access or manipulate data through a pointer that has a null value
• Dereferencing a null pointer causes an error because there’s no valid data to access

93
Null pointer dereference
• Attackers can leverage null pointer dereferences to cause various forms of harm, such as crashing the program or causing a denial of service
• Prevention of null pointer dereferences:
  • Pointer initialization
  • Null check before dereferencing
  • Safe memory allocation/management
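
The three memory errors discussed above (use-after-free, double free, null pointer dereference) can be detected statically by tracking a small state per pointer. The following toy checker is a sketch under strong assumptions: it handles only straight-line code without branches or aliases, and the program is given as a hypothetical list of `(operation, pointer)` events rather than real C source.

```python
def check(events):
    """Scan (operation, pointer) events in order and report memory errors."""
    state = {}        # pointer name -> "valid" | "freed" | "null"
    findings = []
    for index, (op, ptr) in enumerate(events):
        current = state.get(ptr)
        if op == "alloc":
            state[ptr] = "valid"
        elif op == "null":                 # models the assignment p = NULL
            state[ptr] = "null"
        elif op == "free":
            if current == "freed":
                findings.append((index, "double free"))
            elif current == "valid":
                state[ptr] = "freed"
            # Freeing a null pointer is a no-op in C, so nothing is reported.
        elif op == "use":
            if current == "freed":
                findings.append((index, "use after free"))
            elif current == "null":
                findings.append((index, "null pointer dereference"))
    return findings
```

For instance, `check([("alloc", "p"), ("free", "p"), ("use", "p")])` reports a use after free at index 2. Inserting `("null", "p")` after the free removes the double-free report for a second `free`, but a later use is then reported as a null pointer dereference, which is why the null check before dereferencing is still needed.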

94
Buffer
overflow

95
Buffer
overflow

96
Integer
overflow or
Wraparound

97
SQL injection

98
Path
traversal

99
Use hard-
coded
credentials

100
How to avoid vulnerabilities?
• Comply with coding standards
  • CERT (https://wiki.sei.cmu.edu/confluence/display/seccode)
  • MISRA (https://misra.org.uk)
• Follow secure designs
• Apply software quality assurance techniques/tools
  • Clang analyzer (https://clang-analyzer.llvm.org)
  • Cppcheck (https://cppcheck.sourceforge.io)
  • Infer (https://fbinfer.com)

101
Future trends

102
How to create a program analysis?

103
Traditional program analysis
• Manually crafted
• Years of work
• Precise, logical reasoning
• Heuristics to handle undecidability
• Challenged by large code bases

104
Neural software analysis
Insight: Lots of data about software development to learn from

[Figure: Source code, execution traces, documentation, bug reports, etc. → Machine learning → Predictive tool. New code, execution, etc. → Predictive tool → Information useful for developer.]

105
Traditional vs. neural software analysis

Traditional
• Manually crafted
• Years of work
• Precise, logical reasoning
• Heuristics to handle undecidability
• Challenged by large code bases

Neural
• Automatically learned within hours
• Data-driven prediction
• Learn instead of hard-code heuristics
• Use big code to our benefit

106
Application of neural software analysis
• Type prediction
• Bug detection
• Program repair
• Code summarization
• Code completion

107
Q&A

108
