
MALINENI LAKSHMAIAH WOMEN’S ENGINEERING COLLEGE
(Approved by AICTE, Affiliated to JNTUK) (An ISO 9001:2008 Certified Institution)

IV B.Tech (Common to CSE and IT), IV-I Semester, R19,


Machine Learning Notes, UNIT-III

Prepared by Dr.M.BHEEMALINGAIAH
UNIT-III

Computational Learning Theory: Models of learnability: learning in the limit; probably approximately correct (PAC) learning; sample complexity for infinite hypothesis spaces; Vapnik-Chervonenkis dimension.
Rule Learning: Propositional and first-order rules; translating decision trees into rules; heuristic rule induction using separate-and-conquer and information gain; first-order Horn-clause induction (Inductive Logic Programming) and FOIL; learning recursive rules; inverse resolution; GOLEM; and PROGOL.

Content
3.1 Computational Learning Theory:
3.2 Probably approximately correct (PAC) learning
3.3 Sample Complexity for Infinite Hypothesis Spaces
3.4 Vapnik-Chervonenkis dimension
3.5 Rule Learning
3.6 Translating decision trees into rules
3.7 Sequential Covering Algorithms (Separate and Conquer Approach)
3.8 Learning First Order Rules
3.9 First Order Rule Inductive Learning (FOIL)
3.9.1 FOIL Algorithm
3.9.2 FOIL: Explanation
3.9.3 FOIL: Specializing the Current Rule
3.9.4 FOIL: Performance Evaluation Measure
3.9.5 Foil-Gain
3.9.6 Summary/Observations of FOIL
3.10 Induction as Inverted Deduction
3.11 Inverse resolution
3.12 PROGOL

3.1 Computational Learning Theory

Computational learning theory, or CoLT for short, is a field of study concerned with applying formal mathematical methods to learning systems. It seeks to use the tools of theoretical computer science to quantify learning problems. This includes characterizing the difficulty of learning specific tasks. Computational learning theory may also be thought of as an extension or relative of statistical learning theory (SLT for short), which uses formal methods to quantify learning algorithms.

When studying machine learning it is natural to wonder what general laws may govern machine (and
nonmachine) learners. Is it possible to identify classes of learning problems that are inherently difficult or
easy, independent of the learning algorithm? Can one characterize the number of training examples
necessary or sufficient to assure successful learning? How is this number affected if the learner is allowed
to pose queries to the trainer, versus observing a random sample of training examples? Can one
characterize the number of mistakes that a learner
will make before learning the target function? Can one characterize the inherent computational complexity
of classes of learning problems?
Although general answers to all these questions are not yet known, fragments of a computational theory of learning have begun to emerge. This unit presents key results from this theory, providing answers to these questions within particular problem settings. We focus here on the problem of inductively learning an unknown target function, given only training examples of this target function and a space of candidate hypotheses. Within this setting, we will be chiefly concerned with questions such as how many training examples are sufficient to successfully learn the target function, and how many mistakes will the learner make before succeeding. As we shall see, it is possible to set quantitative bounds on these measures, depending on attributes of the learning problem such as:

• the size or complexity of the hypothesis space considered by the learner
• the accuracy to which the target concept must be approximated
• the probability that the learner will output a successful hypothesis
• the manner in which training examples are presented to the learner

For the most part, we will focus not on individual learning algorithms, but rather on broad classes of
learning algorithms characterized by the hypothesis spaces they consider, the presentation of training
examples, etc. Our goal is to answer questions such as:

• Sample complexity. How many training examples are needed for a learner to converge (with high probability) to a successful hypothesis?
• Computational complexity. How much computational effort is needed for a learner to converge (with high probability) to a successful hypothesis?
• Mistake bound. How many training examples will the learner misclassify before converging to a successful hypothesis?
3.2 Probably Approximately Correct (PAC) Learning
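As a quick numerical illustration of the PAC framework (a minimal sketch, assuming a consistent learner over a finite hypothesis space H and using the standard bound m ≥ (1/ε)(ln|H| + ln(1/δ)); the function name and the example numbers below are illustrative, not taken from these notes):

import math

def pac_sample_bound(h_size, epsilon, delta):
    # Number of training examples sufficient for a consistent learner over a
    # finite hypothesis space of size h_size to output, with probability at
    # least 1 - delta, a hypothesis with true error at most epsilon.
    return math.ceil((1.0 / epsilon) * (math.log(h_size) + math.log(1.0 / delta)))

# Example: conjunctions of up to 10 boolean literals (|H| = 3**10),
# target error 0.1, confidence 95%.
print(pac_sample_bound(3 ** 10, epsilon=0.1, delta=0.05))   # -> 140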

3.3 Sample Complexity for Infinite Hypothesis Spaces

Answer: the Vapnik-Chervonenkis (VC) dimension, defined in the next section.

3.4 Vapnik-Chervonenkis (VC) Dimension

Definition of the Vapnik-Chervonenkis (VC) dimension: The VC dimension of a hypothesis space H, written VC(H), is the size of the largest finite subset of the instance space X that can be shattered by H. If arbitrarily large finite subsets of X can be shattered by H, then VC(H) = ∞.
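To make the notion of shattering concrete, here is a small sketch (an illustrative example, not from these notes) using the hypothesis space of 1-D threshold classifiers h_t(x) = (x ≥ t): a set of points is shattered by H if every possible labelling of the points is realised by some hypothesis in H. Thresholds shatter any single point but no pair of points, so VC(H) = 1 for this hypothesis space.

from itertools import product

def shattered_by_thresholds(points):
    # Check whether a set of distinct 1-D points can be shattered by
    # threshold classifiers of the form h_t(x) = (x >= t).
    pts = sorted(points)
    # Representative thresholds: one below all points, one between each
    # consecutive pair of points, and one above all points.
    thresholds = ([pts[0] - 1.0]
                  + [(a + b) / 2.0 for a, b in zip(pts, pts[1:])]
                  + [pts[-1] + 1.0])
    for labelling in product([False, True], repeat=len(points)):
        if not any(all((x >= t) == y for x, y in zip(points, labelling))
                   for t in thresholds):
            return False  # this labelling cannot be produced by any threshold
    return True

print(shattered_by_thresholds([2.0]))        # True: a single point is shattered
print(shattered_by_thresholds([1.0, 5.0]))   # False: the labelling (+, -) is impossible, so VC = 1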

3.5 Rule Learning

Rule-based learning represents knowledge in the form of IF-THEN rules, which prove useful in Artificial Intelligence (AI) systems. This type of learning is well suited to analyzing data that contains a mixture of numerical and qualitative attributes. Rule mining is useful and its output is easy for humans to understand. The main aim of this kind of learning is to discover interesting relations between variables and patterns in large data sets. Several algorithms have proven useful for automatically inducing rules from data to build more accurate AI systems.

Format of a rule:

A → B

The LHS A is called the antecedent and the RHS B is called the consequent, or

A is called the condition and B is called the conclusion, or

A is called the body and B is called the head.

A is a conjunction of attribute-value pairs and B is a class label (the target class prediction, e.g. Yes or No).

Evaluation metrics of a rule: Two evaluation metrics for a rule, accuracy and coverage, are defined as follows:

Accuracy = (number of instances that satisfy both the antecedent and the consequent of the rule) / (number of instances that satisfy the antecedent of the rule)

Coverage = (number of instances that satisfy the antecedent of the rule) / (total number of instances in the training data set)
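A minimal sketch of these two metrics in code (the rule representation and the small data set below are illustrative assumptions, not from these notes):

def rule_accuracy_and_coverage(antecedent, consequent, data):
    # antecedent and consequent are boolean functions over a record (dict).
    covered = [r for r in data if antecedent(r)]
    correct = [r for r in covered if consequent(r)]
    accuracy = len(correct) / len(covered) if covered else 0.0
    coverage = len(covered) / len(data)
    return accuracy, coverage

# Rule: IF Outlook = Sunny AND Humidity = Normal THEN PlayTennis = Yes
data = [
    {"Outlook": "Sunny", "Humidity": "Normal", "PlayTennis": "Yes"},
    {"Outlook": "Sunny", "Humidity": "High",   "PlayTennis": "No"},
    {"Outlook": "Rain",  "Humidity": "High",   "PlayTennis": "No"},
    {"Outlook": "Sunny", "Humidity": "Normal", "PlayTennis": "Yes"},
]
acc, cov = rule_accuracy_and_coverage(
    lambda r: r["Outlook"] == "Sunny" and r["Humidity"] == "Normal",
    lambda r: r["PlayTennis"] == "Yes",
    data)
print(acc, cov)   # 1.0 0.5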

Types of rule-based learning methods

• Indirect methods: translating decision trees into rules, genetic algorithms
• Direct methods: OneR algorithm, sequential covering algorithm

Indirect methods: the rules are extracted from another classification model; translating decision trees into rules and genetic algorithms belong to this category.

Direct methods: the rules are extracted directly from the training data; examples include the OneR algorithm and the sequential covering algorithm.

3.6 Translating decision trees into rules

In this method, a decision tree is first constructed from the training data and then translated into rules: one rule is generated for each path from the root to a leaf. Consider the familiar PlayTennis decision tree as an example.

The following three rules predict the class label Yes:

R1: IF (Outlook = Sunny ∧ Humidity = Normal) THEN PlayTennis = Yes

R2: IF (Outlook = Overcast) THEN PlayTennis = Yes

R3: IF (Outlook = Rain ∧ Wind = Weak) THEN PlayTennis = Yes

The following two rules predict the class label No:

R4: IF (Outlook = Sunny ∧ Humidity = High) THEN PlayTennis = No

R5: IF (Outlook = Rain ∧ Wind = Strong) THEN PlayTennis = No
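These five rules can be applied directly as an ordered rule set; a minimal sketch (the dictionary encoding and the default class are illustrative assumptions):

# Each rule is (conditions, predicted class); conditions are attribute = value tests.
RULES = [
    ({"Outlook": "Sunny", "Humidity": "Normal"}, "Yes"),    # R1
    ({"Outlook": "Overcast"}, "Yes"),                       # R2
    ({"Outlook": "Rain", "Wind": "Weak"}, "Yes"),            # R3
    ({"Outlook": "Sunny", "Humidity": "High"}, "No"),        # R4
    ({"Outlook": "Rain", "Wind": "Strong"}, "No"),           # R5
]

def classify(record, rules=RULES, default="No"):
    # Return the prediction of the first rule whose antecedent the record satisfies.
    for conditions, label in rules:
        if all(record.get(attr) == value for attr, value in conditions.items()):
            return label
    return default

print(classify({"Outlook": "Rain", "Wind": "Weak", "Humidity": "High"}))   # Yes (fires R3)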

3.7 Sequential Covering Algorithms (Separate and Conquer Approach)

Sequential covering is a popular rule-based classification algorithm used for learning a disjunctive set of rules. The basic idea is to learn one rule, remove the training data that it covers, and then repeat the same process; in this way all of the rules are learned one after another during the training phase. Sequential covering algorithms are based on the separate-and-conquer approach.

Some examples of sequential covering algorithms:

• AQ
• CN2
• RIPPER (Repeated Incremental Pruning to Produce Error Reduction)
• PRISM
• M5
• FOIL (First Order Inductive Learner), and so on
The steps of a sequential covering algorithm are as follows (a minimal code sketch is given after this list):

1. Start from an empty rule.
2. Grow the rule using some Learn-One-Rule function.
3. Remove the training records covered by the rule.
4. Repeat steps (2) and (3) until the stopping criterion is met.
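A minimal, hedged sketch of this loop in code, using a very simple Learn-One-Rule that greedily adds single attribute = value tests chosen by rule accuracy (a simplification of real implementations such as CN2 or RIPPER; the function names, the "Yes"/"No" class convention, and the dictionary record format are assumptions made for illustration):

def learn_one_rule(data, target):
    # Greedy general-to-specific search: start from the empty rule and keep
    # adding the attribute = value test that gives the highest accuracy.
    rule, covered = {}, list(data)
    while covered and any(r[target] != "Yes" for r in covered):
        candidates = {(a, r[a]) for r in covered for a in r
                      if a != target and a not in rule}
        if not candidates:
            break
        def accuracy(test):
            sub = [r for r in covered if r[test[0]] == test[1]]
            return (sum(r[target] == "Yes" for r in sub) / len(sub)) if sub else 0.0
        best = max(candidates, key=accuracy)
        rule[best[0]] = best[1]
        covered = [r for r in covered if r[best[0]] == best[1]]
    return rule

def sequential_covering(data, target):
    rules, remaining = [], list(data)
    while any(r[target] == "Yes" for r in remaining):        # positive examples remain
        rule = learn_one_rule(remaining, target)
        covered_pos = [r for r in remaining if r[target] == "Yes"
                       and all(r.get(a) == v for a, v in rule.items())]
        if not covered_pos:                                  # no progress: stop
            break
        rules.append(rule)                                   # keep the rule (conquer)
        remaining = [r for r in remaining if r not in covered_pos]   # separate
    return rules

# sequential_covering(play_tennis_records, target="PlayTennis") would return a
# list of rules (attribute = value dictionaries) predicting the Yes class.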

Example 1: Consider a training data set containing positive and negative examples.

• Select the rule that covers the most positive examples in the training data set and add this new rule to the rule set.
• Update the training data set by removing the positive examples covered by the selected rule.
• Repeat this process until the training data set contains no positive examples (finally it contains only negative examples), as shown in the figures below.
[Figure: sequential covering on the example data set — (i) original data; (ii) Step 1: rule R1 is learned and its covered positives removed; (iii) Step 2: rule R2 is learned; (iv) Step 3: the remaining examples.]

Sequential Covering Algorithms

The sequential covering algorithm uses a subroutine (function) called LEARN-ONE-RULE, which performs a greedy general-to-specific search for a single rule of high accuracy over the current training data.

3.8 Learning First Order Rules

Lecture Outline:
• Why Learn First Order Rules?
• First Order Logic: Terminology
• The FOIL Algorithm

Propositional logic allows the expression of individual propositions and their truth-functional
combination.

• E.g. propositions like Tom is a man or All men are mortal may be represented by single proposition letters such as P or Q (so, proposition letters may be viewed as variables which range over propositions)
• Truth-functional combinations are built up using connectives such as ∧, ∨, ¬, → – e.g. P ∧ Q
• Inference rules are defined over propositional forms – e.g. from P → Q and P, infer Q
– Note that if P is Tom is a man and Q is All men are mortal, then the inference that Tom is mortal does not follow in propositional logic

First order logic allows the expression of propositions and their truth functional combination,
but it also allows us to represent propositions as assertions of predicates about individuals or
sets of individuals

Example: propositions like Tom is a man or All men are mortal may be represented by predicate-argument representations such as man(tom) or ∀x (man(x) → mortal(x))
(So, variables range over individuals)

Inference rules permit conclusions to be drawn about sets/individuals – e.g. mortal(tom)

• First order logic is much more expressive than propositional logic – i.e. it allows a finer grain of specification and reasoning when representing knowledge
• In the context of machine learning, consider learning the relational concept daughter(x, y) defined over pairs of persons x, y, where
o persons are represented by attributes: (Name, Mother, Father, Male, Female)

• Training examples then have the form ⟨person1, person2, target attribute value⟩, e.g.
⟨(Name1 = Ann, Mother1 = Sue, Father1 = Bob, Male1 = F, Female1 = T),
(Name2 = Bob, Mother2 = Gill, Father2 = Joe, Male2 = T, Female2 = F),
Daughter1,2 = T⟩

From such examples, a propositional rule learner such as ID3 or CN2 can only learn rules like:

IF (Father1 = Bob) ∧ (Name2 = Bob) ∧ (Female1 = T)
THEN Daughter1,2 = T

which are correct but overly specific. A first-order learner, by contrast, can learn a single general rule over variables, such as Daughter(x, y) ← Father(x, y) ∧ Female(x) (read: x is the daughter of y if the father of x is y and x is female).

First Order Logic: Terminology

All expressions in first order logic are composed of:


o constants – e.g. bob, 23, a
o variables – e.g. X, Y, Z
o predicate symbols – e.g. female, father – predicates take on the values True or False only
o function symbols – e.g. age – functions can take on any constant as a value
o connectives – e.g. ∧, ∨, ¬, → (or ←)
o quantifiers – e.g. ∀, ∃

A term is
o any constant – e.g. bob
o any variable – e.g. X
o any function applied to any term – e.g. age(bob)

A literal is any predicate or negated predicate applied to any terms – e.g. female(sue),
¬father(X,Y)
– A ground literal is a literal that contains no variables – e.g. female(sue)
– A positive literal is a literal that does not contain a negated predicate – e.g. female(sue)
– A negative literal is a literal that contains a negated predicate – e.g ¬father(X,Y)

A clause is any disjunction of literals L1 ∨· · ·∨Ln whose variables are universally quantified
(With wide scope)

• A Horn clause is any clause containing exactly one positive literal:

16
H ∨ ¬L1 ∨ ··· ∨ ¬Ln
Since ¬L1 ∨ ··· ∨ ¬Ln ≡ ¬(L1 ∧ ··· ∧ Ln)
and (A ∨ ¬B) ≡ (A ← B) (read A ← B as “if B then A”)
then a Horn clause can equivalently be written:
H ← L1 ∧ ··· ∧ Ln
Note: the equivalent form in Prolog: H :- L1, ..., Ln.
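The equivalence between the disjunctive form and the implication form of a Horn clause can be checked mechanically by enumerating truth assignments; a small illustrative sketch (not part of the original notes):

from itertools import product

def horn_disjunction(h, body):       # H ∨ ¬L1 ∨ ... ∨ ¬Ln
    return h or any(not l for l in body)

def horn_implication(h, body):       # H ← L1 ∧ ... ∧ Ln
    return h if all(body) else True  # an implication is true whenever its body is false

# The two forms agree on every truth assignment to H, L1, L2.
print(all(horn_disjunction(h, (l1, l2)) == horn_implication(h, (l1, l2))
          for h, l1, l2 in product([False, True], repeat=3)))   # True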

• A substitution is a function θ = {x1/t1, ..., xn/tn} which, when applied to an expression C, yields a new expression Cθ with each variable xi in C replaced by the term ti.
o Cθ denotes the result of applying θ to C.
o A unifying substitution for two expressions C1 and C2 is any substitution θ such that C1θ = C2θ
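A minimal sketch of applying a substitution θ to a term or literal, with compound terms encoded as tuples (the encoding and the helper name are illustrative assumptions):

def apply_subst(term, theta):
    # Apply substitution theta (a dict mapping variable names to terms) to a term.
    # Convention here: variables are strings starting with an upper-case letter;
    # compound terms (literals, function applications) are tuples like ("father", "X", "Y").
    if isinstance(term, tuple):
        return tuple([term[0]] + [apply_subst(t, theta) for t in term[1:]])
    if isinstance(term, str) and term[:1].isupper():
        return theta.get(term, term)          # variable: replace if bound by theta
    return term                               # constant: unchanged

theta = {"X": "bob", "Y": ("age", "bob")}
print(apply_subst(("father", "X", "Y"), theta))   # ('father', 'bob', ('age', 'bob'))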

3.9 First Order Rule Inductive Learning (FOIL)

• FOIL was proposed by Quinlan (1990); it is the natural extension of SEQUENTIAL-COVERING and LEARN-ONE-RULE to first-order rule learning.

FOIL learns first-order rules which are similar to Horn clauses, with two exceptions:

• literals may not contain function symbols (this reduces the complexity of the hypothesis space)
• literals in the body of a clause may be negated (hence, more expressive than Horn clauses)

Like SEQUENTIAL-COVERING, FOIL learns one rule at a time and removes the positive examples covered by the learned rule before attempting to learn a further rule.

Unlike SEQUENTIAL-COVERING and LEARN-ONE-RULE, FOIL

• only tries to learn rules that predict when the target literal is true (the propositional version sought rules that predicted both true and false values of the target attribute)
• performs a simple hill-climbing search (a beam search of width one)

FOIL searches its hypothesis space via two nested loops:


• The outer loop at each iteration adds a new rule to an overall disjunctive hypothesis (i.e. rule1 ∨ rule2 ∨ ...). This loop may be viewed as a specific-to-general search:
o starting with the empty disjunctive hypothesis, which covers no positive instances
o stopping when the hypothesis is general enough to cover all positive examples

• The inner loop works out the detail of each specific rule, adding conjunctive constraints to the
rule precondition on each iteration.
This loop may be viewed as a general-to-specific search
o starting with the most general precondition (empty)
o stopping when the hypothesis is specific enough to exclude all negative examples

3.9.1 FOIL Algorithm

3.9.2 FOIL: Explanation

The principal differences between FOIL and SEQUENTIAL-COVERING + LEARN-ONE-RULE are:

• In its inner loop search to generate each new rule, FOIL needs to cope with variables in the
rule preconditions

• The performance measure used in FOIL is not the entropy measure used in LEARN-ONE-
RULE since
o the performances of distinct bindings of rule variables need to be distinguished
o FOIL only tries to discover rules that cover positive examples

3.9.3 FOIL: Specializing the Current Rule

• Suppose we are learning a rule of the form: P(x1, x2, ..., xk) ← L1 ∧ ... ∧ Ln
• Then candidate specializations add a new literal of the form:
o Q(v1, ..., vr), where
• Q is any predicate in the rule or training data;
• at least one of the vi in the created literal must already exist as a variable in the rule
o Equal(xj, xk), where xj and xk are variables already present in the rule; or
o the negation of either of the above forms of literals

3.9.4 FOIL: Performance Evaluation Measure

How do we decide which is the best literal to add when specializing a rule?
• To do this FOIL considers each possible binding of variables in the candidate rule
specialization to constants in the training examples.
• For example, suppose we have the training data:
granddaughter(bill, joan)    father(joan, joe)    father(tom, joe)
female(joan)    father(joe, bill)
and we also assume (“closed world assumption”) that any literals
– involving predicates granddaughter, father, and female
– involving constants bill, joan, joe, and tom
– not in the training data

are false. E.g. ¬granddaughter(bill, tom) is also true.

• Given the initial rule: granddaughter(X, Y) ←

FOIL considers all possible bindings of X and Y to the constants bill, joan, joe, and tom. Note that only {X/bill, Y/joan} is a positive binding (i.e. it corresponds to a positive training example). The other 15 bindings of constants to X and Y are negative.
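The binding count in this example can be reproduced with a short script (a hedged sketch; the closed-world check below simply mirrors the assumption stated above):

from itertools import product

constants = ["bill", "joan", "joe", "tom"]
positive_facts = {("granddaughter", "bill", "joan")}   # the single positive example

# All bindings of X and Y for the initial rule granddaughter(X, Y) <- (empty body).
bindings = list(product(constants, repeat=2))
positive = [b for b in bindings if ("granddaughter",) + b in positive_facts]
negative = [b for b in bindings if ("granddaughter",) + b not in positive_facts]
print(len(bindings), len(positive), len(negative))   # 16 1 15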

3.9.5 Foil-Gain

• At each stage of rule specialization, candidate specializations are preferred according to


whether they possess more positive and fewer negative bindings.
• The precise evaluation measure used by FOIL is:

Foil_Gain(L, R) = t * ( log2( p1 / (p1 + n1) ) - log2( p0 / (p0 + n0) ) )

where
• L is the candidate literal to add to rule R
• p0 = number of positive bindings of R
• n0 = number of negative bindings of R
• p1 = number of positive bindings of R + L
• n1 = number of negative bindings of R + L
• t = number of positive bindings of R that are also covered by R + L
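The formula transcribes directly into code (a small sketch; the example binding counts are illustrative):

from math import log2

def foil_gain(p0, n0, p1, n1, t):
    # Foil_Gain(L, R) = t * ( log2(p1/(p1+n1)) - log2(p0/(p0+n0)) )
    return t * (log2(p1 / (p1 + n1)) - log2(p0 / (p0 + n0)))

# Example: rule R has 1 positive and 15 negative bindings; after adding a
# candidate literal L it has 1 positive and 3 negative bindings, and the
# single positive binding of R is still covered (t = 1).
print(foil_gain(p0=1, n0=15, p1=1, n1=3, t=1))   # 2.0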
3.9.6 Summary/Observations of FOIL

FOIL extends the SEQUENTIAL-COVERING and LEARN-ONE-RULE algorithms for


propositional rule learning to first order rule learning
• FOIL learns in two phases:
o an outer loop which acquires a disjunction of Horn clause-like rules which together cover the positive examples
o an inner loop which constructs individual rules by progressive specialization of a rule, adding new literals selected according to the FOIL-gain measure until no negative examples are covered

3.10 Induction as Inverted Deduction

A second, quite different approach to inductive logic programming is based on the simple observation that induction is just the inverse of deduction. In general, machine learning involves building theories that explain the observed data. Given some data D and some partial background knowledge B, learning can be described as generating a hypothesis h that, together with B, explains D. Put more precisely, assume as usual that the training data D is a set of training examples, each of the form ⟨xi, f(xi)⟩. Here xi denotes the ith training instance and f(xi) denotes its target value. Then learning is the problem of discovering a hypothesis h such that the classification f(xi) of each training instance xi follows deductively from the hypothesis h, the description of xi, and any other background knowledge B known to the system.

Induction is finding h such that

(∀⟨xi, f(xi)⟩ ∈ D)  B ∧ h ∧ xi |– f(xi)


where
• xi is the ith training instance
• f(xi) is the target function value for xi
• B is other background knowledge.

The expression X |– Y is read "Y follows deductively from X," or alternatively "X entails Y." The expression above describes the constraint that must be satisfied by the learned hypothesis h; namely, for every training instance xi, the target classification f(xi) must follow deductively from B, h, and xi.

As an example, consider the case where the target concept to be learned is "pairs of people ⟨u, v⟩ such that the child of u is v," represented by the predicate Child(u, v).

Assume we are given a single positive example Child (Bob, Sharon), where the instance is
described by the literals Male(Bob), Female(Sharon), and Father(Sharon, Bob).

Furthermore, suppose we have the general background knowledge Parent(u, v) ← Father(u, v).

We can instantiate the expression above for this example as follows:

xi:  Male(Bob), Female(Sharon), Father(Sharon, Bob)

f(xi):  Child(Bob, Sharon)

B:  Parent(u, v) ← Father(u, v)

Two hypotheses that satisfy the constraint B ∧ h ∧ xi |– f(xi) are h1: Child(u, v) ← Father(v, u), which needs no background knowledge, and h2: Child(u, v) ← Parent(v, u), which relies on B.


3.11 Inverse Resolution

It is easiest to introduce the resolution rule in propositional form, though it is readily extended
to first-order representations. Let L be an arbitrary propositional literal, and let P and R be
arbitrary propositional clauses. The resolution rule is

P ∨ L
¬L ∨ R
---------
P ∨ R

1. Given initial clauses C1 and C2, find a literal L from clause C1 such that ¬L occurs in clause C2.
2. Form the resolvent C by including all literals from C1 and C2, except for L and ¬L. More precisely, the set of literals occurring in the conclusion C is
C = (C1 - {L}) ∪ (C2 - {¬L})

Example: let C1: PassExam ∨ ¬KnowMaterial and C2: KnowMaterial ∨ ¬Study. Resolving C1 and C2 on the literal KnowMaterial gives

C: PassExam ∨ ¬Study

Inverse resolution runs the same step backwards: given the resolvent C: PassExam ∨ ¬Study and the clause C1: PassExam ∨ ¬KnowMaterial, it recovers C2: KnowMaterial ∨ ¬Study.
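A minimal sketch of the propositional resolution step in code, with clauses represented as sets of string literals and negation written with a leading '~' (an illustrative encoding, not from the notes):

def negate(lit):
    return lit[1:] if lit.startswith("~") else "~" + lit

def resolve(c1, c2):
    # Resolvents of two clauses: C = (C1 - {L}) ∪ (C2 - {¬L}) for each literal L
    # in C1 whose negation appears in C2.
    resolvents = []
    for lit in c1:
        if negate(lit) in c2:
            resolvents.append((c1 - {lit}) | (c2 - {negate(lit)}))
    return resolvents

c1 = {"PassExam", "~KnowMaterial"}
c2 = {"KnowMaterial", "~Study"}
print(resolve(c1, c2))   # [{'PassExam', '~Study'}] (set ordering may vary)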

Inverted Resolution (Propositional)

1. Given initial clauses C1 and C, find a literal L that occurs in clause C1 but not in clause C.
2. Form the second clause C2 by including the following literals:
C2 = (C - (C1 - {L})) ∪ {¬L}

First Order Resolution

1. Find a literal L1 from clause C1, a literal L2 from clause C2, and a substitution θ such that L1θ = ¬L2θ.
2. Form the resolvent C by including all literals from C1θ and C2θ, except for L1θ and ¬L2θ. More precisely, the set of literals occurring in the conclusion is
C = (C1 - {L1})θ ∪ (C2 - {L2})θ

Inverting (inverse resolution, first-order form), where the unifying substitution θ is factored as θ1θ2, with θ1 containing the substitutions for variables of C1 and θ2 those for variables of C2:

C2 = (C - (C1 - {L1})θ1)θ2⁻¹ ∪ {¬L1θ1θ2⁻¹}

[Figure: a multistep inverse-resolution derivation. Starting from the training example GrandChild(Bob, Shannon) and the clause Father(Shannon, Tom), inverse resolution with substitution {Shannon/x} yields GrandChild(Bob, x) ∨ ¬Father(x, Tom); a second inverse-resolution step against Father(Tom, Bob) yields the general clause GrandChild(y, x) ∨ ¬Father(x, z) ∨ ¬Father(z, y).]
3.12 PROGOL

PROGOL: reduce the combinatorial explosion by generating the most specific acceptable hypothesis h.

1. The user specifies H by stating the predicates, functions, and forms of arguments allowed for each.
2. PROGOL uses a sequential covering algorithm. For each ⟨xi, f(xi)⟩, find the most specific hypothesis hi such that B ∧ hi ∧ xi |– f(xi) (actually, only k-step entailment is considered).
3. Conduct a general-to-specific search bounded by the specific hypothesis hi, choosing the hypothesis with minimum description length.

