Ai Advanced - Unit - 3
2. It checks how many wrong positive predictions the model made (False Positives
should be low).
4. Precision matters most when a wrong positive prediction causes problems (for
example, a spam filter flagging a genuine email).
5. High Precision = Few false positives = More reliable positive predictions.
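To make the idea concrete, here is a minimal sketch of Precision = TP / (TP + FP) in plain Python. The function name `precision` and the spam labels are made up for illustration (1 = spam, 0 = not spam); they are not from any library.

```python
def precision(y_true, y_pred, positive=1):
    """Fraction of predicted positives that are actually positive."""
    # True Positive: predicted positive and really positive
    tp = sum(1 for t, p in zip(y_true, y_pred) if p == positive and t == positive)
    # False Positive: predicted positive but really negative
    fp = sum(1 for t, p in zip(y_true, y_pred) if p == positive and t != positive)
    if tp + fp == 0:
        return 0.0  # assumption: report 0.0 when nothing was predicted positive
    return tp / (tp + fp)


# Hypothetical spam-filter results: 1 = spam, 0 = not spam
y_true = [1, 0, 1, 1, 0, 0, 1, 0]
y_pred = [1, 1, 1, 0, 0, 0, 1, 0]

spam_precision = precision(y_true, y_pred)
print(spam_precision)  # 3 true positives, 1 false positive -> 0.75
```

Here one genuine email was flagged as spam, so precision drops from 1.0 to 0.75.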
2. It measures how many actual positives the model missed (False
Negatives).
4. Recall matters most when every positive case must be caught (for example,
cancer detection).
5. High Recall = Few missed positives = Better detection rate.
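A minimal sketch of Recall = TP / (TP + FN) in plain Python. The function name `recall` and the screening labels are hypothetical (1 = disease present, 0 = healthy), chosen only to illustrate the formula.

```python
def recall(y_true, y_pred, positive=1):
    """Fraction of actual positives that the model caught."""
    # True Positive: really positive and predicted positive
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == positive and p == positive)
    # False Negative: really positive but the model missed it
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == positive and p != positive)
    if tp + fn == 0:
        return 0.0  # assumption: report 0.0 when there are no actual positives
    return tp / (tp + fn)


# Hypothetical screening results: 1 = disease present, 0 = healthy
y_true = [1, 1, 1, 1, 0, 0, 0, 0]
y_pred = [1, 1, 0, 0, 0, 0, 0, 1]

screening_recall = recall(y_true, y_pred)
print(screening_recall)  # 2 caught, 2 missed -> 0.5
```

The model missed two of the four real cases, so recall is only 0.5 even though it made just one false alarm.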
4. It measures the distance (for example, Euclidean distance) from the point being
classified to every point in the training data.
5. It then picks the majority class among the top K closest neighbors.
6. KNN is a lazy learner: it learns nothing at training time and does all of its
calculation at prediction time.
7. It is a simple algorithm, but it can be slow on large datasets.
8. Typical uses are image recognition, recommendation systems, and pattern
matching.
🔥 Features:
● Simple and intuitive
● Slow for large data (because the distances are recalculated for every prediction)
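The KNN steps described above (measure distances, take the K closest, vote) can be sketched in a few lines of plain Python. This is only an illustration: the function name `knn_predict` and the toy points are assumptions, and it uses Euclidean distance via `math.dist` (Python 3.8+).

```python
import math
from collections import Counter


def knn_predict(train_points, train_labels, query, k=3):
    """Classify `query` by majority vote among its k nearest training points."""
    # Distance from the query to every training point (no training step at all)
    distances = [
        (math.dist(point, query), label)
        for point, label in zip(train_points, train_labels)
    ]
    # Keep the k closest neighbors
    distances.sort(key=lambda pair: pair[0])
    top_k_labels = [label for _, label in distances[:k]]
    # Majority class among those neighbors
    return Counter(top_k_labels).most_common(1)[0][0]


# Hypothetical 2-D training data with two classes
train_points = [(1, 1), (1, 2), (2, 1), (6, 6), (7, 6), (6, 7)]
train_labels = ["A", "A", "A", "B", "B", "B"]

near_a = knn_predict(train_points, train_labels, (2, 2), k=3)
near_b = knn_predict(train_points, train_labels, (6, 5), k=3)
print(near_a, near_b)
```

Every call loops over the whole training set, which is why KNN gets slow as the data grows.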
3. This line is drawn so that the margin (distance) between the two classes is
maximum.
4. Support vectors are the data points that lie closest to the separating line;
they decide where the margin falls.
5. When the data is not linearly separable, SVM uses the kernel trick (for example,
RBF or Polynomial kernels).
6. SVM also performs well on high-dimensional data.
7. Use cases include face detection, bioinformatics, text classification, and
fraud detection.
📊 Example:
You have 2 categories: Cats and Dogs.
Based on their features (like height & weight), SVM will draw a straight line (or a
curve) that clearly separates the two.
✅ Goal:
● Separate the classes in such a way that the margin (distance) to both classes
is maximum.
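To see what "maximum margin" means, the sketch below compares two hand-picked candidate lines on toy data and measures the margin of each. It does not train an SVM; it only computes the quantity an SVM tries to maximize. All names (`signed_distance`, `margin`, `support_vectors`) and the data points are hypothetical.

```python
import math


def signed_distance(point, w, b):
    """Signed distance from `point` to the line w.x + b = 0."""
    norm = math.sqrt(sum(wi * wi for wi in w))
    return (sum(wi * xi for wi, xi in zip(w, point)) + b) / norm


def margin(points, labels, w, b):
    """Smallest distance from any point to the line (negative if a point is misclassified)."""
    return min(label * signed_distance(p, w, b) for p, label in zip(points, labels))


def support_vectors(points, labels, w, b, tol=1e-9):
    """Points that sit exactly at the margin distance, i.e. closest to the line."""
    m = margin(points, labels, w, b)
    return [
        p for p, label in zip(points, labels)
        if abs(label * signed_distance(p, w, b) - m) <= tol
    ]


# Hypothetical (height, weight) features; labels are -1 and +1
points = [(1, 1), (2, 1), (1, 2), (4, 4), (5, 4), (4, 5)]
labels = [-1, -1, -1, 1, 1, 1]

# Candidate 1: vertical line x = 3.  Candidate 2: diagonal line x + y = 5.5
margin_vertical = margin(points, labels, w=(1, 0), b=-3)
margin_diagonal = margin(points, labels, w=(1, 1), b=-5.5)
closest = support_vectors(points, labels, w=(1, 1), b=-5.5)

print(margin_vertical, margin_diagonal, closest)
```

Both lines separate the classes, but the diagonal one leaves a wider gap, so it is the better choice by the SVM criterion. The points returned by `support_vectors` are the ones touching that gap.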
📚 Types of SVM:
1. Linear SVM:
○ Used when the data is linearly separable (it can be split by one straight line).
○ When the data is not linearly separable, SVM uses "kernel functions" (like RBF,
Polynomial) to map the data into a higher dimension where it can be separated linearly.
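A small sketch of why kernel functions work: a degree-2 polynomial kernel gives the same number as first mapping both points into a higher dimension and then taking a dot product, without ever building the mapped points. The names `feature_map`, `poly_kernel`, and `rbf_kernel` and the sample vectors are illustrative assumptions, and `gamma=0.5` is an arbitrary choice.

```python
import math


def dot(u, v):
    return sum(ui * vi for ui, vi in zip(u, v))


def feature_map(x):
    """Explicit map from 2-D to 3-D: (x1, x2) -> (x1^2, sqrt(2)*x1*x2, x2^2)."""
    x1, x2 = x
    return (x1 * x1, math.sqrt(2) * x1 * x2, x2 * x2)


def poly_kernel(x, y):
    """Degree-2 polynomial kernel computed directly in the original 2-D space."""
    return dot(x, y) ** 2


def rbf_kernel(x, y, gamma=0.5):
    """RBF kernel: similarity decays with squared Euclidean distance."""
    squared_distance = sum((xi - yi) ** 2 for xi, yi in zip(x, y))
    return math.exp(-gamma * squared_distance)


x, y = (1, 2), (3, 4)
via_kernel = poly_kernel(x, y)                 # stays in 2-D
via_map = dot(feature_map(x), feature_map(y))  # goes through 3-D explicitly

print(via_kernel, via_map, rbf_kernel(x, y))
```

Both routes give the same similarity value, which is the point of the kernel trick: the higher-dimensional separation comes without the cost of computing the mapped coordinates.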
3. This curve shows how well the model performs at different threshold
values.
4. If the curve lies close to the top-left corner, the model is performing
well.
5. The higher the AUC (Area Under Curve) value (closer to 1), the better the
model separates the two classes.
6. ROC is often used on imbalanced datasets such as fraud detection and cancer
diagnosis, though with heavy imbalance a precision-recall curve is usually more
informative.
7. A perfect model's ROC curve reaches the top-left corner, which gives AUC = 1.
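A minimal sketch of how an ROC curve and its AUC can be computed by hand: sweep the threshold over the predicted scores, record (False Positive Rate, True Positive Rate) at each step, then add up the area with the trapezoid rule. The function names and the scores are hypothetical, and the sketch assumes both classes are present in `y_true`.

```python
def roc_points(y_true, scores):
    """(FPR, TPR) pairs obtained by lowering the threshold through every score."""
    positives = sum(y_true)
    negatives = len(y_true) - positives
    points = [(0.0, 0.0)]  # threshold above every score: nothing predicted positive
    for threshold in sorted(set(scores), reverse=True):
        tp = sum(1 for y, s in zip(y_true, scores) if s >= threshold and y == 1)
        fp = sum(1 for y, s in zip(y_true, scores) if s >= threshold and y == 0)
        points.append((fp / negatives, tp / positives))
    return points


def auc(points):
    """Area under the ROC curve using the trapezoid rule."""
    area = 0.0
    for (x0, y0), (x1, y1) in zip(points, points[1:]):
        area += (x1 - x0) * (y0 + y1) / 2
    return area


# Hypothetical model scores: 1 = positive class, 0 = negative class
y_true = [1, 1, 0, 1, 0, 0]
scores = [0.9, 0.8, 0.7, 0.6, 0.4, 0.2]
model_auc = auc(roc_points(y_true, scores))

# A model that ranks every positive above every negative
perfect_auc = auc(roc_points([1, 1, 0, 0], [0.9, 0.8, 0.3, 0.1]))

print(model_auc, perfect_auc)
```

The first model ranks one negative (0.7) above one positive (0.6), so its AUC is 8/9 rather than 1; the second ranks everything correctly and reaches AUC = 1.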