It, Hardware Exp1

The document outlines a practical experiment conducted by a student in a Computer Science & Engineering course, focusing on data preprocessing and AI/ML packages using Google Colab. It details the tools and libraries used, such as Pandas, NumPy, and Scikit-learn, along with various AI/ML algorithms like Linear Regression and Decision Trees. The document also includes learning outcomes related to the importance of data preprocessing and familiarity with AI/ML algorithms.

University Institute of Engineering

Department of Computer Science & Engineering

Experiment: 1

Student Name: MAYANK AGARWAL UID: 24BCY70057


Branch: Computer Science & Engineering Section/Group:24BCY-116(B)
Semester : 2ND Date of Performance: 11/1/2025
Subject Name: IT, HARDWARE AND AI WORKSHOP

Subject Code: 24ECP-102

1. Aim of the practical: EXPLORE DATA PREPROCESSING PACKAGES AND AI/ML PACKAGES.

2. Tool Used: GOOGLE COLAB

3. Basic Concept/ Command Description:


• GOOGLE COLAB: A free, cloud-based notebook platform that allows users to write and run Python and R code in the browser.

• DATA PREPROCESSING: The process of cleaning, organizing, and transforming raw data into a format that is more suitable for analysis and model training.

• PREPROCESSING PACKAGES:

➢ Pandas Basics: Pandas is an open-source library providing high-performance, easy-to-use data structures and data analysis tools. It is particularly well suited to data manipulation and cleaning.

➢ NumPy Functions: NumPy is a foundational library for numerical computing in Python. It offers support for large, multi-dimensional arrays and matrices, along with a collection of mathematical functions to operate on these arrays.

➢ Scikit-learn Prep: Scikit-learn is a machine learning library that includes various preprocessing modules. It provides tools for feature extraction, normalization, standardization, and encoding categorical variables.

➢ Matplotlib Insights: Matplotlib is primarily a plotting library, but it is also useful during preprocessing when the data needs to be inspected visually, for example to spot outliers or skewed distributions.

➢ Seaborn Techniques: Seaborn builds on Matplotlib and provides a high-level interface for drawing attractive statistical graphics. It integrates closely with Pandas data structures and offers options for visualizing complex datasets.
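A minimal sketch of how the packages above might be combined in one preprocessing pass. The toy table, its column names, and the choice of mean imputation are assumptions made for illustration, not part of the worksheet; the plotting libraries are left out to keep the example short.

```python
import numpy as np
import pandas as pd
from sklearn.preprocessing import StandardScaler

# Hypothetical toy data: one missing age and one categorical column.
df = pd.DataFrame({
    "age": [20.0, 22.0, np.nan, 24.0],
    "city": ["Delhi", "Pune", "Delhi", "Mumbai"],
})

# Pandas: fill the missing age with the mean of the known ages.
df["age"] = df["age"].fillna(df["age"].mean())

# Pandas: one-hot encode the categorical column.
encoded = pd.get_dummies(df, columns=["city"])

# Scikit-learn: standardize age to zero mean and unit variance.
scaled_age = StandardScaler().fit_transform(encoded[["age"]])

# NumPy: check the statistics of the scaled column.
print(np.round(scaled_age.mean(), 6), np.round(scaled_age.std(), 6))
print(list(encoded.columns))
```

Mean imputation is only one option; dropping incomplete rows or filling with the median are equally common, and the right choice depends on the dataset.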

• AI/ML ALGORITHMS:

➢ Linear Regression: A relationship is established between the independent and dependent variables by fitting a line to the data. This line is known as the regression line and is represented by the linear equation Y = a * X + b, where a is the slope and b is the intercept.

➢ Logistic Regression: Logistic Regression is used to estimate discrete values (usually binary values like 0/1) from a set of independent variables. It predicts the probability of an event by fitting the data to a logistic (sigmoid) function, whose inverse is the logit. It is also called logit regression.

➢ Decision Tree: One of the most popular algorithms in use today. It is a supervised learning algorithm that is used mainly for classification problems.

➢ SVM Algorithm: SVM (Support Vector Machine) is a classification method in which each data item is plotted as a point in an n-dimensional space (where n is the number of features you have), and a boundary (hyperplane) that separates the classes is then found.

➢ Naive Bayes Algorithm: A Naive Bayes classifier assumes that the presence of a particular feature in a class is unrelated to the presence of any other feature.
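As an illustration of the regression line Y = a * X + b described above, the sketch below fits scikit-learn's `LinearRegression` to synthetic points generated with a = 2 and b = 1. The data and variable names are assumptions chosen so that the fitted values are easy to check.

```python
import numpy as np
from sklearn.linear_model import LinearRegression

# Synthetic data lying exactly on the line Y = 2 * X + 1 (assumed values).
X = np.array([[0.0], [1.0], [2.0], [3.0]])
y = 2.0 * X.ravel() + 1.0

# Fit the regression line to the points.
model = LinearRegression().fit(X, y)

a = model.coef_[0]       # slope of the regression line
b = model.intercept_     # intercept of the regression line
print(round(float(a), 3), round(float(b), 3))

# Predict Y for a new X value using the fitted line.
prediction = model.predict(np.array([[4.0]]))[0]
print(round(float(prediction), 3))
```

The other algorithms in the list follow the same fit-then-predict pattern in scikit-learn, with a different estimator class in place of `LinearRegression`.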

4. Code:
• Open Google Colab and click on New notebook.

• FIRST CLICK ON TEXT AND THEN ON CODE:



• CODE FOR DATA TYPE:
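A minimal sketch of a data-type check such a cell might contain. The variable names and values are illustrative assumptions, not taken from the worksheet.

```python
# Illustrative values; the names are assumptions for this sketch.
count = 10          # int
price = 99.5        # float
name = "Colab"      # str
is_ready = True     # bool

for value in (count, price, name, is_ready):
    # type() returns the class of the object it is given.
    print(value, type(value).__name__)
```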

• CODE FOR PRINTING THREE PLACES AFTER THE DECIMAL:
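One common way to do this is a format specifier; the sketch below assumes an arbitrary sample value.

```python
value = 3.14159265  # arbitrary sample value for this sketch

# The .3f format specifier keeps exactly three digits after the decimal point.
formatted = f"{value:.3f}"
print(formatted)

# round() returns a float rather than a string, so trailing zeros are not kept.
rounded = round(value, 3)
print(rounded)
```

The f-string form is usually preferable for display, since `f"{2.5:.3f}"` gives `2.500` while `round(2.5, 3)` gives `2.5`.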



• LIST:
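A short sketch of basic list operations, using made-up items. Lists are ordered and mutable, so elements can be added, replaced, and removed in place.

```python
fruits = ["apple", "banana", "cherry"]  # made-up sample items

fruits.append("mango")      # add an item at the end
fruits[0] = "apricot"       # replace the first item by index
fruits.remove("banana")     # remove the first matching item

print(fruits)
print(len(fruits), fruits[-1])  # length and last element
```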

• TUPLE:
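A short sketch contrasting a tuple with a list: a tuple can be indexed and unpacked, but it cannot be modified after creation. The values are illustrative.

```python
point = (3, 4)  # illustrative values

# Tuples support indexing and unpacking like lists.
x, y = point
print(x, y)

# Unlike lists, tuples are immutable: item assignment raises TypeError.
try:
    point[0] = 10
except TypeError as error:
    print("tuples are immutable:", error)
```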

• DICTIONARY:
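A short sketch of a dictionary, which maps keys to values. The record below is hypothetical sample data.

```python
student = {"name": "Asha", "semester": 2}  # hypothetical sample record

student["branch"] = "CSE"   # add a new key
student["semester"] = 3     # update an existing key

# get() returns a default instead of raising KeyError for a missing key.
email = student.get("email", "not provided")

for key, value in student.items():
    print(key, "->", value)
print(email)
```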

• TO READ THE CONTENT OF TEXT DATA:
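A minimal sketch of reading a text file. The file name `sample.txt` is an assumption, and the file is written first so that the example runs on its own in a fresh Colab session.

```python
from pathlib import Path

path = Path("sample.txt")  # assumed file name for this sketch

# Create a small file first so there is something to read.
path.write_text("first line\nsecond line\n", encoding="utf-8")

# The with-statement closes the file automatically after reading.
with open(path, "r", encoding="utf-8") as handle:
    content = handle.read()

print(content)
```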



• TO CREATE A NEW FILE:
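A minimal sketch of creating a file with mode `"x"`, which fails if the file already exists. The file name is an assumption, and any leftover copy is removed first so the cell can be re-run.

```python
import os

filename = "new_file.txt"  # assumed file name for this sketch

# Remove a leftover copy so the example can be run more than once.
if os.path.exists(filename):
    os.remove(filename)

# Mode "x" creates the file and raises FileExistsError if it already exists.
with open(filename, "x", encoding="utf-8") as handle:
    handle.write("created by this sketch\n")

print(os.path.exists(filename))
```

Mode `"w"` would also create the file, but it silently overwrites an existing one.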

• WRITE 1 TO 10:
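A minimal sketch that writes the numbers 1 to 10 to a file, one per line, and reads them back. The file name `numbers.txt` is an assumption.

```python
filename = "numbers.txt"  # assumed file name for this sketch

# range(1, 11) yields 1 through 10; the upper bound is exclusive.
with open(filename, "w", encoding="utf-8") as handle:
    for number in range(1, 11):
        handle.write(f"{number}\n")

# Read the file back to confirm what was written.
with open(filename, encoding="utf-8") as handle:
    lines = handle.read().split()

print(lines)
```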

• TO DELETE A FILE:
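A minimal sketch of deleting a file with `os.remove`. The file name is an assumption, and a temporary file is created first so there is something to delete.

```python
import os

filename = "temp_to_delete.txt"  # assumed file name for this sketch

# Create a temporary file to delete.
with open(filename, "w", encoding="utf-8") as handle:
    handle.write("temporary\n")

# Check that the file exists before removing it to avoid FileNotFoundError.
if os.path.exists(filename):
    os.remove(filename)

print(os.path.exists(filename))
```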

• TO GET ALL THE FILES AND DIRECTORIES IN THE CURRENT DIRECTORY:
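A minimal sketch using `os.listdir`, which returns the names of the entries in a directory. Labelling each entry as a file or a directory is an addition made here for readability.

```python
import os

# "." refers to the current working directory.
entries = sorted(os.listdir("."))

for entry in entries:
    # Distinguish directories from regular files.
    kind = "dir" if os.path.isdir(entry) else "file"
    print(kind, entry)
```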

• INPUT FROM THE USER:
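A minimal sketch of reading user input. The function name `ask_and_greet`, its prompts, and the `reader` parameter are hypothetical; `reader` defaults to the built-in `input` and exists only so the function can be tried without typing.

```python
def ask_and_greet(reader=input):
    """Ask for a name and an age, then return a greeting string."""
    # input() always returns a string, so surrounding spaces are stripped.
    name = reader("Enter your name: ").strip()
    # Convert the age text to an integer before doing arithmetic on it.
    age = int(reader("Enter your age: "))
    return f"Hello {name}, next year you will be {age + 1}."
```

In a Colab cell, `print(ask_and_greet())` shows the two prompts and then prints the greeting. Entering a non-numeric age raises `ValueError`, because `int()` cannot convert it.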

5. Observations, Simulation Screen Shots and Discussions:



6. Additional Creative Inputs (If Any):

Learning outcomes (What I have learnt):

1. The Importance of Data Pre-Processing

• Learn why pre-processing raw data is a critical step in machine learning pipelines.

• Understand how clean, well-prepared data enhances model accuracy and efficiency.

2. Mastery of Pre-Processing Tools and Techniques

• Learn specific techniques such as handling missing data, normalization, encoding categorical variables, and feature selection.

3. Familiarity with AI/ML Algorithms

• Develop knowledge of key AI/ML algorithms, including supervised, unsupervised, and reinforcement learning methods.

Evaluation Grid:

Sr. No.  Parameters                                     Marks Obtained  Maximum Marks
1.       Student Performance (Conduct of experiment)                    12
2.       Viva Voce                                                      10
3.       Submission of Work Sheet (Record)                              8
Signature of Faculty (with Date):        Total Marks Obtained:          30
