01 JTW115 3 Dec 2022


e-Lecture Series, PPPJJ, USM

JTW 115E
Introduction to Data Analytics
Mohd Faiz Hilmi

3 Dec 2022

We lead

Course Manager:
Dr. Mohd Faiz Hilmi

E-mail:
faiz@usm.my

4/12/2022 PPPJJ, USM

Question/Comment

https://bit.ly/AskJTW115

Answer

https://bit.ly/FAQJTW115

Timetable

Synopsis

This course introduces the knowledge and techniques of the data analytics process, together with
some theoretical foundations, including useful statistical and machine learning concepts, so that
the process can transform hypotheses and data into actionable predictions.

The course covers the basic principles of the key steps of the process: collecting, organizing
and analyzing data, building predictive models, and presenting results.

A programming language and statistical analysis techniques are introduced through examples
from operations, marketing, business intelligence and decision support, alongside elements of
sustainability, ethics and entrepreneurship.

Brief Introduction

Topics

No. Topic

1 Introduction to Big Data Analytics
• Big Data Overview
• State of the Practice in Analytics
• Key Roles for the New Big Data Ecosystem
• Examples of Big Data Analytics

2 Data Analytics Lifecycle
• Data Analytics Lifecycle Overview
• Phase 1: Discovery
• Phase 2: Data Preparation
• Phase 3: Model Planning
• Phase 4: Model Building
• Phase 5: Communicate Results
• Phase 6: Operationalize

3 Review of Basic Data Analytic Methods
• Introduction to Data Analytic Methods
• Exploratory Data Analysis
• Statistical Methods for Evaluation

4 Statistical Distributions
• Normal Distribution
• Lognormal Distribution
• Binomial Distribution
• Tools for Distributions

5 Statistical Theory
• Statistical Philosophy
• A/B Tests
• Power of Tests
• Specialized Statistical Tests

6 Working with Data
• Working with Data from Files
• Working with Relational Databases
• Working with External Data Sources

7 Data Preparation
• Cleaning Data
• Single Imputation & Multiple Imputation
• Characterize Data
• Sampling for Modelling and Validation

8 Data Analysis
• Descriptive Analysis
• Predictive Analysis
• Time Series Data
• Optimization Analysis

9 Choosing and Evaluating Models
• Mapping Problems to Machine Learning Tasks
• Evaluating Models
• Validating Models

10 Unsupervised Methods for Machine Learning: Association Rules
• Apriori Algorithm
• Evaluation of Candidate Rules
• Applications of Association Rules
• Validation and Testing
• Diagnostics

11 Supervised Methods for Machine Learning: Logistic Regression
• Use Cases
• Model Description
• Diagnostics

12 Reporting
• Table-Based Report
• Different Output Formats
• ODS Graphics Procedures

Assessment

Assignment : 30%
PB (Mid Term Exam) : 20%
PA (Final Exam) : 50%
Total : 100%
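
The weights above can be combined into an overall course mark as a weighted sum. The following is a minimal sketch, assuming each component is marked on a 0 to 100 scale (the slides do not state the scale); the function name, the dictionary keys and the sample marks are hypothetical and only illustrate the arithmetic.

```python
# Weights taken from the assessment breakdown: assignment 30%, PB 20%, PA 50%.
WEIGHTS = {"assignment": 0.30, "pb_mid_term": 0.20, "pa_final": 0.50}


def overall_mark(marks: dict[str, float]) -> float:
    """Combine component marks (assumed to be on a 0-100 scale) into a weighted total."""
    return sum(WEIGHTS[name] * marks[name] for name in WEIGHTS)


# Hypothetical marks, for illustration only.
example = {"assignment": 80.0, "pb_mid_term": 70.0, "pa_final": 60.0}
print(round(overall_mark(example), 1))  # 0.3*80 + 0.2*70 + 0.5*60 = 68.0
```

Because the final exam carries half of the total, a 10-point change in the PA mark moves the overall mark by 5 points, compared with 3 points for the assignment and 2 points for the mid term exam.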

Assignment (30%)

• Group work
– Min 6 members
– Max 10 members
– If a group has fewer than 6 or more than 10 members
• Points will be deducted
– Register your group
• https://bit.ly/RegisterGroupJTW115
• Detailed instructions will be given in the portal
• Due: 26 March 2023

PB/Mid Term Exam (20%)

• During Intensive Week
• Format of questions
– Objective/MCQ
• Topics covered
– Selected topics
– Will be announced later

PA/Final Exam (50%)

• July/Aug 2023
• Format
– Objective/MCQ
– 50 questions
• Duration: 3 hours
• Coverage
– All topics


References

Primary References:
• e-Lectures/Self-Instructional Learning Material (SIM)
• Notes
• EMC Education Services (2015). Data Science & Big Data Analytics:
Discovering, Analyzing, Visualizing and Presenting Data. Indianapolis: John
Wiley & Sons.
• Berthold, M. R., Borgelt, C., Höppner, F., & Klawonn, F. (2010). Guide to
Intelligent Data Analysis: How to Intelligently Make Sense of Real Data.
London: Springer.

Additional References:
• Hofmann, M., & Klinkenberg, R. (2014). RapidMiner: Data Mining Use
Cases and Business Analytics Applications. London: CRC Press.
• Liebowitz, J. (2013). Big Data and Business Analytics. Florida: Auerbach
Publications.
Portal
JTW115E

Good luck!

Thank you
