
Module 1

The document outlines the IBM Data Science Professional Certificate's first module, which introduces data science, its applications, and essential skills. It covers topics such as big data, machine learning, and the daily responsibilities of data scientists, emphasizing the importance of curiosity and analytical skills. Additionally, it highlights various data formats, tools, and career pathways within the field of data science.

Uploaded by

fk569140

IBM DATA SCIENCE PROFESSIONAL CERT.

MODULE 1

1. WHAT IS DATA SCIENCE?
SYLLABUS:-

Defining Data Science
• Defining Data Science
• Video: What is Data Science?
• Fundamentals of Data Science
• The Many Paths to Data Science
• Data Science: The Sexiest Job in the 21st Century
• Defining Data Science
• Advice for New Data Scientists

What Do Data Scientists Do?
• A Day in the Life of a Data Scientist
• Data Science Skills & Big Data
• Working on Different File Formats
• Data Science Topics and Algorithms
• Discussion Prompt: Introduce Yourself
• Reading: What Makes Someone a Data Scientist?

Data Science Topics

Big Data and Data Mining
• How Big Data is Driving Digital Transformation
• Introduction to Cloud
• Cloud for Data Science
• Foundations of Big Data
• Data Scientists at New York University
• What is Hadoop?
• Big Data Processing Tools: Hadoop, HDFS, Hive, and Spark
• Reading: Data Mining

Deep Learning and Machine Learning
• Artificial Intelligence and Data Science
• Generative AI and Data Science
• Neural Networks and Deep Learning
• Applications of Machine Learning
• Reading: Regression
• Lab: Exploring Data using IBM Cloud Gallery

Applications and Careers in Data Science:

Data Science Application Domains
• How Should Companies Get Started in Data Science?
• Old Problems with New Data Science Solutions
• Applications of Data Science
• How Data Science is Saving Lives
• Reading: The Final Deliverable

Careers and Recruiting in Data Science
• How Can Someone Become a Data Scientist?
• Recruiting for Data Science
• Careers in Data Science
• Importance of Mathematics and Statistics for Data Science (only name change)
• The Report Structure
• Reading: Infographic on roadmap

Data Literacy for Data Science (Optional):

Understanding Data
• Understanding Data
• Data Sources
• Working on Varied Data Sources and Types
• Reading: Metadata

Data Literacy
• Data Collection and Organization
• Relational Database Management System
• NoSQL
• Data Marts, Data Lakes, ETL, and Data Pipelines
• Considerations for Choice of Data Repository
• Data Integration Platforms

Term | Definition | Video where the term is introduced
Algorithms | A set of step-by-step instructions to solve a problem or complete a task. | What is Data Science?
Outliers | When a data point or points occur significantly outside of most of the other data in a data set, potentially indicating anomalies, errors, or unique phenomena that could impact statistical analysis or modeling. | What is Data Science?
Model | A representation of the relationships and patterns found in data to make predictions or analyze complex systems retaining ... | What is Data Science?
Quantitative analysis | A systematic approach using mathematical and statistical analysis is used to interpret ... | Many Paths to Data Science
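
The glossary entry for outliers describes points that fall significantly outside most of the other data. The sketch below shows one common way to flag such points, assuming the 1.5 × IQR rule; that rule is a widely used convention, not something the module prescribes. The function name `find_outliers` and the sample readings are hypothetical.

```python
import statistics

def find_outliers(values, k=1.5):
    """Return the values lying more than k * IQR outside the quartiles.

    Assumption: the 1.5 * IQR convention is used as the cutoff; other
    definitions of "significantly outside" are equally valid.
    """
    # statistics.quantiles with n=4 returns the three quartile cut points.
    q1, _, q3 = statistics.quantiles(values, n=4)
    iqr = q3 - q1
    low, high = q1 - k * iqr, q3 + k * iqr
    return [v for v in values if v < low or v > high]

# Hypothetical sample: nine similar readings and one extreme value.
readings = [10, 12, 11, 13, 12, 11, 95, 12, 10, 13]
print(find_outliers(readings))  # prints [95]
```

Whether a flagged point is an error or a genuine phenomenon still has to be judged in context; the rule only points at candidates.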
Who is an Actual Data Scientist?
Someone who finds solutions to problems by analyzing big or small data using appropriate tools and then tells stories to communicate her findings to the relevant stakeholders. As long as one has a curious mind, fluency in analytics, and the ability to communicate the findings, I consider the person a data scientist.

Term | Definition | Video where the term is introduced
Comma-separated values (CSV) / Tab-separated values (TSV) | Commonly used format for storing tabular data as plain text where either the comma or the tab separates each value. | Working on Different File Formats
Data file types | A computer file configuration designed to store data in a specific way. | Working on Different File Formats
Data format | How data is encoded so it can be stored within a data file type. | Working on Different File Formats
Data visualization | A visual way, such as a graph, of representing data in a readily understandable way that makes it easier to see trends in the data. | Data Science Topics and Algorithms
Delimited text file | A plain text file where a specific character separates the data values. | Working on Different File Formats
Extensible Markup Language (XML) | A language designed to structure, store, and enable data exchange between various technologies. | Working on Different File Formats
Hadoop | An open-source framework designed to store and process large datasets across clusters of computers. | What Makes Someone a Data Scientist?
JavaScript Object Notation (JSON) | A data format compatible with various programming languages for two applications to exchange structured data. | Working on Different File Formats
Jupyter notebooks | A computational environment that allows users to create and share documents containing code, equations, visualizations, and explanatory text. See Python notebooks. | Data Science Skills & Big Data
Nearest neighbor | A machine learning algorithm that predicts a target variable based on its similarity to other values in the dataset. | Working on Different File Formats
Neural networks | A computational model used in deep learning that mimics the structure and functioning of the human brain's neural pathways. It takes an input, processes it using previous learning, and produces an output. | A Day in the Life of a Data Scientist
Pandas | An open-source Python library that provides tools for working with structured data; it is often used for data manipulation and analysis. | Data Science Skills & Big Data
Python notebooks | Also known as a "Jupyter" notebook, this computational environment allows users to create and share documents containing code, equations, visualizations, and explanatory text. | Data Science Skills & Big Data
R | An open-source programming language used for statistical computing, data analysis, and data visualization. | Data Science Skills & Big Data
Recommendation engine | A computer program that analyzes user input, such as behaviors or preferences, and makes personalized recommendations based on that analysis. | A Day in the Life of a Data Scientist
Regression | A statistical model that shows a relationship between one or more predictor variables and a response variable. | Data Science Topics and Algorithms
Tabular data | Data that is organized into rows and columns. | A Day in the Life of a Data Scientist

What Do Data Scientists Do?
• Data science is the study of large quantities of data, which can reveal insights that help organizations make strategic choices.
• There are many paths to a career in data science; most, but not all, involve math, programming, and curiosity about data.
• New data scientists need to be curious, judgemental, and argumentative.
• Knowledgeable data scientists are in high demand. Jobs in data science pay high salaries for skilled workers.
• The typical work day for a data scientist varies depending on what type of project they are working on.
• Many algorithms are used to bring out insights from data.
• Some key data science related terms you learned in this lesson include: outliers, model, algorithms, JSON, XML, CSV, and regression.
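
The file formats covered in this module (CSV/TSV, JSON, and XML) can all carry the same tabular records. As a rough illustration, the sketch below reads identical data from each format using only Python's standard library. The sample records, the XML tag names (`people`, `person`), and the helper names `read_delimited`, `read_json`, and `read_xml` are assumptions made for the example, not part of the course material.

```python
import csv
import io
import json
import xml.etree.ElementTree as ET

# Hypothetical sample data: the same two records in each format.
CSV_TEXT = "name,role\nAda,analyst\nLin,engineer\n"
TSV_TEXT = "name\trole\nAda\tanalyst\nLin\tengineer\n"
JSON_TEXT = '[{"name": "Ada", "role": "analyst"}, {"name": "Lin", "role": "engineer"}]'
XML_TEXT = (
    "<people>"
    "<person><name>Ada</name><role>analyst</role></person>"
    "<person><name>Lin</name><role>engineer</role></person>"
    "</people>"
)

def read_delimited(text, delimiter):
    """Parse delimited text (CSV or TSV) into dicts keyed by the header row."""
    return [dict(row) for row in csv.DictReader(io.StringIO(text), delimiter=delimiter)]

def read_json(text):
    """Parse a JSON array of objects into a list of dicts."""
    return json.loads(text)

def read_xml(text):
    """Turn each <person> element into a dict of child tag -> text."""
    root = ET.fromstring(text)
    return [{child.tag: child.text for child in person}
            for person in root.findall("person")]

rows_csv = read_delimited(CSV_TEXT, ",")
rows_tsv = read_delimited(TSV_TEXT, "\t")
rows_json = read_json(JSON_TEXT)
rows_xml = read_xml(XML_TEXT)

# All four formats decode to the same list of records.
print(rows_csv == rows_tsv == rows_json == rows_xml)  # prints True
```

CSV and TSV differ only in the delimiter, which is why one helper handles both; JSON and XML need their own parsers because they encode structure rather than rows of separated values.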
