0% found this document useful (0 votes)
67 views

B. NoSQL, Big Data, and Spark Foundations - Coursera

Uploaded by

Hafiszan
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
67 views

B. NoSQL, Big Data, and Spark Foundations - Coursera

Uploaded by

Hafiszan
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 7

Browse Information Technology Data Management

NoSQL, Big Data, and Spark Foundations Specialization


Springboard your Big Data career. Master fundamentals of NoSQL, Big Data, and Apache Spark with hands-on job-ready skills in machine learning and data
engineering.

Taught in English 19 languages available Some content may not be translated

Instructors: IBM Skills Network Team +6 more

Enroll
Starts Dec 21

Specialization - 3 course series


Get in-depth knowledge of a subject

4.3 (129 reviews)

Beginner level
Recommended experience

1 months at 10 hours a week

Flexible schedule
Learn at your own pace

View all courses

What you'll learn

Work with NoSQL databases to insert, update, delete, query, index, Develop hands-on NoSQL experience working with MongoDB, Apache
aggregate, and shard/partition data. Cassandra, and IBM Cloudant.

Develop foundational knowledge of Big Data and gain hands-on lab Perform Extract, Transform and Load (ETL) processing and Machine
experience using Apache Hadoop, MapReduce, Apache Spark, Spark SQL, Learning model training and deployment with Apache Spark.
and Kubernetes.

Skills you'll gain

Cloud Database Mongodb Cassandra NoSQL Cloudant Machine Learning Machine Learning Pipelines Data Engineer SparkML

Apache Spark Big Data SparkSQL Apache Hadoop

Details to know
Shareable certificate Recently updated!
Add to your LinkedIn profile August 2023

Enroll
Starts Dec 21

See how employees at top companies are mastering


in-demand skills
Learn more about Coursera for Business

Advance your subject-matter expertise


Learn in-demand skills from university and industry experts
Master a subject or tool with hands-on projects

Develop a deep understanding of key concepts


Earn a career certificate from IBM

Earn a career certificate


Add this credential to your LinkedIn profile, resume, or CV
Share it on social media and in your performance review

Specialization - 3 course series


Big Data Engineers and professionals with NoSQL skills are highly sought after in the data management industry. This Specialization is designed
for those seeking to develop fundamental skills for working with Big Data, Apache Spark, and NoSQL databases. Three information-packed
courses cover popular NoSQL databases like MongoDB and Apache Cassandra, the widely used Apache Hadoop ecosystem of Big Data tools, as
well as Apache Spark analytics engine for large-scale data processing.

You start with an overview of various categories of NoSQL (Not only SQL) data repositories, and then work hands-on with several of them including
IBM Cloudant, MonogoDB and Cassandra. You’ll perform various data management tasks, such as creating & replicating databases, inserting,
updating, deleting, querying, indexing, aggregating & sharding data. Next, you’ll gain fundamental knowledge of Big Data technologies such as
Hadoop, MapReduce, HDFS, Hive, and HBase, followed by a more in depth working knowledge of Apache Spark, Spark Dataframes, Spark SQL,
PySpark, the Spark Application UI, and scaling Spark with Kubernetes. In the final course, you will learn to work with Spark Structured Streaming
Spark ML - for performing Extract, Transform and Load processing (ETL) and machine learning tasks.

This specialization is suitable for beginners in the fields of NoSQL and Big Data – whether you are or preparing to be a Data Engineer, Software
Developer, IT Architect, Data Scientist, or IT Manager.

Applied Learning Project

The emphasis in this specialization is on learning by doing. As such, each course includes hands-on labs to practice & apply the NoSQL and Big
Data skills you learn during lectures.

In the first course, you will work hands-on with several NoSQL databases- MongoDB, Apache Cassandra, and IBM Cloudant to perform a variety of
tasks: creating the database, adding documents, querying data, utilizing the HTTP API, performing Create, Read, Update & Delete (CRUD)
operations, limiting & sorting records, indexing, aggregation, replication, using CQL shell, keyspace operations, & other table operations.

In the next course, you’ll launch a Hadoop cluster using Docker and run Map Reduce jobs. You’ll
explore working with Spark using Jupyter notebooks on a Python kernel. You’ll build your Spark skills using DataFrames, Spark SQL, and scale
your jobs using Kubernetes.

In the final course you will use Spark for ETL processing, and Machine Learning model training and deployment using IBM Watson.
Read less

Introduction to NoSQL Databases


Course details
Course 1 • 17 hours • 4.6 (226 ratings)

What you'll learn

Differentiate between the four main categories of NoSQL repositories.


Describe the characteristics, features, benefits, limitations, and applications of the more popular Big Data processing tools.

Perform common tasks using MongoDB tasks including create, read, update, and delete (CRUD) operations.
Execute keyspace, table, and CRUD operations in Cassandra.

Skills you'll gain

Cloud Database Mongodb Cassandra NoSQL Cloudant

Introduction to Big Data with Spark and Hadoop


Course details
Course 2 • 18 hours • 4.4 (289 ratings)

What you'll learn

Explain the impact of big data, including use cases, tools, and processing methods.

Describe Apache Hadoop architecture, ecosystem, practices, and user-related applications, including Hive, HDFS, HBase, Spark, and MapReduce.
Apply Spark programming basics, including parallel programming basics for DataFrames, data sets, and Spark SQL.
Use Spark’s RDDs and data sets, optimize Spark SQL using Catalyst and Tungsten, and use Spark’s development and runtime environment options.

Skills you'll gain

Big Data SparkSQL SparkML Apache Hadoop Apache Spark

Machine Learning with Apache Spark


Course details
Course 3 • 14 hours • 4.7 (29 ratings)

What you'll learn

Describe ML, explain its role in data engineering, summarize generative AI, discuss Spark's uses, and analyze ML pipelines and model persistence.
Evaluate ML models, distinguish between regression, classification, and clustering models, and compare data engineering pipelines with ML
pipelines.
Construct the data analysis processes using Spark SQL, and perform regression, classification, and clustering using SparkML.
Demonstrate connecting to Spark clusters, build ML pipelines, perform feature extraction and transformation, and model persistence.

Skills you'll gain

Machine Learning Machine Learning Pipelines Data Engineer SparkML Apache Spark

Instructors

IBM Skills Network Team


IBM
49 Courses • 655,942 learners

View all 7 instructors

Offered by

IBM
Learn more

Why people choose Coursera for their career

Felipe M. Jennifer J.
Learner since 2018 Learner since 2020

"To be able to take courses at my own pace and rhythm has been "I directly applied the concepts and skills I learned from my
an amazing experience. I can learn whenever it fits my schedule courses to an exciting new project at work."
and mood."

● ○

New to Data Management? Start here.


How Much Do Network 7 Cybersecurity Trends in 2024 What Is Access Control? Cloud Data Security in 2024:
Engineers Make? 2024 Salary Dangers, Safeguards, and More
Guide
December 14, 2023 November 29, 2023 December 14, 2023
December 11, 2023 Article · 6 min read Article · 2 min read Article · 8 min read
Article

Open new doors with Coursera Plus


Unlimited access to 7,000+ world-class courses, hands-on projects, and job-ready
certificate programs - all included in your subscription

Learn more

Advance your career with an online degree


Earn a degree from world-class universities - 100% online

Explore degrees

Join over 3,400 global companies that choose Coursera for Business
Upskill your employees to excel in the digital economy
Learn more

Frequently asked questions

How long does it take to complete the Specialization?

What background knowledge is necessary?

Do I need to take the courses in a specific order?

Show all 10 frequently asked questions

More questions
Visit the learner help center

Coursera Community
About Learners
What We Offer Partners
Leadership Beta Testers
Careers Translators
Catalog Blog
Coursera Plus The Coursera Podcast
Professional Certificates Tech Blog
MasterTrack® Certificates Teaching Center
Degrees
For Enterprise
For Government
For Campus
Become a Partner
Coronavirus Response
Social Impact
More
Press
Investors
Terms
Privacy
Help
Accessibility
Contact
Articles
Directory
Affiliates
Modern Slavery Statement
Manage Cookie Preferences

Learn Anywhere

Follow Us

© 2023 Coursera Inc. All rights reserved.

You might also like

pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy