0% found this document useful (0 votes)

12 views4 pages

syllabus

The document outlines the curriculum for a Big Data Analytics course, detailing course objectives, teaching methods, and assessment criteria. It covers key topics including Hadoop, MapReduce, MongoDB, Hive, Pig, and Spark, along with practical experiments for hands-on learning. The course aims to equip students with skills to analyze big data and implement various data processing tools and techniques.

Uploaded by

PRADEEP NAZARETH

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

12 views4 pages

syllabus

Uploaded by

PRADEEP NAZARETH

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 4

BIG DATA ANALYTICS Semester 6

Course Code BAD601 CIE Marks 50

Teaching Hours/Week (L:T:P: S) 3:0:2:0 SEE Marks 50
Total Hours of Pedagogy 40 hours Theory + 8-10 Lab slots Total Marks 100
Credits 04 Exam Hours 3
Examination nature (SEE) Theory/practical
Course objectives:
1. To implement MapReduce programs for processing big data.
2. To realize storage and processing of big data using MongoDB, Pig, Hive and Spark.
3. To analyze big data using machine learning techniques.

Teaching-Learning Process (General Instructions)

These are sample Strategies; that teachers can use to accelerate the attainment of the various course outcomes.
1. Lecturer method (L) needs not to be only a traditional lecture method, but alternative effective teaching
methods could be adopted to attain the outcomes.
2. Use of Video/Animation to explain functioning of various concepts.
3. Encourage collaborative (Group Learning) Learning in the class.
4. Ask at least three HOT (Higher order Thinking) questions in the class, which promotes critical thinking.
5. Discuss how every concept can be applied to the real world - and when that's possible, it helps improve the
students' understanding.
6. Use any of these methods: Chalk and board, Active Learning, Case Studies.
MODULE-1
Classification of data, Characteristics, Evolution and definition of Big data, What is Big data, Why Big data,
Traditional Business Intelligence Vs Big Data,Typical data warehouse and Hadoop environment.
Big Data Analytics: What is Big data Analytics, Classification of Analytics, Importance of Big Data
Analytics, Technologies used in Big data Environments, Few Top Analytical Tools , NoSQL, Hadoop.

TB1: Ch 1: 1.1, Ch2: 2.1-2.5,2.7,2.9-2.11, Ch3: 3.2,3.5,3.8,3.12, Ch4: 4.1,4.2

MODULE-2
Introduction to Hadoop: Introducing hadoop, Why hadoop, Why not RDBMS, RDBMS Vs Hadoop, History
of Hadoop, Hadoop overview, Use case of Hadoop, HDFS (Hadoop Distributed File System),Processing data
with Hadoop, Managing resources and applications with Hadoop YARN(Yet Another Resource Negotiator).
Introduction to Map Reduce Programming: Introduction, Mapper, Reducer, Combiner, Partitioner,
Searching, Sorting, Compression.

TB1: Ch 5: 5.1-,5.8, 5.10-5.12, Ch 8: 8.1 - 8.8

MODULE-3
Introduction to MongoDB: What is MongoDB, Why MongoDB, Terms used in RDBMS and MongoDB, Data
Types in MongoDB, MongoDB Query Language.

TB1: Ch 6: 6.1-6.5
MODULE-4
Introduction to Hive: What is Hive, Hive Architecture, Hive data types, Hive file formats, Hive Query
Language (HQL), RC File implementation, User Defined Function (UDF).
Introduction to Pig: What is Pig, Anatomy of Pig, Pig on Hadoop, Pig Philosophy, Use case for Pig, Pig Latin
Overview, Data types in Pig, Running Pig, Execution Modes of Pig, HDFS Commands, Relational Operators,
Eval Function, Complex Data Types, Piggy Bank, User Defined Function, Pig Vs Hive.

TB1: Ch 9: 9.1-9.6,9.8, Ch 10: 10.1 - 10.15, 10.22

MODULE-5
Spark and Big Data Analytics: Spark, Introduction to Data Analysis with Spark.

1
Text, Web Content and Link Analytics: Introduction, Text Mining, Web Mining, Web Content and Web
Usage Analytics, Page Rank, Structure of Web and Analyzing a Web Graph.
TB2: Ch5: 5.2,5.3, Ch 9: 9.1-9.4

PRACTICAL COMPONENT OF IPCC

Sl.NO Experiments (Java/Python/R)
1 Install Hadoop and Implement the following file management tasks in Hadoop:
Adding files and directories
Retrieving files
Deleting files and directories.
Hint: A typical Hadoop workflow creates data files (such as log files) elsewhere and copies them into
HDFS using one of the above command line utilities.
2 Develop a MapReduce program to implement Matrix Multiplication
3 Develop a Map Reduce program that mines weather data and displays appropriate messages indicating
the weather conditions of the day.
4 Develop a MapReduce program to find the tags associated with each movie by analyzing movie lens
data.
5 Implement Functions: Count – Sort – Limit – Skip – Aggregate using MongoDB
6
Develop Pig Latin scripts to sort, group, join, project, and filter the data.
7 Use Hive to create, alter, and drop databases, tables, views, functions, and indexes.
8 Implement a word count program in Hadoop and Spark.
9 Use CDH (Cloudera Distribution for Hadoop) and HUE (Hadoop User Interface) to analyze data and
generate reports for sample datasets
Course outcomes (Course Skill Set):
At the end of the course, the student will be able to:

1. Identify and list various Big Data concepts, tools and applications.
2. Develop programs using HADOOP framework.
3. Make use of Hadoop Cluster to deploy Map Reduce jobs, PIG, HIVE and Spark programs.
4. Analyze the given data set and identify deep insights from the data set.
5. Demonstrate Text, Web Content and Link Analytics.

Assessment Details (both CIE and SEE)

The weightage of Continuous Internal Evaluation (CIE) is 50% and for Semester End Exam (SEE) is 50%.
The minimum passing mark for the CIE is 40% of the maximum marks (20 marks out of 50) and for the
SEE minimum passing mark is 35% of the maximum marks (18 out of 50 marks). A student shall be
deemed to have satisfied the academic requirements and earned the credits allotted to each subject/
course if the student secures a minimum of 40% (40 marks out of 100) in the sum total of the CIE
(Continuous Internal Evaluation) and SEE (Semester End Examination) taken together.

CIE for the theory component of the IPCC (maximum marks 50)
● IPCC means practical portion integrated with the theory of the course.
● CIE marks for the theory component are 25 marks and that for the practical component is 25
marks.
● 25 marks for the theory component are split into 15 marks for two Internal Assessment Tests (Two
Tests, each of 15 Marks with 01-hour duration, are to be conducted) and 10 marks for other

2
assessment methods mentioned in 22OB4.2. The first test at the end of 40-50% coverage of the
syllabus and the second test after covering 85-90% of the syllabus.
● Scaled-down marks of the sum of two tests and other assessment methods will be CIE marks for the
theory component of IPCC (that is for 25 marks).
● The student has to secure 40% of 25 marks to qualify in the CIE of the theory component of IPCC.
CIE for the practical component of the IPCC
● 15 marks for the conduction of the experiment and preparation of laboratory record, and 10 marks
for the test to be conducted after the completion of all the laboratory sessions.
● On completion of every experiment/program in the laboratory, the students shall be evaluated
including viva-voce and marks shall be awarded on the same day.
● The CIE marks awarded in the case of the Practical component shall be based on the continuous
evaluation of the laboratory report. Each experiment report can be evaluated for 10 marks. Marks of
all experiments’ write-ups are added and scaled down to 15 marks.
● The laboratory test (duration 02/03 hours) after completion of all the experiments shall be
conducted for 50 marks and scaled down to 10 marks.
● Scaled-down marks of write-up evaluations and tests added will be CIE marks for the laboratory
component of IPCC for 25 marks.
● The student has to secure 40% of 25 marks to qualify in the CIE of the practical component of the IPCC.
SEE for IPCC
Theory SEE will be conducted by University as per the scheduled timetable, with common question
papers for the course (duration 03 hours)
1. The question paper will have ten questions. Each question is set for 20 marks.
2. There will be 2 questions from each module. Each of the two questions under a module (with a
maximum of 3 sub-questions), should have a mix of topics under that module.
3. The students have to answer 5 full questions, selecting one full question from each module.
4. Marks scored by the student shall be proportionally scaled down to 50 Marks
The theory portion of the IPCC shall be for both CIE and SEE, whereas the practical portion will have
a CIE component only. Questions mentioned in the SEE paper may include questions from the
practical component.
Suggested Learning Resources:
Books:
1. Seema Acharya and Subhashini Chellappan “Big data and Analytics” Wiley India Publishers, 2nd Edition,
2019.
2. Rajkamal and Preeti Saxena, “Big Data Analytics, Introduction to Hadoop, Spark and Machine Learning”,
McGraw Hill Publication, 2019.
Reference Books:
1. Adam Shook and Donald Mine, “MapReduce Design Patterns: Building Effective Algorithms and Analytics for
Hadoop and Other Systems” - O'Reilly 2012
2. Tom White, “Hadoop: The Definitive Guide” 4th Edition, O’reilly Media, 2015.
3. Thomas Erl, Wajid Khattak, and Paul Buhler, Big Data Fundamentals: Concepts, Drivers & Techniques,
Pearson India Education Service Pvt. Ltd., 1st Edition, 2016
4. John D. Kelleher, Brian Mac Namee, Aoife D'Arcy -Fundamentals of Machine Learning for Predictive Data
Analytics: Algorithms, Worked Examples, MIT Press 2020, 2nd Edition

3
Web links and Video Lectures (e-Resources):
● https://www.kaggle.com/datasets/grouplens/movielens-20m-dataset
● https://www.youtube.com/watch?v=bAyrObl7TYE&list=PLEiEAq2VkUUJqp1k-g5W1mo37urJQOdCZ
● https://www.youtube.com/watch?v=VmO0QgPCbZY&list=PLEiEAq2VkUUJqp1kg5W1mo37urJQOdCZ&in
dex=4
● https://www.youtube.com/watch?v=GG-VRm6XnNk https://www.youtube.com/watch?v=JglO2Nv_92A

Activity Based Learning (Suggested Activities in Class)/ Practical Based learning

1. Implement MongoDB based application to store big data for data processing and analyzing the results [10
marks]

Syllabus BCS714D-Big Data Analytics
50% (2)
Syllabus BCS714D-Big Data Analytics
3 pages
Bad601 Lab
No ratings yet
Bad601 Lab
32 pages
7aimlsyll
No ratings yet
7aimlsyll
11 pages
Big Data Analytics- sem 7 CVMU
No ratings yet
Big Data Analytics- sem 7 CVMU
4 pages
AI Lab Manual (1)
No ratings yet
AI Lab Manual (1)
36 pages
bda 1
No ratings yet
bda 1
95 pages
bda syllb
No ratings yet
bda syllb
4 pages
mldap
No ratings yet
mldap
6 pages
Experiment Pgno
No ratings yet
Experiment Pgno
50 pages
Syallaus 6 Final
No ratings yet
Syallaus 6 Final
16 pages
Circular-21IS643-Data-Mining-Data-Wearhouse-Textbook-Preference-updated
No ratings yet
Circular-21IS643-Data-Mining-Data-Wearhouse-Textbook-Preference-updated
4 pages
6th sem AIDS syllabus 2022 scheme
No ratings yet
6th sem AIDS syllabus 2022 scheme
52 pages
SYCS Minor Syllabus
No ratings yet
SYCS Minor Syllabus
12 pages
Mrcet R20 Iv 1 QB
No ratings yet
Mrcet R20 Iv 1 QB
79 pages
BE-AIDS-R-20-VII-VIII-Sem-Syllabus_compressed
No ratings yet
BE-AIDS-R-20-VII-VIII-Sem-Syllabus_compressed
55 pages
BigDataSYLLABUS
No ratings yet
BigDataSYLLABUS
4 pages
7th Cssyll
No ratings yet
7th Cssyll
49 pages
MCA 3rd semester Big Data Analytics syllabus
No ratings yet
MCA 3rd semester Big Data Analytics syllabus
15 pages
Notes
No ratings yet
Notes
11 pages
Cyber Security and IT Laws
No ratings yet
Cyber Security and IT Laws
25 pages
IV Yr II Sem Lesson Plans
No ratings yet
IV Yr II Sem Lesson Plans
19 pages
AIADS 7th Sem Syllabus Signed
No ratings yet
AIADS 7th Sem Syllabus Signed
19 pages
7cseaimlsyll
No ratings yet
7cseaimlsyll
11 pages
7th sem syallbus copy
No ratings yet
7th sem syallbus copy
10 pages
7th sem syllabus
No ratings yet
7th sem syllabus
46 pages
ZNRM 55 A9 PRB Ik 9 BI9 Steu MNXMQ DM J45 Aga KNM RFG
No ratings yet
ZNRM 55 A9 PRB Ik 9 BI9 Steu MNXMQ DM J45 Aga KNM RFG
9 pages
Big Data Analytics(r18a0529)
No ratings yet
Big Data Analytics(r18a0529)
139 pages
Mensuration Maths Formula in Hindi 42somp
100% (1)
Mensuration Maths Formula in Hindi 42somp
21 pages
BD Course Handout (Spring 2024)
No ratings yet
BD Course Handout (Spring 2024)
4 pages
7csbssyll
No ratings yet
7csbssyll
11 pages
@vtucode - in 18CS72 Previous Year Paper
No ratings yet
@vtucode - in 18CS72 Previous Year Paper
2 pages
Syllabus
No ratings yet
Syllabus
2 pages
Assignment 2
No ratings yet
Assignment 2
2 pages
Big Data Syllabus For Theory and Lab
No ratings yet
Big Data Syllabus For Theory and Lab
4 pages
BDA Syllabus
No ratings yet
BDA Syllabus
4 pages
Big Data Analytics Comp Syllabus Sem7
No ratings yet
Big Data Analytics Comp Syllabus Sem7
4 pages
Big Data Analytics Syllabus
No ratings yet
Big Data Analytics Syllabus
2 pages
Hadoop Plan
No ratings yet
Hadoop Plan
9 pages
Bcs403 Updated Dbms-4 Sem-hod
No ratings yet
Bcs403 Updated Dbms-4 Sem-hod
31 pages
DBMS Manual
No ratings yet
DBMS Manual
30 pages
3170722_BDA_Lab Manual(1)
No ratings yet
3170722_BDA_Lab Manual(1)
78 pages
Other Words for Home PDF
No ratings yet
Other Words for Home PDF
24 pages
mca2syll
No ratings yet
mca2syll
27 pages
7aidssyll
No ratings yet
7aidssyll
12 pages
6th sem DS syllabus 2022 scheme
No ratings yet
6th sem DS syllabus 2022 scheme
54 pages
Bda Lab
No ratings yet
Bda Lab
47 pages
Co-Po Big Data Analytics
100% (1)
Co-Po Big Data Analytics
41 pages
Lab Manual Big Data Analytics Lab (LC-CSE-410G) : Department of Computer Science and Engineering
No ratings yet
Lab Manual Big Data Analytics Lab (LC-CSE-410G) : Department of Computer Science and Engineering
28 pages
BDA Syllabus Final
No ratings yet
BDA Syllabus Final
3 pages
CCS334 UPDATED 05-05-2025
No ratings yet
CCS334 UPDATED 05-05-2025
19 pages
Big Data Analystics
No ratings yet
Big Data Analystics
4 pages
Appendix-74
No ratings yet
Appendix-74
42 pages
AN13879_inversión de motor
No ratings yet
AN13879_inversión de motor
64 pages
2CS702-CPD-Odd 23 24
No ratings yet
2CS702-CPD-Odd 23 24
9 pages
BDAA
No ratings yet
BDAA
4 pages
Introduction To Big data-21CS753-syllabus
No ratings yet
Introduction To Big data-21CS753-syllabus
3 pages
CC ZG522 Course Handout
No ratings yet
CC ZG522 Course Handout
6 pages
CCS334 BDA Syllabus
No ratings yet
CCS334 BDA Syllabus
5 pages
Principles of Thermal Spray
100% (1)
Principles of Thermal Spray
110 pages
Requiem Mass Program - 5th Nov
No ratings yet
Requiem Mass Program - 5th Nov
5 pages
2171607
No ratings yet
2171607
3 pages
Bite411l Big-data-Analytics TH 1.0 73 Bite411l 67 Acp
No ratings yet
Bite411l Big-data-Analytics TH 1.0 73 Bite411l 67 Acp
2 pages
SiCr stainless-steel-1.4762
No ratings yet
SiCr stainless-steel-1.4762
3 pages
STM 001 Reviewer 1
No ratings yet
STM 001 Reviewer 1
10 pages
Comprehensive Guide To Network Unlock Codes For Samsung
No ratings yet
Comprehensive Guide To Network Unlock Codes For Samsung
14 pages
Metacognitive Awareness Inventory MAI
100% (1)
Metacognitive Awareness Inventory MAI
4 pages
Brazil Bans Phones Lesson
No ratings yet
Brazil Bans Phones Lesson
2 pages
Micro - Dosing For Beginners
No ratings yet
Micro - Dosing For Beginners
10 pages
Kesehatan Tanah
No ratings yet
Kesehatan Tanah
134 pages
Analysis VMGO
No ratings yet
Analysis VMGO
2 pages
Confidence Building
No ratings yet
Confidence Building
4 pages
AS Practice PP
No ratings yet
AS Practice PP
12 pages
Research Paper
No ratings yet
Research Paper
16 pages
An R-Curve Assessment of Stable Crack Growth in An Aluminium Alloy
No ratings yet
An R-Curve Assessment of Stable Crack Growth in An Aluminium Alloy
15 pages
1st Assignment (BMAT (SMEC) 101)
No ratings yet
1st Assignment (BMAT (SMEC) 101)
2 pages
Using A Dichotomous Classification Key To Identify Common Freshwater Fish of New York State
No ratings yet
Using A Dichotomous Classification Key To Identify Common Freshwater Fish of New York State
14 pages
TOS 4th Quarter Periodical Test
No ratings yet
TOS 4th Quarter Periodical Test
5 pages
Lab Mnual-Activity-SE-Plant-Cell-and-Animal-Cell
No ratings yet
Lab Mnual-Activity-SE-Plant-Cell-and-Animal-Cell
7 pages
Asf020 - HTTPD
No ratings yet
Asf020 - HTTPD
15 pages
Untitled
No ratings yet
Untitled
4 pages
45 16255 EE543 2015 1 1 1 Load Calculation Egyptian Code English
No ratings yet
45 16255 EE543 2015 1 1 1 Load Calculation Egyptian Code English
1 page
QNET DC Motor Quick Start Guide
No ratings yet
QNET DC Motor Quick Start Guide
4 pages
International Conference On Superconductivity and Magnetism 2008 (ICSM 2008)
No ratings yet
International Conference On Superconductivity and Magnetism 2008 (ICSM 2008)
5 pages
Abstract For Health Monitoring System
No ratings yet
Abstract For Health Monitoring System
2 pages
Calculation Sheet
No ratings yet
Calculation Sheet
2 pages
KT 3 Ngu Am Hoc-Thuc
No ratings yet
KT 3 Ngu Am Hoc-Thuc
10 pages
Electro-Magneto-Mechanics: Dr. Kevin Craig Professor of Mechanical Engineering Rensselaer Polytechnic Institute
No ratings yet
Electro-Magneto-Mechanics: Dr. Kevin Craig Professor of Mechanical Engineering Rensselaer Polytechnic Institute
0 pages
IGNOU MCA Data Science and Big Data Previous Years Unsolved Papers MCS 226
From Everand
IGNOU MCA Data Science and Big Data Previous Years Unsolved Papers MCS 226
Manish Soni
No ratings yet
DevOps Foundation Courseware - English
From Everand
DevOps Foundation Courseware - English
Oleg Skrynnik
No ratings yet
Agile Foundation Courseware – English
From Everand
Agile Foundation Courseware – English
Nader Rad
No ratings yet

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.

syllabus

Uploaded by

syllabus

Uploaded by

BIG DATA ANALYTICS Semester 6

Course Code BAD601 CIE Marks 50

Teaching-Learning Process (General Instructions)

TB1: Ch 1: 1.1, Ch2: 2.1-2.5,2.7,2.9-2.11, Ch3: 3.2,3.5,3.8,3.12, Ch4: 4.1,4.2

TB1: Ch 5: 5.1-,5.8, 5.10-5.12, Ch 8: 8.1 - 8.8

TB1: Ch 9: 9.1-9.6,9.8, Ch 10: 10.1 - 10.15, 10.22

PRACTICAL COMPONENT OF IPCC

Assessment Details (both CIE and SEE)

Activity Based Learning (Suggested Activities in Class)/ Practical Based learning

You might also like

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.