Big Data
Big Data
Big Data
University Core
University Elective
Programme Core
Total Credits: 6
Programme Elective
Component Percentage
Paper Presentation 25%
Guide 50%
SET Evaluation 25%
Committee
Course Evaluation
Component Percentage
Mid Term Exam
Other Internal Assessment 55%
(Assignments/Projects/Seminars)
Term End Exam 45%
CSE503 Advanced Algorithmic Analysis L T P C
3 0 2 4
Version No. : 2.00
Course
Prerequisites:
Objective
To focus on design and analysis of algorithms in various domains that lays foundations for
designing efficient algorithms.
Expected Outcome
On completion of this course the student would be able to
- Apply the algorithms and design techniques to solve problems
- Have a sense of the complexities of various problems in different domains.
Unit No. I Introduction 9 hours
Overview of algorithmic design, asymptotic notation and its properties, Growth of Functions,
Time complexity and Analysis of algorithms, Recurrence Relations, Amortized analysis.
Unit No. II Linear Programming 9 hours
Geometry, Farkas' Lemma, Strong Duality, Complexity, Interior-point Algorithms, Ellipsoid
Algorithm and Optimization vs. Separation, Extension to Conic Programming.
Course
Prerequisites : -
Objective
To focus towards the various design options in the area of architecture that lays platform to
develop and analyze high performance applications.
Expected Outcome
On completion of this course the student would be able to
- Identify the need for multi-core architecture for specific applications by developing a suitable
complexity measure.
- Identify needs for homogeneous or heterogeneous multi-core architectures for a given
application.
- Develop methods to partition a given application program to run on a multi-core processor
- Use the Intel multi-core architecture for develop high performance code
- Optimize code using appropriate techniques.
Objective
To provide the fundamental principles of modern operating systems that explores design aspects
of modern operating systems.
Expected Outcome
On completion of this course the student should be able to understand and evaluate operating
system implementations, Develop system software modules, Write and debug concurrent
programs, Debug complex systems and low-level software and Work with distributed and real
time OS.
Course
Prerequisites
Objective
To expose the students to the latest industry relevant topics in modern database management
systems.
Expected Outcome
To enable the students to design their own parallel and distributed databases and to expose to
the various warehousing tools.
Unit No. I Database Design And Tuning 6 hours
Introduction to physical database design – Guideline for index selection − Overview of
database tuning – Conceptual schema tuning – Queries and view tuning, Limitations of RDBMS,
Query Optimization, NoSQL, transaction model.
Unit No. II Parallel And Distributed Database 12 hours
Parallel database systems: Architecture of parallel databases, Parallel query evaluation,
parallelizing joins and parallel − query optimization. Distributed database systems: Distributed
database architecture, Properties of distributed database, Types of distributed database, storing
data in a distributed DBMS, distributed query processing, Database Concurrency control
protocols. Transaction failure and Recovery, Database recovery protocol.
Unit No. III Deductive Databases 9 hours
Introduction, Prolog/datalog notation, Interpretation of rules, Basic inference mechanisms for
logic programs, Datalog programs and their evaluation, deductive database system, deductive
object oriented databases, applications.
Unit No. IV Data Warehousing 9 hours
Data warehousing: Characteristics of Data warehouse, Data preprocessing, Data warehouse
architecture, Multi dimensional data model, Schema design, OLAP Operation and Data mart,
Concepts of Data mining.
Unit No. V Database Technologies Use Case 9 hours
Object Database Systems, Multimedia databases, Mobile databases, Spatial Database, Temporal
database, Data bases on the World Wide Web, Geographic Information system, Genome data
management, Digital Libraries.
Text / Reference Books
1. Raghu Ramakrishnan and Johannes Gehrke, “Database Management Systems”, 3rd Edition,
McGraw Hill, 2007.
2. S.K.Singh, “Database Systems: Concepts, Design & Applications”, 1st edition, Prentice Hall,
2009.
3. Ramez Elmasri and B.Navathe, “Fundamentals of database systems”, 4th edition, Addison
Wesley, 2008.
4. Jiawei Han and Micheline Kamber, “Data Mining Concepts and Techniques”, 2nd edition,
Morgan Kaufmann publishers, 2011.
5. Gerhard Weikum, Gottfried Vossen, “Transactional Information Systems: Theory,
Algorithms and Practice of Concurrency Control and Recovery, Morgan Koufmann, 2002.
Course :
Prerequisites
Objective
To provide the basic concepts that underlay all graphics applications like computer games,
movies, medicine and information visualization and that will serve the foundations into the areas
of high quality image generation and interactive graphics.
Expected Outcome
On completion of this course the student would be able to deal latest concepts introduced in 3D
cards architecture.
Unit No. I Introduction 9 hours
Overview, Modeling, Procedural Models, Fractal Models, and Grammar based models, particle
systems, and viewing, Rasterization and Ray tracing
Course :
Prerequisites
Objective
To go beyond the basic level of understanding that is typically offered at an undergraduate
networking course.
Expected Outcome
On completion of course students will be able to understand the fundamental concepts in
routing and addressing, transport protocols and congestion control, emerging distributed
applications, and wireless networking.
Unit No. I Networking Standards And Specification 9 hours
Networking standards and specifications, Need for standardization, ISO and the IEEE
standards, The IEEE 802 Project
Unit No. II Overview of OSI and TCP/IP Protocol Suite 9 hours
Layers in the OSI model, TCP/IP protocol suite, Physical layer addressing, Network layer
addressing, Client-Server model.
Unit No. III Addressing And Routing 9 hours
IP Addresses: Classful addressing, Subnetting/Supernetting, Classless Addressing, Delivery and
routing of IP packets, Interior and Exterior routing.
Unit No. IV TCP/IP Protocol Suite 9 hours
Socket Interface, Internet Protocol (IP), ICMP and ARP, Transport Layer Protocols − TCP and
UDP, Congestion control and Quality of Service, File Transfer protocols − FTP and TFTP,
SMTP, SNMP, BOOTP and DHCP, Domain Name System, Mobile IP. Routing protocols −
RIP, OSPF, BGP.
Unit No. V Ad Hoc Wireless Networks 9 hours
Cellular and Ad hoc wireless networks, Applications of Ad hoc wireless networks, issues in ad
hoc wireless networks, issues in designing a routing protocol for ad hoc wireless networks,
Classification of routing protocols, Security in ad hoc wireless networks.
Recommended by the
Board of Studies on
Date of Approval by
the Academic
Council
CSE609 Information Retrieval and Data Mining L T P C
3 0 2 4
Objective
To provide the fundamentals on information retrieval and data mining techniques and focus on
practical algorithms of textual document indexing, relevance ranking, web usage mining, text
analytics, as well as their performance evaluations, that lays foundations for the Data Analytics.
Expected Outcome
On completion of this course students are expected to master both the theoretical and practical
aspects of information retrieval and data mining. More specifically, the student will understand:
1. The basic concepts and processes of information retrieval systems and data mining techniques.
2. The common algorithms and techniques for information retrieval.
3. The quantitative evaluation methods for the IR systems and data mining techniques.
4. The popular probabilistic retrieval methods and ranking principle.
Objectives
This course provides a broad introduction to big data at a foundation level with a focus on big
data technology and tools, including MapReduce and Hadoop. That serves foundation for the
advanced studies in the area of Big Data Analytics.
Expected Outcome
On completion of this course, the students will be able to handle huge volumes of data untapped
by the BI programs. They come to know about the Analytics Life Cycle. They get knowledge of
open source software framework that supports the processing of large data sets.
Mode of Evaluation:
Recommended by the Board of Studies on :
Objective:
To introduce how to mimic learning through algorithms. To learn from data in a supervised /
unsupervised manner so as to facilitate decision making.
Expected Outcome
On completion of this course the student would be able to:
Understand the principles, advantages, limitations and possible applications of machine
learning.
Identify and apply the appropriate machine learning technique to classification, pattern
recognition, optimization and decision making.
Objective
To focus beyond the basic level of understanding that is typically offered at Fundamentals of Big
Data Analytics course.
It focuses on concepts, principles, and techniques applicable to any technology environment and
industry. It establishes a baseline that can be enhanced by further formal training and additional
real-world experience in the Big Data Analytics.
Expected Outcome
On completion of this course the student would be able to:
Objectives
To focus on the state-of-the-art in various areas of data visualization, such as data types, chart
types, visual variables, visualization techniques, color theory, and data patterns.
Expected Outcome
On completion of this course the student would be able to:
Understand the principles of creating and evaluating effective data visualizations.
To use software tools to create various data visualizations.
Familiar with the visualization techniques in major application areas.
Acquire the skill to apply visualization techniques to a problem and associated data set.
Objectives
To focus on Data Management to handle large-scale data arising from the Internet and
Enterprise -based applications.
Expected Outcome
On completion of this course the student would be able to develop solutions for both building
data-intensive scalable applications over the Internet/Web as well as for large-scale data
analytics.