M.tech DS ML
M.tech DS ML
Name of the Programme Master of Technology in Data Science & Machine Learning
Semester I
Programme 2 years
Duration
Effective from Academic 2022- Effective for the batch Admitted in 2022
23
Year
Teaching scheme Examination scheme (Marks)
Code Subject
Name Credit Hours (per week) Theory Practical
Lecture Practical Lecture Practical Int Ext Total Int Ext Total
(Lab.)
L Tu Total P Total L Tu Total P Total
Programme Structure
Semester I
Total Credits 25
Theory 20
Practical 5
Total Marks of
Entire Programme
Syllabus
Semester-I
RASHTRIYA RAKSHA UNIVERSITY
An Institution of National Importance
Lavad, Dehgam, Gandhinagar-382305, Gujarat, India
Hours 03 00 02 05 Practical 15 35 50
Content:
Unit Subject Content Hrs
1 Understanding the need for computational complexity analysis: Asymptotic Notations, Proof of 05
correctness, Performance analysis, Recursive Algorithms and Recurrences.
Divide and Conquer: Implementing rules, Binary search, Merge Sort, Quick Sort, 12
Convex Hull, Matrix multiplication
5 Dynamic Programming: Introduction of Dynamic Programming, Principle of
05
Optimality, Examples like Single source shortest paths, Knapsack problem, Chained
matrix multiplication, Longest Common Subsequence, etc.
6 Intractable Problems: Polynomial Time, class P, Polynomial Time Verifiable Algorithms, 05
class NP, NP completeness and reducibility, NP Hard Problems, NP completeness proofs,
Approximation Algorithms.
Graph Algorithms: Elementary algorithms, DFS, BFS, Backtracking, and Branch & Bound 05
techniques with related examples
Practical Content:
Readings:
1. Gilles Brassard and Paul Bratley, Fundamentals of Algorithmics, PHI Publication.
2. Thomas H. Cormen, Charles E. Leiserson, Ronald L. Rivest& Clifford Stein, Introduction to
Algorithms, PHI Publication.
3. Ellis Horowitz, Sartaj Sahni, Sanguthevar Rajasekaran, Fundamentals of Computer Algorithms,
University Press
4. Jean-Paul Tremblay and Paul G. Sorenson, An Introduction to Data Structures with Applications,
Tata McGraw Hill
5. Robert L. Kruse, Data Structures and Program Design in C, PHI
Hours 03 02 00 05 Practical 15 35 50
Content:
Unit Subject Content Hrs
1 Review of Linear Algebra: Matrices, Vector Properties, Eigenvalues and Eigenvectors, 08
Distance Measures, Half-planes, Application for Linear Algebra in Computer Science
2 The Role of Statistics in Engineering: The Engineering Method and Statistical Thinking,
Collecting Engineering Data, Mechanistic and Empirical Models, Probability and 04
Probability Models
3 Probability and Statistics: Probability theory and axioms; Random variables; Probability
distributions and density functions (univariate and multivariate), Expectations and moments, 10
Covariance and correlation, Confidence intervals, Correlation functions, Random walks,
Markov process, Statistical inference
4 Descriptive Statistics and Point Estimation of Parameters: Numerical Summaries of
Data, Frequency Distributions and Histograms, Box and Probability Plots, Point 08
Estimation, Sampling Distributions and the Central Limit Theorem, Methods of Point
Estimation
5 Statistical Intervals for a Single Sample: Confidence Interval on the mean of a Normal
Distribution, Confidence Interval on the Variance and Standard Deviation of a Normal
Distribution, Large-Sample Confidence Interval for a Population Proportion, Guidelines 08
for Constructing Confidence Intervals, Tolerance and Prediction Intervals
6 Optimization: Basic Concepts, Linear Programming, Duality, Constrained and
unconstrained optimization, gradient decent and non-gradient techniques, Introduction to 07
least squares optimization, optimization in Practice.
Practical Content:
Reference Books:
1. Gilbert Strang, Linear Algebra and its applications, Harcourt, Brace, Jovanovich Publishers
2. Javier R. Movellan, Introduction to Probability Theory and Statistics
3. Douglas C. Montgomery, George C. Runger, Applied Probability and Statistics for
Engineers, Wiley
4. Introduction to Probability, Bertsekas, Dimitri and John Tsitsiklis, Athena Scientific
5. Jeff M. Phillips, Mathematical Foundations for Data Analysis, e-book, University of Utah
Paneerselvam, Operational Research, PHI
Hours 03 00 02 05 Practical 15 35 50
Content:
Unit Subject Content Hrs
1 Introduction to Python, Object oriented paradigm
Interactive computing with IPython, Jypyter 45
2 Arrays with NumPy
Data structures and formats with Pandas
Scientific functions with Scipy
3 Visualization and plotting with Matplotlib, seaborn, mayavi, PIL
4 Data formats and Analysis with Pandas, Numpy, statsmodels, SymPy, networkx, PyTables
Data Wrangling, Data manipulation dplyr
5 Speeding up code and parallelism using PyMPI, cython, numba
6 Testing with pytest
Documentation with Sphinx
Using Binder
Connecting python and interfacing with other languages
Practical Content:
Reference Books:
1. https://aaltoscicomp.github.io/python-for-scicomp/
2. Jeff M. Phillips, Mathematical Foundations for Data Analysis, e-book, University of Utah
Paneerselvam, Operational Research, PHI
3. Continuum Analytics and Robert Johansson, Introduction to Scientific Computing in Python
Subject Code 1A04MAL Subject Name Machine Learning
Reference Books:
1. Tom Mitchell, Machine Learning, TMH
2. C. Bishop, Pattern Recognition and Machine Learning, Springer
3. Kishan Mehrotra, Chilukuri Mohan and Sanjay Ranka, Elements of Artificial Neural
Networks, Penram International
4. Rajjan Shinghal, Pattern Recognition, Techniques and Applications, OXFORD
5. Andrew Kelleher, Adam Kelleher, Applied Machine Learning for Data Scientist and Software
engineers, Addison-Wesley Professional
Hours 03 02 00 05 Tutorial 15 35 50
Content:
Unit Subject Content Hrs
1
Intro to High Performance Computing: Components of a high-performance distributed 08
computing system, Types of parallel programming models, Parallel Programming
Platforms, communication model, architecture, static and dynamic interconnection
networks and their evaluation, topologies, communication costs, routing mechanism,
impact of processor-processor mapping and its techniques.
2 High-throughput computing: Preliminaries, decomposition techniques, task
characteristics, mapping techniques, handling interaction overheads, algorithm models, the 08
Multithreaded DAG Model, Work Optimality and Weak Scaling, Basic Concurrency
Primitives, Data Races and Race Conditions
3 Multicore computing
CUDA Parallelism Model, Foundations of Shared Memory, Thread Cooperation and
Execution Efficiency, Constants memory and events, memory management, Scheduling and 08
Work Distribution, Race conditions, Memory Profiling and optimization
4 Assessing and analyzing application scalability: Scalable Parallel Computing, computing
clusters, Clustering for Massive Parallelism, Computer Clusters and MPP Architectures,
Design Principles of Computer Clusters, Cluster Job and Resource Management, Case 06
Studies of Top Supercomputer Systems
5 Cloud Programming and Software Environments: Features of Cloud and Grid Platforms,
Public and Private cloud infrastructure, compute, GPU-compute and storage cloud, cloud-
based resource management, commercial cloud platforms: Google Cloud, Amazon AWS, 09
Microsoft Azure
6 Fault Tolerance and Security: Failure models, failure detection, algorithms for fault
tolerance, and recovery from failure in distributed systems, Authentication in Distributed
Systems, Distribution of security mechanisms, access control, and security management 06
Practical Content:
Reference Books:
1. Ananth Grama, Anshul Gupta, George Karypis and Vipin Kumar, Introduction to Parallel
Computing (2nd Edition), Pearson, 2003. ISBN-13: 978-0201648652.
2. Georg Hager and Gerhard Wellein, Introduction to High Performance Computing for
Scientists and Engineers (1st Edition), CRC Press, Taylor and Francis Group, 2010. ISBN-
13: 978-1439811924.
3. Parallel Algorithms https://nptel.ac.in/courses/106/103/106103188/
4. Kai Hwang, Geoffrey C. Fox and Jack J. Dongarra, Distributed and Cloud Computing from
Parallel Processing to the Internet of Things, Elsevier
5. Andrew S. Tanenbaum and Maarten van Steen, Distributed Systems: Principles and
Paradigms, Createspace
6. Kenneth P Birman, Guide to Reliable Distributed Systems: Building High-Assurance
Applications and Cloud-Hosted Services, Springer
8 AI in Security: Role of Artificial Intelligence in Security, Need for Early warning and threat 05
detection, Zero Trust based solution using AI, Adversarial Learning basics
Practical Content:
Reference Books:
1. Stallings, W. Cryptography and Network Security. Pearson Education India.
2. Giannotti, F., &Pedreschi, D. (Eds.). Mobility, data mining and privacy: Geographic knowledge discovery.
Springer Science & Business Media.
3. Bygrave, L. A. Data privacy law: an international perspective (Vol. 63). Oxford: Oxford University Press.
4. Scoble, R., Israel, S., &Benioff, M. R.. Age of context: Mobile, sensors, data and the future of privacy.
USA: Patrick Brewster Press.
5. Bendat, J. S., &Piersol, A. G. Random data analysis and measurement procedures.
Hours 01 00 00 01 Practical 15 35 50
Content:
Unit Subject Content Hrs
1 Ethics: Simple utilitarian ethics, concept of informed consent, data ownership, privacy, 05
anonymity, data validity, data forging and fabrication, algorithmic fairness, societal
consequences, code of ethics, Attributions, philosophical frameworks for assessing fairness,
contemporary theories of fairness, research ethics for data science, legal aspects
2 Big Data and Impact: Overview of ethical issues in data-driven organizations, Significance
of big data, neutral data, important concepts and terms, Mosaic data, found data and designed
data
Responsible AI: Biases in Data, Explainability, removing biases in data, data and model 05
privacy
Values and Actions: Articulating values, Turning values into action, ethical decision points,
Aligning values and actions, Methods and tools, Alignment Methodology Framework
3 Current Practices: Findings summary, Opt-in Versus Opt-out, Correlation through
aggregation, Data Ownership, Manifestation of values, Ethical Incoherence, Cultural Values,
03
data ethics for researchers, ethics of data scraping and storage, privacy and surveillance
4 Case Studies 02
5 Cloud Computing Security Challenges: Security policy implementation,
Virtualization Security Management: virtual threats, hypervisor risks, VM Security
Recommendations, VM-Specific Security Techniques, Cloud Access Security
Broker(CASB)
Practical Content:
Reference Books:
1. Mike Loukides, Hilary Mason, DJ Patil, Ethics and Data Science, O'Reilly 2018
2. Davis, Kord, Ethics of Big Data: Balancing risk and innovation, O'Reilly Media, Inc.
3. Craig, Terence, and Mary E. Ludloff, Privacy and big data: The players, regulators,
andstakeholders, O'Reilly Media
4. Richards, Neil M., Jonathan H. King, Big data ethics, Wake Forest L. Rev.
Examination Evaluation Scheme as per Choice Based Credit System (CBCS)
RRU is gearing up for several initiatives towards academic excellence, quality improvement and administrative
reforms. In view of this priority and in-keeping with RRU Vision and Mission; process was already initiated
towards introduction of semester system, grading system and credit system. The above initiatives acquired
further strength with UGC Circular D. O. No. F.1-2/2008(XI Plan) dated March 2009 informing all the
Universities regarding UGC’s new initiatives under the 11th Five Year plan, on speedy and substantive academic
and administrative reforms regarding higher education. Given this background RRU has framed this “RRU
CBCS REGULATION- 2021”. As the RRU has adopted this regulation, the same will have to be implemented
by all the Faculties of RRU for their academic Programmes. The Evaluation scheme shall be follows as per
1. Examination Evaluation:
1.1 Continuous Internal Assessment (CIA)
The performance of a student in each course is evaluated in terms of percentage of marks with a provision
for conversion to grade points. Evaluation for each course shall be done by a continuous internal assessment
(CIA) by the concerned course teacher as well as by an end semester examination and will be consolidated
at the end of the course. The components for continuous internal assessment are as follows.
1.4 Grading:
The RRU adopts absolute grading system wherein the marks are converted to grades, and every
semester result will be declared with semester grade point average (SGPA) and Cumulative Grade
Point Average (CGPA). The CGPA will be calculated every semester, except the first semester.
The grading system is with the following letter grades as given below:
2.1 Grade Point: Grade point is an integer indicating the numerical equivalent of the letter grade.
2.2 Credit Point (P): Credit point is the value obtained by multiplying the grade point (G) by the credit (C):
P = G x C.
2.3 Semester Grade Point Average (SGPA): Semester Grade Point Average (SGPA) is the value obtained
by dividing the sum of credit points (P) earned by a student in various courses taken in a semester by the total
number of credits earned by the student in that semester. SGPA shall be rounded off to two decimal places.
2.4 Cumulative Grade Point Average (CGPA): ‘Cumulative Grade Point Average’ (CGPA) is the value
obtained by dividing the sum of credit points in all the courses earned by a student for the entire programme,
by the total number of credits. CGPA shall be rounded off to two decimal places. CGPA indicates the
comprehensive academic performance of a student in a programme.
An overall letter grade (Cumulative Grade) for the entire programme shall be awarded to a student depending
on his/her CGPA.
2.5 Calculation of semester grade point average (SGPA) and cumulative grade point average (CGPA):
Performance in a semester will be expressed as Semester Grade Point Average (SGPA).
Cumulative performance of all the semesters together will reflect performance in the whole
programme and will be known as Cumulative Grade Point Average (CGPA). Thus, CGPA is the
real indicator of a student’s performance.
The formula for calculation of SGPA and CGPA is given below:
SGPA = (Ʃ Ci Mi) / (Ʃ Ci) CGPA = (Ʃ Ʃ Cni Gni) / (Ʃ Ʃ Cni) Where
Ci - number of credits for the ith course, Gi - grade point obtained in the ith course, Cni - number
of credits of the ith course of the nth semester, Gni - grade points of the ith course of the nth
semester
Refer the following examples for better understanding of CGPA/SGPA.
Example: