Module 2

data science professional elective

Question bank for module 2, 3 and 4

Module 2:
1. Identify the steps to optimize the performance of a Compute Engine instance
for a high-traffic web application. (performance optimization)

Instance Type Selection:

• Choose an instance type (machine type) that meets your application's
requirements in terms of CPU, RAM, and network throughput. Consider using
Compute-Optimized (C2) or Memory-Optimized (M2) instances for CPU- or
memory-intensive workloads, respectively.

Auto Scaling:

• Implement managed instance groups with autoscaling based on traffic load or
CPU utilization. This ensures that your application can handle varying levels of
traffic efficiently without manual intervention.
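The scaling rule above can be sketched as the target-tracking calculation an autoscaler typically applies: resize the group so that average CPU utilization moves back toward the configured target. A minimal illustration, not Google Cloud's actual implementation (the function name and inputs are my own):

```python
import math

def target_replicas(current_replicas: int, current_cpu: float, target_cpu: float) -> int:
    """Target-tracking scaling: if the group averages 90% CPU against a
    60% target, grow it until the average would fall back to the target."""
    if current_cpu <= 0:
        return current_replicas  # no load signal; leave the group alone
    return max(1, math.ceil(current_replicas * current_cpu / target_cpu))

# e.g. 4 instances at 90% CPU with a 60% target scale out to 6
replicas = target_replicas(4, 0.9, 0.6)
```

Rounding up (rather than to the nearest integer) errs on the side of extra capacity, which is the usual choice for a high-traffic web tier.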

Load Balancing:

• Utilize Google Cloud's load balancing services such as HTTP(S) Load Balancing
or Network Load Balancing to distribute traffic across multiple instances. This
improves availability and scalability by directing traffic to healthy instances.

Optimized Disk Performance:

• Use SSD persistent disks for better I/O performance compared to standard
persistent disks. Consider using local SSDs for temporary data or caching, as
they offer higher throughput and lower latency.

Networking Optimization:

• Ensure your Compute Engine instance is in the appropriate network and region
to minimize latency. Use VPC networks and subnets effectively. You can also
optimize network performance by enabling Google Cloud CDN (Content
Delivery Network) to cache content closer to your users.

Monitoring and Logging:

• Use Google Cloud Monitoring and Logging to monitor instance performance
metrics such as CPU utilization, disk I/O, and network traffic. Set up alerts based
on thresholds to proactively address performance issues.

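The threshold alerting described above reduces to a simple check: compare each metric's latest reading against its configured limit. A minimal sketch (metric names and threshold values are illustrative, not Cloud Monitoring's API):

```python
def check_alerts(metrics: dict, thresholds: dict) -> list:
    """Return the names of metrics whose latest reading exceeds
    the configured alert threshold."""
    return [name for name, value in metrics.items()
            if value > thresholds.get(name, float("inf"))]

metrics = {"cpu_utilization": 0.92, "disk_io_mbps": 120.0}
thresholds = {"cpu_utilization": 0.80, "disk_io_mbps": 400.0}
firing = check_alerts(metrics, thresholds)  # only CPU is over its limit
```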
Caching and Content Delivery:

• Implement caching mechanisms such as Google Cloud Memorystore (for Redis)
or other caching solutions to reduce database load and improve response times
for frequently accessed data.
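The caching pattern above boils down to storing hot data with an expiry so stale entries fall back to the database. A minimal in-process sketch standing in for an external cache such as Redis/Memorystore (class and key names are my own):

```python
import time

class TTLCache:
    """Tiny cache with per-entry time-to-live: expired or missing
    keys return None, signaling a fallback to the database."""

    def __init__(self, ttl_seconds: float):
        self.ttl = ttl_seconds
        self._store = {}

    def set(self, key, value):
        self._store[key] = (value, time.monotonic() + self.ttl)

    def get(self, key):
        entry = self._store.get(key)
        if entry is None:
            return None
        value, expires_at = entry
        if time.monotonic() >= expires_at:
            del self._store[key]  # entry expired; evict it
            return None
        return value

cache = TTLCache(ttl_seconds=60)
cache.set("user:42", {"name": "Ada"})
```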

Optimize Application Code:

• Review and optimize your web application's code to reduce latency and
improve efficiency. Consider techniques like asynchronous processing, lazy
loading, and efficient database queries.
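Asynchronous processing, mentioned above, mostly means overlapping I/O waits instead of serializing them. A small sketch with Python's standard `asyncio` (the `fetch` coroutine stands in for a real database or HTTP call):

```python
import asyncio

async def fetch(resource: str) -> str:
    # stand-in for a real I/O call (database query, HTTP request)
    await asyncio.sleep(0.01)
    return f"data:{resource}"

async def handle_request(resources):
    # issue all I/O calls concurrently rather than one after another
    return await asyncio.gather(*(fetch(r) for r in resources))

results = asyncio.run(handle_request(["users", "orders"]))
```

With `gather`, total latency is roughly the slowest call rather than the sum of all calls.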

Security Best Practices:

• Implement security best practices such as firewall rules, IAM roles, and HTTPS
encryption to protect your Compute Engine instance and data.

Regular Performance Testing:

• Conduct regular load testing and performance benchmarking to identify
bottlenecks and optimize system configurations accordingly.
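A basic benchmarking loop, as a sketch of the testing idea above: time repeated calls to a handler and report latency statistics. Real load testing uses dedicated tools; this only illustrates what they measure (the handler here is a placeholder computation):

```python
import statistics
import time

def benchmark(handler, requests: int) -> dict:
    """Call the handler repeatedly and report simple latency stats."""
    latencies = []
    for _ in range(requests):
        start = time.perf_counter()
        handler()
        latencies.append(time.perf_counter() - start)
    return {"mean": statistics.mean(latencies), "max": max(latencies)}

stats = benchmark(lambda: sum(range(1000)), requests=50)
```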

2. How do you set up a Dataflow job using the GCP Console? OR Discuss the steps
to configure and run a Dataflow job from the GCP Console.

Navigate to Dataflow in GCP Console:

• Go to the GCP Console at https://console.cloud.google.com/.
• Select the Navigation menu (three horizontal lines) and navigate
to Dataflow under the Big Data section.

Create a New Dataflow Job:

• Click on the Create job from template button to start configuring a new
Dataflow job.

Configure Job Details:

• Job name: Enter a descriptive name for your Dataflow job.
• Region: Select the region where you want the job to run.
• Dataflow template: Choose the template that matches your job type (e.g.,
batch processing, streaming processing).

Configure Pipeline Options:

• Specify the input sources, output sinks, and any additional parameters required
by your Dataflow job.
• This may include details like input file paths, output file paths, Cloud Storage
buckets, Pub/Sub topics, etc.

Set Dataflow Job Execution Options:

• Define the job execution settings such as worker machine type, number of
workers, autoscaling options, and other performance-related configurations.
• You can also specify additional parameters like max workers, disk size, etc.,
depending on your job requirements.

Review and Launch the Job:

• Review all the configurations to ensure they are correct.
• Click Run job to launch your Dataflow job.

Monitor Job Progress:

• Once the job is launched, you can monitor its progress in the Dataflow section
of the GCP Console.
• You can view details such as job status, throughput, input/output metrics, and
logs to track how your Dataflow job is performing.

View Job Logs and Output:

• After the job completes, you can view detailed logs and review the output
generated by your Dataflow job.
• Logs are accessible from the GCP Console, and output files are typically stored
in the specified Cloud Storage bucket or Pub/Sub topic.

Cleanup (if necessary):

• If you no longer need the resources associated with the Dataflow job, consider
cleaning up by deleting unnecessary resources like temporary files or unused
Cloud Storage buckets.

3. What types of encryption are available in Azure? OR List the types of encryption
supported by Azure, such as data-at-rest encryption and data-in-transit
encryption.

Data-at-Rest Encryption:

• Azure Disk Encryption: Encrypts operating system and data disks used
by Azure Virtual Machines (VMs) to protect sensitive data.
• Azure Storage Service Encryption (SSE): Automatically encrypts data
before persisting it to Azure Storage. SSE supports Blob storage, File
storage, and Queue storage.

Data-in-Transit Encryption:

• Transport Layer Security (TLS/SSL): Azure uses TLS/SSL protocols to
encrypt data transmitted between users and Azure services, as well as
between Azure services.
• Azure VPN Gateway: Provides secure encrypted tunnels between your
on-premises network and Azure Virtual Network (VPN encryption).

Encryption for Data Services:

• SQL Database Transparent Data Encryption (TDE): Automatically
encrypts data in Azure SQL Database, ensuring data remains encrypted
at rest.
• Azure Cosmos DB Encryption: Offers encryption of data both at rest and
in transit within Azure Cosmos DB using platform-managed keys.
• Azure Storage Encryption: Besides SSE, Azure provides client-side
encryption where applications encrypt data before storing it in Azure
Storage using customer-managed keys.

Encryption Key Management:

• Azure Key Vault: Centralizes key management and helps safeguard
cryptographic keys and secrets used by cloud applications and services.

Application-Level Encryption:

• Developers can implement encryption within their applications using
libraries and APIs provided by Azure to encrypt sensitive data before
storing it in Azure services or transmitting it over networks.
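On the application side, the data-in-transit protections above rest on TLS, and client code can enforce a minimum protocol version. A minimal sketch using Python's standard `ssl` module (not an Azure SDK):

```python
import ssl

# Build a client-side context that refuses anything older than TLS 1.2.
ctx = ssl.create_default_context()
ctx.minimum_version = ssl.TLSVersion.TLSv1_2

# create_default_context() leaves certificate validation and hostname
# checking enabled, which is what you want when talking to cloud services.
```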

4. Compare Azure Blob Storage and Azure Files for storing container data in
Kubernetes. What factors would influence your choice between the two?
(storage type and use cases)

Azure Blob Storage:

1. Storage Type:
o Object Storage: Azure Blob Storage is optimized for storing large
amounts of unstructured data, such as images, videos, backups, and
logs.
2. Use Cases:
o Data Lakes: Ideal for building data lakes where data is ingested from
various sources and accessed for analytics and machine learning.
o Backup and Archive: Suitable for long-term storage and archival of data
that is accessed infrequently.
o Content Distribution: Used for serving static content to web applications
and streaming media.
3. Key Features:
o Access Tiers: Offers hot, cool, and archive tiers for cost-effective data
storage based on access frequency.
o Versioning: Supports versioning of blobs to maintain historical changes.
o Lifecycle Management: Automates the transition of data between
storage tiers and deletion of outdated data.

Azure Files:

1. Storage Type:
o File Storage: Azure Files provides SMB (Server Message Block) file shares
that can be accessed over the network using standard file system
protocols.
2. Use Cases:
o Shared File Storage: Suitable for applications that need shared access to
files, such as application data, configuration files, and shared libraries.
o Development and Testing: Useful for sharing files across development
teams and testing environments.
o Distributed Applications: Supports scenarios where multiple instances
of an application need access to the same files.
3. Key Features:
o SMB Protocol: Supports SMB protocol for seamless integration with
Windows and Linux applications.
o Mounting: Can be mounted directly as file shares on Kubernetes pods
using PersistentVolumeClaims (PVCs).
o Azure File Sync: Allows synchronization of on-premises file servers with
Azure Files for hybrid cloud scenarios.

Factors Influencing Choice:

1. Access Method:
o Blob Storage: Accessed via REST APIs, suitable for applications that need
to store and retrieve large amounts of unstructured data directly.
o Azure Files: Accessed via SMB protocol, suitable for applications that
require shared file access and compatibility with existing file-based
applications.
2. Data Structure:
o Blob Storage: Best for unstructured data and binary large objects
(BLOBs).
o Azure Files: Suitable for structured data and file-based applications
requiring hierarchical file storage.
3. Performance Requirements:
o Blob Storage: Optimized for handling large files and streaming data.
o Azure Files: Offers low-latency access for small file read/write
operations.
4. Integration Needs:
o Consider whether your applications or Kubernetes workloads require
direct integration with file shares (Azure Files) or object storage (Blob
Storage).
5. Cost Considerations:
o Evaluate cost differences based on storage consumption, access
patterns (frequency of access), and data transfer.
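The decision factors above can be condensed into a small rule of thumb: SMB/shared-file access points to Azure Files, REST/object access to unstructured data points to Blob Storage, and everything else needs a cost and integration review. A toy helper encoding that (the category strings are my own labels):

```python
def choose_storage(access: str, data: str) -> str:
    """Rough decision rule from the comparison: file semantics favor
    Azure Files; object semantics favor Blob Storage."""
    if access == "smb" or data == "hierarchical-files":
        return "Azure Files"
    if access == "rest" or data == "unstructured":
        return "Azure Blob Storage"
    return "evaluate cost and integration needs"

choice_files = choose_storage("smb", "hierarchical-files")  # shared config files
choice_blob = choose_storage("rest", "unstructured")        # media, logs, backups
```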

