
Cloud Data Warehouse or CDW Platforms

The document discusses cloud data warehouse (CDW) platforms and provides details about several popular CDW options. It defines CDWs as databases designed for analytics, scalability and usability that are offered as managed cloud services. The document then lists several major CDWs - Amazon Redshift, Azure Synapse Analytics, Google BigQuery, Azure SQL Database, and Snowflake. For each, it summarizes key features, strengths, and weaknesses based on research. Finally, it provides sources for further information about each CDW platform.

Don Mariano Marcos Memorial State University

Mid - La Union Campus


College of Information Technology
"Center of Development in Information Technology"
City of San Fernando

Name: GACUTAN, Ram Adrian N. Student ID #: 201-1517-2


Year and Section: 3E Activity: Cloud Data Warehouse or CDW platforms

TITLE
Cloud Data Warehouse or CDW platforms

Research different CDW available online. Based on your research, you should be able to
answer the following:
1. Definition: Define what CDWs are and how they relate to data warehousing.
- A cloud data warehouse (CDW) is a database designed with analytics, scalability, and
usability in mind and offered as a managed service in the public cloud. The greater
accessibility, scalability, and speed of cloud-based data warehouses enable business
intelligence teams to provide quicker and better insights than was previously possible.

- A CDW provides flexibility: it can be up and running in minutes instead of months, and it
can expand or contract as needed. Legacy on-premises data warehouses must migrate to the
cloud to continue providing value and to integrate into a new analytics environment.

- A cloud service provider handles the management and hosting of the solution. Users get the
cloud's inherent adaptability, along with more predictable prices that can be set at a flat
rate or charged according to actual usage.
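
The two pricing styles mentioned above (a flat rate versus a charge tied to actual usage) can be compared with a little arithmetic. The sketch below is only an illustration: the function names, the monthly fee, and the price per terabyte are hypothetical values chosen for the example, not the rates of any real CDW vendor.

```python
def usage_based_cost(tb_scanned: float, price_per_tb: float) -> float:
    """Monthly cost when the bill follows actual usage (hypothetical model)."""
    return tb_scanned * price_per_tb


def break_even_tb(flat_monthly_fee: float, price_per_tb: float) -> float:
    """Usage level (in TB) at which both pricing styles cost the same."""
    if price_per_tb <= 0:
        raise ValueError("price_per_tb must be positive")
    return flat_monthly_fee / price_per_tb


def cheaper_plan(tb_scanned: float, flat_monthly_fee: float, price_per_tb: float) -> str:
    """Return which pricing style is cheaper for a given monthly usage."""
    usage_cost = usage_based_cost(tb_scanned, price_per_tb)
    if usage_cost < flat_monthly_fee:
        return "usage-based"
    if usage_cost > flat_monthly_fee:
        return "flat-rate"
    return "equal"


# Assumed example numbers: a 2000.0 flat fee and 5.0 per TB scanned.
FLAT_FEE = 2000.0
PRICE_PER_TB = 5.0
```

With these assumed numbers, light usage favors usage-based billing and heavy usage favors the flat rate; the crossover point is what `break_even_tb(FLAT_FEE, PRICE_PER_TB)` returns.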

Sources:
1. https://hevodata.com/blog/cloud-data-warehouse-101/
2. https://www.informatica.com/resources/articles/what-is-a-cloud-data-warehouse.html
3. https://cloud.google.com/learn/what-is-a-data-warehouse
2. Availability: What are the different CDWs available online?
1. Amazon Redshift - Amazon Redshift, also called AWS Redshift, is a fully managed,
petabyte-scale cloud data warehouse product made for storing and analyzing large amounts
of data. It is also used for large-scale database migrations.

2. Azure Synapse Analytics - Azure Synapse is a limitless analytics service that combines
enterprise data warehousing and Big Data analytics. It gives you the freedom to query data
on your own terms, using either serverless or dedicated resources at scale.

3. Google BigQuery - BigQuery is a fully managed enterprise data warehouse that lets you
manage and analyze your data with built-in features like machine learning, geospatial
analysis, and business intelligence.

4. Azure SQL Database - Azure SQL Database is a fully managed platform as a service (PaaS)
database engine that takes care of most database management tasks, like upgrading,
patching, backups, and monitoring, without the user having to do anything.

5. Snowflake - Snowflake is a fully managed SaaS (software as a service) offering that
provides a single platform for data warehousing, data lakes, data engineering, data science,
data application development, and secure sharing and consumption of real-time/shared data.
Snowflake has built-in features like separation of storage and compute, on-the-fly scalable
compute, data sharing, data cloning, and support for third-party tools to meet the needs of
growing businesses.
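
All of the platforms above are built for analyzing stored data, typically through SQL. As a rough illustration of the kind of analytical query a data warehouse answers, the sketch below aggregates revenue per region and month. It uses Python's built-in `sqlite3` module as a local stand-in only; the `sales` table, its columns, and the sample rows are hypothetical, and none of this is the client API of Redshift, Synapse, BigQuery, Azure SQL Database, or Snowflake.

```python
import sqlite3


def monthly_revenue(rows):
    """Load (order_date, region, amount) rows and total the revenue per region and month."""
    conn = sqlite3.connect(":memory:")  # throwaway in-memory database
    try:
        conn.execute("CREATE TABLE sales (order_date TEXT, region TEXT, amount REAL)")
        conn.executemany("INSERT INTO sales VALUES (?, ?, ?)", rows)
        cursor = conn.execute(
            """
            SELECT region,
                   substr(order_date, 1, 7) AS month,  -- 'YYYY-MM' prefix of the date
                   SUM(amount) AS revenue
            FROM sales
            GROUP BY region, substr(order_date, 1, 7)
            ORDER BY region, month
            """
        )
        return cursor.fetchall()
    finally:
        conn.close()


# Hypothetical sample data for the illustration.
SAMPLE_ROWS = [
    ("2023-01-05", "north", 100.0),
    ("2023-01-20", "north", 50.0),
    ("2023-02-02", "north", 70.0),
    ("2023-01-11", "south", 30.0),
]
```

Calling `monthly_revenue(SAMPLE_ROWS)` returns one row per region and month. A real CDW runs the same style of query, but over far larger tables and on managed cloud infrastructure.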

Sources:
1. https://www.scnsoft.com/analytics/data-warehouse/cloud
2. https://hevodata.com/blog/amazon-redshift-pros-and-cons/
3. https://www.sumologic.com/blog/what-is-amazon-redshift/
4. https://learn.microsoft.com/en-us/azure/synapse-analytics/
5. https://cloud.google.com/bigquery/docs/introduction
6. https://learn.microsoft.com/en-us/azure/azure-sql/database/sql-database-paas-overview?view=azuresql
7. https://www.snaplogic.com/blog/snowflake-data-platform#:~:text=A%20Snowflake%20database%20is%20where,size%2C%20compression%2C%20and%20statistics
3. Comparison: List their features and capabilities. What are their weaknesses and
strengths?

1. Amazon Redshift
FEATURES
- Analytics for everyone
- Analyze all your data
- Performance at any scale
- Most secure and compliant
- ETL
- Pricing
STRENGTHS
- High-performance warehouse solution
- Consistent backup for your data
- Integrates with third-party tools
WEAKNESSES
- Parallel upload is only supported for specific data
- Does not work as a live app database

2. Azure Synapse Analytics
FEATURES
- Synapse SQL dedicated pool
- T-SQL
- ETL and data integration
- Variety of analytics services
- Real-time data stream processing
STRENGTHS
- Data integration via PolyBase
- Data distribution
- Resource allocation (for production ETL and report users)
WEAKNESSES
- Integration with external third-party data sources is missing in Azure Synapse
- Master data services and data quality services are missing

3. Google BigQuery
FEATURES
- Multi-cloud functionality (BQ Omni)
- Built-in ML integration (BQ ML)
- Foundation for BI (BQ BI Engine)
- Geospatial analysis (BQ GIS)
- Automated data transfer (BQ Data Transfer Service)
- Free access (BQ Sandbox)
- Serverless architecture
- Standard SQL
- Automated backup
- In-memory functions
- Limitless data types
STRENGTHS
- Fully managed platform that does not require downtime for updates and automatically [...]
- Allows for querying data across cloud platforms
- Excels at analyzing enormous data sets
- Uses artificial intelligence to optimize storage
- Supported by pretty much any platform
- Works on the petabyte scale
WEAKNESSES
- Works best only with flat tables
- Tooling support outside of the GCP ecosystem is often lacking

4. Azure SQL Database
FEATURES
- Scalability and resource management with elastic pools
- Performance tuning
- Security
- Business consistency features
STRENGTHS
- Robust database
- Easy to access
- Easy to manage
- High speed
- Minimal downtime
- Highly reliable
WEAKNESSES
- Limited to structured data
- The query editor is lacking
- Limited data type support
- Lacking in table creation and in the addition of new columns
- Hard-to-understand UI

5. Snowflake
FEATURES
- Scalability
- Concurrency and workload separation
- Near-zero administration
- Semi-structured data
- Security
STRENGTHS
- Highly reliable
- Allows users to organize data in any manner they wish
- Offers IP whitelisting to limit data access to only trusted, authorized users
- Offers a much greater capacity without needing to update equipment
WEAKNESSES
- Does not support unstructured data
- Unsuitable for bulk data
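
The comparison above lists "works best only with flat tables" as a Google BigQuery weakness. As a rough illustration of what preparing data as a flat table can involve, the sketch below turns a nested record into a single flat row with dotted column names. The function name, the separator, and the sample record are assumptions made for this example; this is not a BigQuery API or a loading procedure recommended by the cited sources.

```python
def flatten_record(record: dict, parent_key: str = "", sep: str = ".") -> dict:
    """Flatten nested dictionaries into one level, joining keys with `sep`.

    Non-dictionary values (including lists) are copied unchanged.
    """
    flat = {}
    for key, value in record.items():
        column = f"{parent_key}{sep}{key}" if parent_key else str(key)
        if isinstance(value, dict):
            # Recurse into nested objects and merge their flattened columns.
            flat.update(flatten_record(value, column, sep))
        else:
            flat[column] = value
    return flat


# Hypothetical nested order record.
ORDER = {
    "id": 1,
    "customer": {"name": "Ana", "address": {"city": "San Fernando"}},
    "total": 250.0,
}
```

Here `flatten_record(ORDER)` produces the columns `id`, `customer.name`, `customer.address.city`, and `total`, which is the one-row-per-record shape a flat table expects.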

Sources:

1. https://aws.amazon.com/redshift/features/?tag=mochaglobal10-20
2. https://brandongaille.com/23-pros-and-cons-of-amazon-redshift/
3. https://intellipaat.com/blog/azure-synapse-analytics/
4. https://www.trustradius.com/reviews/azure-synapse-analytics-azure-sql-data-warehouse-2022-03-15-20-41-14
5. https://www.2ndwatch.com/blog/high-level-overview-google-bigquery/
6. https://www.trustradius.com/products/sql-azure/reviews?qs=pros-and-cons#reviews
7. https://www.zuar.com/blog/pros-and-cons-of-using-snowflake-cloud-data-warehouse/
4. Purpose: Why are there CDWs nowadays? What is their purpose, and where are they being
used?
- CDWs deliver faster and better insights because they are easier to access, can be scaled
up, and perform better. They also provide real-time data from numerous sources, allowing
users to run better analytics quickly. Scaling a cloud data warehouse is also less expensive
because it does not require purchasing new hardware. Cloud data warehousing therefore
exists to make data analytics and access cost-effective, efficient, highly reliable, and,
most of all, secure.

- The purpose of cloud data warehousing is to provide faster insights with the help of
powerful computing and to deliver real-time cloud analytics. A CDW processes data from
diverse sources much faster than an on-premises data warehouse, which lets users access
more insights quickly.

- Cloud data warehousing is used by businesses, companies, and enterprises to store the
historical information they have collected.

Sources:
1. https://www.qlik.com/us/cloud-data-migration/cloud-data-warehouse#:~:text=Faster%20Insights%3A%20A%20cloud%20data,to%20access%20better%20insights%2C%20faster.
2. https://hevodata.com/blog/cloud-data-warehouse-101/
