0% found this document useful (0 votes)

27 views

Hadoop-Hive Report

Report Hadoop

Uploaded by

Arturo Daniel Cordova

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

27 views

Hadoop-Hive Report

Report Hadoop

Uploaded by

Arturo Daniel Cordova

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 17

Hospital Database Implementation Using

Apache Hive, Hadoop, and Google Cloud

Dataproc
Hisham AlQanneh – 2110017

Table of Contents:
0- Introduction
1- Abstract
2- Environment Setup
3- Prerequisites
4- Database Creation
a. Tables of Dataset
b. EER Diagram
c. Relationships
d. Schema
5- Implementation
a. Creating The Relations
b. Inserting Values
c. Queries
6- Conclusion
Introduction:
In the contemporary healthcare landscape, the efficient management of hospital data is
paramount to ensuring high-quality patient care and operational effectiveness. Hospitals
generate vast amounts of data daily, ranging from patient records and staff details to billing
information and departmental logistics. To handle this complexity, robust database
management systems are essential. Apache Hive, a data warehouse software that facilitates
querying and managing large datasets residing in distributed storage, is increasingly being
adopted for such tasks. This report delves into the creation and management of a hospital
database using Apache Hive, supported by Hadoop and Google Cloud Dataproc. It covers the
design and implementation of various tables to store and retrieve critical hospital data, outlines
the SQL commands used for these operations, and provides an analysis of the system's
efficiency and areas for improvement.

Hadoop, an open-source framework, enables the distributed processing of large data sets
across clusters of computers using simple programming models. It is designed to scale up from
single servers to thousands of machines, each offering local computation and storage. Google
Cloud Dataproc is a fast, easy-to-use, fully managed cloud service for running Apache Spark and
Apache Hadoop clusters. Dataproc automates cluster management and simplifies the process
of running big data workloads in the cloud, providing a scalable and cost-effective solution for
data processing.

Abstract:
This report presents the implementation of a hospital database using Apache Hive, aimed at
improving data management and retrieval processes within a healthcare setting. The database
encompasses various essential entities, including staff, doctors, nurses, departments, rooms,
patients, medical records, and billing information. The document details the SQL commands
executed to create and populate these tables, providing a comprehensive overview of the data
structure and relationships. The database operations are supported by Hadoop, which enables
distributed data processing, and Google Cloud Dataproc, which simplifies cluster management
and enhances scalability. Key issues such as logging configuration conflicts and illegal reflective
access warnings are identified, with recommendations provided to resolve these challenges.
Additionally, the report analyzes the performance of data queries and suggests optimizations
for storage and query execution. This analysis highlights the strengths of the current system
while proposing strategies for enhancing future scalability, performance, and maintainability.
Environment Setup:
We started by creating a Virtual Machine instance(cluster) in google cloud dataproc

Prerequisites:
We started a Hive Session in it.
Database Creation:
Tables of Dataset:
EER Diagram:
The Relationships:
1. doctor treats patient:
(One to many) as one doctor can treat many patients at once
(Partial, total) participation as a doctor don’t need to treat a patient while a patient
should be treated by a doctor
2. doctor surpervises nurse:
(One to many) as one doctor can supervise more than one nurse
(Partial, total) participation as a doctor don’t need to supervise a nurse while a
nurse should be supervised by a doctor
3. doctor works_in department:
(Many to one) as many doctors can work in the same department
(Total, total) participation as every doctor works in a department and every
department have at least one doctor that works in it
4. nurse goven room:
(One to many) as one nurse can goven more than one room
(Partial, total) participation as not every nurse govens a room but every room is
govened by a nurse
5. patient has medical record:
(One to many) as one patient can have many medical records
(Partial, total) participation as a patient isn't required to have a medical record
while a medical record is required to have a patient
6. Patient assigned room:
(Many to one) as many patients can be assigned to same room
(Partial, partial) participation as not every patient is assigned a room and not every
room has a patient
7. patient issued bill:
(One to one) as every patient has one bill
(Total, total) participation as every patient is issued a bill and each bill has a patient
The Database Schema:
Creating The Relations:
Inserting Values:
SQL Queries:
1. Select all doctors and their specialties:

2. Select all patients and their respective doctors:

3. Select all nurses and the doctors they report to:

4. Find the total charges of each bill and their patients:

5. Select all patients who were admitted in 2023:

6. Find all departments and their managers:

7. Find all rooms and their availability:

8. Find all patients who were diagnosed with 'Heart Disease':

9. Find the average age of all patients:

10. Find all patients who are currently admitted (haven't been
discharged yet):

(No patient has been discharged yet)

Conclusion
The implementation of the hospital database using Apache Hive, supported by Hadoop and
Google Cloud Dataproc, provides a robust foundation for managing healthcare data efficiently.
While the current setup is effective for small to medium-sized datasets, future optimizations
and updates are necessary to handle larger volumes of data and ensure long-term scalability
and performance.

Uml Diagrams of Hospital Managments
91% (68)
Uml Diagrams of Hospital Managments
34 pages
Project Management of Clinical Trials
From Everand
Project Management of Clinical Trials
Richard Chamberlain
No ratings yet
FHIR DATA SOLUTIONS WITH AZURE FHIR SERVER, AZURE API FOR FHIR & AZURE HEALTH DATA SERVICES: INCLUDES END-TO-END DESIGN PHI DATA LAKE FOR EHR, OMICS, IMAGING, IOMT, WEARABLES & BUSINESS DATA
From Everand
FHIR DATA SOLUTIONS WITH AZURE FHIR SERVER, AZURE API FOR FHIR & AZURE HEALTH DATA SERVICES: INCLUDES END-TO-END DESIGN PHI DATA LAKE FOR EHR, OMICS, IMAGING, IOMT, WEARABLES & BUSINESS DATA
AJIT DASH
No ratings yet
Hospital Database Project
75% (4)
Hospital Database Project
23 pages
Learn Data Warehousing in 24 Hours
From Everand
Learn Data Warehousing in 24 Hours
Alex Nordeen
No ratings yet
Crossbow Trigger Mechanism Simple But Sturdy PDF
100% (1)
Crossbow Trigger Mechanism Simple But Sturdy PDF
3 pages
361المشروع اسيات قواعد البيانات
No ratings yet
361المشروع اسيات قواعد البيانات
8 pages
Report#nn
No ratings yet
Report#nn
15 pages
Case Study On Processing Data Driven For Health
No ratings yet
Case Study On Processing Data Driven For Health
9 pages
Database Final Project Report
No ratings yet
Database Final Project Report
9 pages
Hcin 543 Final Report
No ratings yet
Hcin 543 Final Report
11 pages
Mini Project Doc 2
No ratings yet
Mini Project Doc 2
25 pages
Noc 24 Hs 176 s 650906310
No ratings yet
Noc 24 Hs 176 s 650906310
19 pages
Medical Big Data Warehouse: Architecture and System Design, A Case Study: Improving Healthcare Resources Distribution
No ratings yet
Medical Big Data Warehouse: Architecture and System Design, A Case Study: Improving Healthcare Resources Distribution
16 pages
Dbms Final Report
No ratings yet
Dbms Final Report
58 pages
Phase 1
No ratings yet
Phase 1
3 pages
Hms Rdbms Project Reportprint
No ratings yet
Hms Rdbms Project Reportprint
19 pages
Database Management System
No ratings yet
Database Management System
12 pages
DC241 STID5034 - Assignment 1
No ratings yet
DC241 STID5034 - Assignment 1
3 pages
DBMS Micro-Project 1
No ratings yet
DBMS Micro-Project 1
15 pages
Fixed Dbms Project On Hospital Management
No ratings yet
Fixed Dbms Project On Hospital Management
23 pages
Hospital Management System
100% (1)
Hospital Management System
14 pages
Hospital Database Management System SQL PDF
100% (2)
Hospital Database Management System SQL PDF
5 pages
Cmpe 226 - Database Project: Hospital Management & Alert System
No ratings yet
Cmpe 226 - Database Project: Hospital Management & Alert System
24 pages
project report sample
No ratings yet
project report sample
7 pages
DM - Report (1) (1) VISHAL
No ratings yet
DM - Report (1) (1) VISHAL
14 pages
Healthy Mission Hospital Project Detailed
No ratings yet
Healthy Mission Hospital Project Detailed
2 pages
AISHUSDA (DBMS)
No ratings yet
AISHUSDA (DBMS)
26 pages
Hospital Management System PDF
No ratings yet
Hospital Management System PDF
19 pages
Assignment 1
No ratings yet
Assignment 1
113 pages
Ijs DR 2204037
No ratings yet
Ijs DR 2204037
8 pages
Database - Hospital
50% (2)
Database - Hospital
60 pages
Ass 1 Database
No ratings yet
Ass 1 Database
6 pages
Design and Implementation of A Hospital Database M
No ratings yet
Design and Implementation of A Hospital Database M
7 pages
dbmsfinalisampdf
No ratings yet
dbmsfinalisampdf
46 pages
HOSPITAL PROJECT REPORT WITHOUT INDEX
No ratings yet
HOSPITAL PROJECT REPORT WITHOUT INDEX
122 pages
Case Study DS-BDA
No ratings yet
Case Study DS-BDA
29 pages
Hospital Management System
No ratings yet
Hospital Management System
3 pages
Micro Project DBMS
No ratings yet
Micro Project DBMS
18 pages
DBMS Casestudy
57% (7)
DBMS Casestudy
6 pages
DM CS2
No ratings yet
DM CS2
3 pages
Hospital Management System
No ratings yet
Hospital Management System
35 pages
DBMS Case Study Hospital Management System
100% (1)
DBMS Case Study Hospital Management System
31 pages
Role of Databases in Hospital Information Management
100% (1)
Role of Databases in Hospital Information Management
17 pages
HOSPITAL DATABASE Information
No ratings yet
HOSPITAL DATABASE Information
11 pages
Case Study
No ratings yet
Case Study
7 pages
Dbmsminiproject 211218133048
No ratings yet
Dbmsminiproject 211218133048
20 pages
ASSIGNMENT
No ratings yet
ASSIGNMENT
9 pages
Efficient Management of Large Metadata Catalogs in a Ubiquitous Computing Environment
From Everand
Efficient Management of Large Metadata Catalogs in a Ubiquitous Computing Environment
Daniel Beatty
No ratings yet
SQL Programming & Database Management For Absolute Beginners: SQL Server, Structured Query Language Fundamentals: "Learn - By Doing" Approach And Master SQL
From Everand
SQL Programming & Database Management For Absolute Beginners: SQL Server, Structured Query Language Fundamentals: "Learn - By Doing" Approach And Master SQL
William Sullivan
No ratings yet
Hadoop Ecosystem for Big Data
From Everand
Hadoop Ecosystem for Big Data
Dr. Zemelak Goraga
No ratings yet
Building and Operating Data Hubs: Using a practical Framework as Toolset
From Everand
Building and Operating Data Hubs: Using a practical Framework as Toolset
Georg Graner
No ratings yet
Towards best practice in the Archetype Development Process
From Everand
Towards best practice in the Archetype Development Process
Alberto Moreno Conde
No ratings yet
Apache Hive Handbook: Query, Analyze, and Optimize Big Data
From Everand
Apache Hive Handbook: Query, Analyze, and Optimize Big Data
Robert Johnson
No ratings yet
EHR Systems Administrator - The Comprehensive Guide: Vanguard Professionals
From Everand
EHR Systems Administrator - The Comprehensive Guide: Vanguard Professionals
ANTILLIA TAURED
No ratings yet
Development of Pharmacy Service Weights in the Implementation of Casemix System for Provider Payment: Concept, Methods and Applications
From Everand
Development of Pharmacy Service Weights in the Implementation of Casemix System for Provider Payment: Concept, Methods and Applications
Dr Syed M. Aljunid
No ratings yet
Implementation of Casemix System as Prospective Provider Payment Method in Social Health Insurance: a Case Study of Acheh Provincial Health Insurance
From Everand
Implementation of Casemix System as Prospective Provider Payment Method in Social Health Insurance: a Case Study of Acheh Provincial Health Insurance
Prof Dr Syed Mohamed Aljunid
No ratings yet
Clinical Decision Support System: Fundamentals and Applications
From Everand
Clinical Decision Support System: Fundamentals and Applications
Fouad Sabry
5/5 (1)
Sample Size Tables for Clinical Studies
From Everand
Sample Size Tables for Clinical Studies
David Machin
No ratings yet
SQL for Beginners: Your Essential Guide to Querying and Managing Databases
From Everand
SQL for Beginners: Your Essential Guide to Querying and Managing Databases
Emily Harris
No ratings yet
Health Informatics Specialist - The Comprehensive Guide
From Everand
Health Informatics Specialist - The Comprehensive Guide
Viruti Shivan
No ratings yet
Bibliography of DR
No ratings yet
Bibliography of DR
3 pages
Leica Model II
No ratings yet
Leica Model II
49 pages
Year 6 Daily Lesson Plans
No ratings yet
Year 6 Daily Lesson Plans
8 pages
On A/C FSN All: Reference QT Y Designation
No ratings yet
On A/C FSN All: Reference QT Y Designation
19 pages
Concrete Mix Design Calculation
100% (1)
Concrete Mix Design Calculation
10 pages
Vanagon Protraining Digifant I 86-91
No ratings yet
Vanagon Protraining Digifant I 86-91
51 pages
Course: BOT - 525 Title: Botany - Service Course Basic Plant Tissue Culture
No ratings yet
Course: BOT - 525 Title: Botany - Service Course Basic Plant Tissue Culture
14 pages
Full download Can We Know Anything?: A Debate (Little Debates about Big Questions) Bryan Frances pdf docx
100% (2)
Full download Can We Know Anything?: A Debate (Little Debates about Big Questions) Bryan Frances pdf docx
51 pages
Solar Swimming Pool Heating: Do It Yourself?
No ratings yet
Solar Swimming Pool Heating: Do It Yourself?
10 pages
Unctad - Digital Identity
No ratings yet
Unctad - Digital Identity
51 pages
Rotronic HygroPalm HP32 User Manual
No ratings yet
Rotronic HygroPalm HP32 User Manual
5 pages
L'Oreal Test
No ratings yet
L'Oreal Test
1 page
Sense Relations
No ratings yet
Sense Relations
8 pages
LNK562 564
No ratings yet
LNK562 564
16 pages
Director of Health Services Mumbai (Driver Vacancy Matrix)
No ratings yet
Director of Health Services Mumbai (Driver Vacancy Matrix)
1 page
Quant Checklist 141 PDF 2022 by Aashish Arora
No ratings yet
Quant Checklist 141 PDF 2022 by Aashish Arora
80 pages
HIRAC
No ratings yet
HIRAC
38 pages
AN - 30122-01 - Scout Camp - Final
No ratings yet
AN - 30122-01 - Scout Camp - Final
4 pages
Eci MD-500
No ratings yet
Eci MD-500
152 pages
Negotiating For Success
No ratings yet
Negotiating For Success
11 pages
Flottec F150 Frother MSDS r04
No ratings yet
Flottec F150 Frother MSDS r04
6 pages
USB2000 Fiber Optic Spectrometer Operating Instructions: Ocean Optics, Inc
No ratings yet
USB2000 Fiber Optic Spectrometer Operating Instructions: Ocean Optics, Inc
45 pages
Rachel Yehuda Cortisol PTSD Policía Bomberos
No ratings yet
Rachel Yehuda Cortisol PTSD Policía Bomberos
14 pages
Giant Bicycles Case Study Version
No ratings yet
Giant Bicycles Case Study Version
5 pages
Drilling Machine and Types
No ratings yet
Drilling Machine and Types
15 pages
Assignment 1 Measurelab 3 Final
No ratings yet
Assignment 1 Measurelab 3 Final
12 pages
441907983 Coursera Certificate PDF
No ratings yet
441907983 Coursera Certificate PDF
1 page
Invoice 1839 2023-09-11
No ratings yet
Invoice 1839 2023-09-11
2 pages
Download full Networks on Chips Theory and Practice Embedded Multi Core Systems 1st Edition Fayez Gebali ebook all chapters
100% (3)
Download full Networks on Chips Theory and Practice Embedded Multi Core Systems 1st Edition Fayez Gebali ebook all chapters
78 pages

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.

Hadoop-Hive Report

Uploaded by

Hadoop-Hive Report

Uploaded by

Hospital Database Implementation Using

Apache Hive, Hadoop, and Google Cloud

2. Select all patients and their respective doctors:

4. Find the total charges of each bill and their patients:

6. Find all departments and their managers:

8. Find all patients who were diagnosed with 'Heart Disease':

(No patient has been discharged yet)

You might also like

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.