
Big Data Architecture and Ecosystem

Case Study
402

HDFS
1. Create a folder on HDFS named case_study.
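A possible command, assuming the default Cloudera HDFS home directory /user/cloudera:

hadoop fs -mkdir /user/cloudera/case_study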

2. Upload a file to the HDFS folder created earlier.
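For example, assuming a local file sample.txt in the home directory:

hadoop fs -put /home/cloudera/sample.txt /user/cloudera/case_study/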


3. List all the files in the HDFS folder.
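For example:

hadoop fs -ls /user/cloudera/case_study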

4. To copy files from a source path to a destination path.


hadoop fs -cp /source_path /destination_path

5. To move a file from the local file system to HDFS.


hadoop fs -moveFromLocal /localpath /hdfsdestination

6. To read the content of a file.


hadoop fs -cat /user/cloudera/case_study/sample.txt

7. To display free space.
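For example (the -h flag prints sizes in human-readable form):

hadoop fs -df -h /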


8. To return the checksum information of a file.
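For example, for the sample file uploaded earlier:

hadoop fs -checksum /user/cloudera/case_study/sample.txt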

9. To create a file of zero length.
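For example (empty_file.txt is just an illustrative name):

hadoop fs -touchz /user/cloudera/case_study/empty_file.txt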

10. To append a single source file, or multiple source files, from the local
file system to the destination file system.
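For example, assuming a local file extra.txt (the file name is illustrative):

hadoop fs -appendToFile /home/cloudera/extra.txt /user/cloudera/case_study/sample.txt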

11. To count the number of directories, files, and bytes under the paths
that match the specified file pattern.
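For example:

hadoop fs -count /user/cloudera/case_study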

12. To display the extended attribute names and values (if any) for a
file or directory.
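For example (-d dumps all extended attributes of the given path):

hadoop fs -getfattr -d /user/cloudera/case_study/sample.txt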

13. To concatenate existing source files into the target file.


hadoop fs -concat /case_study/target_file.txt /case_study/src_file1.txt
/case_study/src_file2.txt

14. To display the sizes of files and directories contained in the given
directory, or the length of a file in case it is just a file.
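For example, with human-readable sizes:

hadoop fs -du -h /user/cloudera/case_study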

15. To take a source directory and a destination file as input and
concatenate the files in the source directory into the destination local file.
hadoop fs -getmerge /user/cloudera/case_study /local/destination/concatenated_file.txt
HIVE
16. Create a database named exam_db.
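For example:

create database exam_db;
use exam_db;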

17. Create two tables in the same database.
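One possible pair of tables, consistent with the students columns used in
steps 22 and 24 (the courses table and all column types are illustrative
assumptions):

create table students (student_id int, name string, age int)
row format delimited fields terminated by ',';

create table courses (course_id int, student_id int, course_name string)
row format delimited fields terminated by ',';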


18. Load sample data into the relevant tables.
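For example, assuming comma-separated files on the local file system (the
file names are illustrative):

load data local inpath '/home/cloudera/students.csv' into table students;
load data local inpath '/home/cloudera/courses.csv' into table courses;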

19. Display the data loaded into the tables.
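For example:

select * from students;
select * from courses;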


20. Create a table with a partition and insert some random data.
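A minimal sketch, partitioning a copy of the students schema by year (table,
column, and data values are illustrative):

create table students_part (student_id int, name string, age int)
partitioned by (year int);

insert into table students_part partition (year=2023) values (1, 'Asha', 19);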

21. Create a table with 5 buckets and insert some random data.
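A minimal sketch (table and column names follow the students example above;
older Hive versions may also need set hive.enforce.bucketing = true; before
the insert):

create table students_bucketed (student_id int, name string, age int)
clustered by (student_id) into 5 buckets;

insert into table students_bucketed select student_id, name, age from students;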

22. Write a query to update a column's value. (In Hive, UPDATE and DELETE
only work on transactional tables.)

Query - update students set age = 20 where student_id = 2;

23. Create a table with the same schema as the table created earlier.
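For example (students_copy is an illustrative name):

create table students_copy like students;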

24. Write a query to delete a record.

delete from students where student_id = 3;

25. Write queries to demonstrate joins in Hive, such as an inner JOIN and a
FULL OUTER JOIN; illustrative queries are sketched below.
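The following queries are a sketch, assuming the students and courses tables
outlined under step 17 (column names are illustrative):

JOIN -

select s.name, c.course_name
from students s
join courses c on (s.student_id = c.student_id);

FULL OUTER JOIN -

select s.name, c.course_name
from students s
full outer join courses c on (s.student_id = c.student_id);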


PIG

26. Create a random dataset with the fields rollno, name, gpa and year.

Created using a text editor in Cloudera.
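A few comma-separated rows such a file might contain (the values are made up
purely for illustration):

1,Asha,8.2,2021
2,Ravi,7.5,2022
3,Meera,9.1,2021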

27. To load the dataset in the Pig shell.
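A possible LOAD statement, assuming the dataset was uploaded to HDFS as
/user/cloudera/case_study/students.txt (path and file name are illustrative):

students = LOAD '/user/cloudera/case_study/students.txt' USING PigStorage(',')
    AS (rollno:int, name:chararray, gpa:float, year:int);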

28. Pig script to display the structure and some random data loaded.
illustrate students;
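illustrate already samples some of the loaded data; the structure alone can
also be shown with:

describe students;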

29. Pig script to display resultset of name, gpa and year.
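The relation q29 dumped below could be defined like this (a sketch, assuming
the students relation from step 27):

q29 = FOREACH students GENERATE name, gpa, year;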

dump q29;
30. Pig script to group data by year.
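A possible grouping, again assuming the students relation from step 27:

group_by_year = GROUP students BY year;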

dump group_by_year;

31. Pig script to group data by gpa.
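Similarly, a sketch of the grouping by gpa:

group_by_gpa = GROUP students BY gpa;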

dump group_by_gpa;

32. Pig script to display count of records year-wise.
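A possible definition, reusing group_by_year from step 30:

count_by_year = FOREACH group_by_year GENERATE group AS year, COUNT(students) AS total;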

dump count_by_year;
33. Pig script to display sum and average of gpa.
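One way to compute both over the whole relation (a sketch):

all_students = GROUP students ALL;
sum_avg_gpa = FOREACH all_students GENERATE SUM(students.gpa) AS gpa_sum,
    AVG(students.gpa) AS gpa_avg;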

dump sum_avg_gpa;

34. Pig script to write the results to a file.

STORE students INTO '/home/cloudera/Desktop/cspigresults' USING PigStorage(',');


35. Pig script to display all records.

dump students;

Thank You
