Big Data Case Study
Big Data Case Study
Big Data Case Study
Case Study
402
HDFS
1. Create a folder on hdfs named as case_study.
10. To Append single src, or multiple srcs from local file system to
the destination file system.
11. To Count the number of directories, files and bytes under the
paths that match the specified file pattern.
12. To display the extended attribute names and values (if any) for a
file or directory.
21. Create a table with 5 buckets and insert some random data.
26. Create a random dataset using rollno, name, gpa and year.
28. Pig script to display the structure and some random data loaded.
illustrate students;
dump q29;
30. Pig script to group data by year.
dump group_by_year;
dump group_by_gpa;
dump count_by_year;
33. Pig script to display sum and average of gpa.
dump sum_avg_gpa;
dump students;
Thank You