CCS334 -BDA -QB - SEC A
CCS334 -BDA -QB - SEC A
CCS334 -BDA -QB - SEC A
Question Bank
(ODD Semester 2023-24)
Prepared by
Mr.N. KARTHIK, [AP/CSE]
Department of Computer Science and Engineering,
Kingston Engineering College.
DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING
QUESTION BANK (R2021)
Year/Sem: III / V SEM Batch: 2021-25
Subject Code/Name: CCS334 – BIG DATA ANALYTICS
Part- A
1. Discuss in detail about Big Data, compare difference between Data Science and Big Data,
Benefits and challenges.[APR/MAY 2022]
2. Explain about the Converegence of Key trends in detail.[NOV/DEC 2022]
3. Discuss about Unstructured Data and compare with structured Data.
4. Explain about the Industry examples of Big Data.[[NOV/DEC 2020]
5. Elaborate on Web Analytics with examples.
6. Briefly explain about the Big Data Applications.[APR/MAY 2021]
7. Explain about Big Data Technologies.
8. Explain about the Hadoop features, ecosystem, Hadoop advantages.[APR/MAY 2022]
9. Explain about the Open Source Technologies, difference between Open source and Open
Standards,
10. Explain about the i)Cloud and Big Data,
ii) difference between Cloud Computing and Big Data
iii) difference between Cloud Computing and internet[APR/MAY 2022]
11. Explain about the i) mobile business intelligence
ii) Difference between Mobile Analytics and Web Analytics
12. Discuss briefly about the Crowd Sourcing Analytics in detail.[NOV/DEC 2021]
13. Explain the Inter and Trans Firewall Analytics in detail.
14. Explain Big data and Hadoop open source technologies? [NOV/DEC 2021]
15. Explain Characteristics of big data applications. [APRIL/MAY 2018]
16. Discuss Industry Examples of Bigdata in detail. [APRIL/MAY 2019] [APRIL/MAY 2021]
PART A
PART-B
PART A
PART B
PART A
1. Why do we need Hadoop streaming?
2. What is the Hadoop Distributed file system?
3. What is data locality optimization?
4. Why do map tasks write their output to the local disk, not to HDFS?
5. Why is a block in HDFS so large?
6. How HDFS services support big data?
7. What if writable were not there in Hadoop?
8. Define serialization.
9. What is writables? Explain its importance in Hadoop.
10. What happens if a client detects an error when reading a block in Hadoop?
11. What is MapFile?
12. What are Hadoop pipes?
13. Define Blocks.
14. Define Job tracker.
15. Define Task tracker.
16. What is input splits?
17. List the features of Hadoop Streaming.
18. Define DataNodes.
19. Define chunks.
20. List the design issue of HDFS.
21. List the goals of HDFS.
22. Define Checkpoint node.
23. Define Data queue.
24. Define raw file system.
25. List the benefits of compression.
26. Define Writable.
27. Define Writable Comparator.
28. List the primitive writable data types available in Hadoop.
29. Define sequence files.
30. Define lazy write-back caching.
PART B
PART A
1. What is HBase?
2. What is Hive?
3. What is Hive data definition?
4. Explain services provided by Zookeeper in Hbase.
5. What is Zookeeper?
6. What are the responsibilities of HMaster?
7. Where to Use HBase?
8. Explain unique features of Hbase?
9. Explain data model in Hbase?
10.What is the difference between Pig Latin and Pig engine?
11.What is pig storage?
12.What are the features of Hive?
13.List the features and application of Hbase.
14.Where to use Hbase?
15.Difference between HDFS and Hbase.
16.Difference between Hbase and Relational Database.
17.List the limitations of Hbase.
18.What are the four components in Pig Hadoop framework?
19.Define Parser.
20.Define Optimizer.
21.Define Compiler.
22.What are the advantages of Pig?
PART B
1. What is Hbase? Draw architecture of Hbase. Explain difference between HDFS and Hbase.
[APR/MAY 2020]
2. i. Write short note on Hbase client.[NOV/DEC 2020]
ii. What is pig? Explain feature of pig. Draw architecture of pig.[NOV/DEC 2019]
3. Explain about Hbase clients.[NOV/DEC 2022]
4. Explain about Praxis with examples. [APR/MAY 2022]
5. Describe about the Pig. Explain the features of Pig Hadoop. Explain the Pig Data Model.
[APR/MAY 2019]
6. Describe Hive with architecture. List the data types and file formats.[NOV/DEC/2018]
7. Explain HiveQL Data Definition.[NOV/DEC 2021]
8. Explain in detail about HiveQL Data Manipulation.[APR/MAY 2021]
9. Describe about HiveQL Queries.[APR/MAY 2018]
10. Describe the system architecture and components of Hive and Hadoop(13) [APRIL/MAY
2021]
11. What is HBase? Give detailed note on features of HBASE(13) [NOV/DEC 2018] [NOV/DEC
2021]
12. Explain features and Application of Hbase in detail.[APR/MAY 2022]
13. Difference between HDFS and Hbase.[APR/MAY 2022]
14. Explain about the limitations of Hbase.[NOV/DEC 2022]
15. Discuss about the Hbase and Relational database. [APR/MAY 2019]
16. List the features of Pig Hadoop.[NOV/DEC 2021]
17. Describe the advantages and the disadvantages of Pig.[APR/MAY 2018]
18. Explain about the types of Pig Data Model.[APR/MAY 2022]
19. Explain the Data types and File formats.
20. Draw the Architecure of Hive and explain in detail about the Hive Architecture.
21. Describe about the developing and testing Pig Latin Scripts.