0% found this document useful (0 votes)

10 views6 pages

hadoop

Hadoop is an open-source framework for storing and processing large amounts of data, primarily using HDFS for storage and YARN for resource management. It includes additional modules like Hive, Pig, and HBase for enhanced functionality and is widely used in big data applications. Key features include fault tolerance, high availability, and cost-effectiveness, although it is not suitable for small data quantities.

Uploaded by

1mp22ad002

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

10 views6 pages

hadoop

Uploaded by

1mp22ad002

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 6

What is Hadoop?

Hadoop is an open source software programming framework for storing a large amount of data and
performing the computation. Its framework is based on Java programming with some native code in
C and shell scripts.
Hadoop has two main components:
 HDFS (Hadoop Distributed File System): This is the storage component of Hadoop, which
allows for the storage of large amounts of data across multiple machines. It is designed to work
with commodity hardware, which makes it cost-effective.
 YARN (Yet Another Resource Negotiator): This is the resource management component of
Hadoop, which manages the allocation of resources (such as CPU and memory) for processing
the data stored in HDFS.
 Hadoop also includes several additional modules that provide additional functionality, such as
Hive (a SQL-like query language), Pig (a high-level platform for creating MapReduce
programs), and HBase (a non-relational, distributed database).
 Hadoop is commonly used in big data scenarios such as data warehousing, business
intelligence, and machine learning.

Features of hadoop:
1. it is fault tolerance.
2. it is highly available.
3. it’s programming is easy.
4. it have huge flexible storage.
5. it is low cost.
Advantages of HDFS: It is inexpensive, immutable in nature, stores data reliably, ability to tolerate
faults, scalable, block structured, can process a large amount of data simultaneously and many
more. Disadvantages of HDFS: It’s the biggest disadvantage is that it is not fit for small quantities
of data. Also, it has issues related to potential stability, restrictive and rough in nature.
Commands:
1. ls: This command is used to list all the files. Use lsr for recursive approach. It is useful when
we want a hierarchy of a folder.
Syntax:
bin/hdfs dfs -ls <path>
Example:
bin/hdfs dfs -ls /
It will print all the directories present in HDFS. bin directory contains executables
so, bin/hdfs means we want the executables of hdfs particularly dfs(Distributed File System)
commands.

2. mkdir: To create a directory. In Hadoop dfs there is no home directory by default. So let’s first
create it.
Syntax:
bin/hdfs dfs -mkdir <folder name>

creating home directory:

hdfs/bin -mkdir /user

hdfs/bin -mkdir /user/username -> write the username of your computer

touchz: It creates an empty file.

Syntax:
bin/hdfs dfs -touchz <file_path>
Example:
bin/hdfs dfs -touchz /geeks/myfile.txt

1. copyFromLocal (or) put: To copy files/folders from local file system to hdfs store. This is the
most important command. Local filesystem means the files present on the OS.
Syntax:
bin/hdfs dfs -copyFromLocal <local file path> <dest(present on hdfs)>
Example: Let’s suppose we have a file AI.txt on Desktop which we want to copy to
folder geeks present on hdfs.
bin/hdfs dfs -copyFromLocal ../Desktop/AI.txt /geeks

(OR)

bin/hdfs dfs -put ../Desktop/AI.txt /geeks

1. cat: To print file contents.
Syntax:
bin/hdfs dfs -cat <path>
Example:
// print the content of AI.txt present
// inside geeks folder.
bin/hdfs dfs -cat /geeks/AI.txt ->
2. copyToLocal (or) get: To copy files/folders from hdfs store to local file system.
Syntax:
bin/hdfs dfs -copyToLocal <<srcfile(on hdfs)> <local file dest>
Example:
bin/hdfs dfs -copyToLocal /geeks ../Desktop/hero

(OR)

bin/hdfs dfs -get /geeks/myfile.txt ../Desktop/hero

myfile.txt from geeks folder will be copied to folder hero present on Desktop.

Note: Observe that we don’t write bin/hdfs while checking the things present on local
filesystem.
3. moveFromLocal: This command will move file from local to hdfs.
Syntax:
bin/hdfs dfs -moveFromLocal <local src> <dest(on hdfs)>
Example:
bin/hdfs dfs -moveFromLocal ../Desktop/cutAndPaste.txt /geeks

4. cp: This command is used to copy files within hdfs. Lets copy folder geeks to geeks_copied.
Syntax:
bin/hdfs dfs -cp <src(on hdfs)> <dest(on hdfs)>
Example:
bin/hdfs dfs -cp /geeks /geeks_copied

5. mv: This command is used to move files within hdfs. Lets cut-paste a
file myfile.txt from geeks folder to geeks_copied.
Syntax:
bin/hdfs dfs -mv <src(on hdfs)> <src(on hdfs)>
Example:
bin/hdfs dfs -mv /geeks/myfile.txt /geeks_copied

6. rmr: This command deletes a file from HDFS recursively. It is very useful command when you
want to delete a non-empty directory.
Syntax:
bin/hdfs dfs -rmr <filename/directoryName>
Example:
bin/hdfs dfs -rmr /geeks_copied -> It will delete all the content inside the
directory then the directory itself.

7. du: It will give the size of each file in directory.

Syntax:
bin/hdfs dfs -du <dirName>
Example:
bin/hdfs dfs -du /geeks

1. dus:: This command will give the total size of directory/file.

Syntax:
bin/hdfs dfs -dus <dirName>
Example:
bin/hdfs dfs -dus /geeks

1. stat: It will give the last modified time of directory or path. In short it will give stats of the
directory or file.
Syntax:
bin/hdfs dfs -stat <hdfs file>
Example:
bin/hdfs dfs -stat /geeks
2. setrep: This command is used to change the replication factor of a file/directory in HDFS. By
default it is 3 for anything which is stored in HDFS (as set in hdfs core-site.xml).
Example 1: To change the replication factor to 6 for geeks.txt stored in HDFS.
bin/hdfs dfs -setrep -R -w 6 geeks.txt
Example 2: To change the replication factor to 4 for a directory geeksInput stored in HDFS.
bin/hdfs dfs -setrep -R 4 /geeks
Note: The -w means wait till the replication is completed. And -R means recursively, we use it
for directories as they may also contain many files and folders inside them.

Note: There are more commands in HDFS but we discussed the commands which are commonly
used when working with Hadoop. You can check out the list of dfs commands using the following
command:
bin/hdfs dfs

Hadoop HDFS Commands With Examples
No ratings yet
Hadoop HDFS Commands With Examples
3 pages
Thrive: Solar LED Home Lighting System
No ratings yet
Thrive: Solar LED Home Lighting System
2 pages
HDFS and HAdoop command
No ratings yet
HDFS and HAdoop command
5 pages
BDA
No ratings yet
BDA
88 pages
BDA-ALLEXP (2)_merged
No ratings yet
BDA-ALLEXP (2)_merged
149 pages
Bda Practical File
No ratings yet
Bda Practical File
28 pages
Ex-2 Hadoop Commands (1)
No ratings yet
Ex-2 Hadoop Commands (1)
6 pages
kh5(bda)_merged
No ratings yet
kh5(bda)_merged
21 pages
BDA Final Compiled_pagenumber
No ratings yet
BDA Final Compiled_pagenumber
71 pages
Hadoop-HDFS-commands
No ratings yet
Hadoop-HDFS-commands
1 page
Integrated Design Engineering: Interdisciplinary and Holistic Product Development Sándor Vajna pdf download
100% (4)
Integrated Design Engineering: Interdisciplinary and Holistic Product Development Sándor Vajna pdf download
55 pages
Hafs Commands
No ratings yet
Hafs Commands
17 pages
Final Bda 1-8 Lab Aayush
No ratings yet
Final Bda 1-8 Lab Aayush
17 pages
DSCI 551 _ Lab 2 _ Aayush Chamria (1)
No ratings yet
DSCI 551 _ Lab 2 _ Aayush Chamria (1)
3 pages
2335_m4_demo1_v1_b54_kwf9d75
No ratings yet
2335_m4_demo1_v1_b54_kwf9d75
8 pages
Hadoop Assignement Sumit 241111 133837
No ratings yet
Hadoop Assignement Sumit 241111 133837
13 pages
Big Data Cheat Sheet
No ratings yet
Big Data Cheat Sheet
12 pages
Command
No ratings yet
Command
1 page
Lab Assignment-1
No ratings yet
Lab Assignment-1
4 pages
Exp-2 Hadoop Commands
No ratings yet
Exp-2 Hadoop Commands
6 pages
Experiment No 1
No ratings yet
Experiment No 1
13 pages
HDFS Command
No ratings yet
HDFS Command
15 pages
HDFS Commands1
No ratings yet
HDFS Commands1
18 pages
hdfs commands
No ratings yet
hdfs commands
3 pages
Big Data & Analytics Lab Manual
No ratings yet
Big Data & Analytics Lab Manual
51 pages
Unit 2-HDFS SGS
No ratings yet
Unit 2-HDFS SGS
29 pages
C21053 Jay Vijay Karwatkar-Big Data Analytics & Visualization
No ratings yet
C21053 Jay Vijay Karwatkar-Big Data Analytics & Visualization
210 pages
PDC All Labs
100% (1)
PDC All Labs
129 pages
HDFS Commands - Revised
No ratings yet
HDFS Commands - Revised
6 pages
BDA Record (1)
No ratings yet
BDA Record (1)
34 pages
Week 1 in Terminal
No ratings yet
Week 1 in Terminal
10 pages
HDFS Commands
No ratings yet
HDFS Commands
7 pages
BDA UNIT -3 Updated (1).docx
No ratings yet
BDA UNIT -3 Updated (1).docx
25 pages
Hadoop Command Line Interface
No ratings yet
Hadoop Command Line Interface
10 pages
Assignment HDFS
No ratings yet
Assignment HDFS
1 page
HDFS Commands AfterINstallation
No ratings yet
HDFS Commands AfterINstallation
4 pages
Hadoop1
No ratings yet
Hadoop1
15 pages
Hadoop Commands Only
No ratings yet
Hadoop Commands Only
19 pages
Practical 1 - 1 - Hadoop Commands
No ratings yet
Practical 1 - 1 - Hadoop Commands
3 pages
COMMAND Line Interface
No ratings yet
COMMAND Line Interface
26 pages
BIG DATA UNIT -2
No ratings yet
BIG DATA UNIT -2
18 pages
1948-65-HD-FL-Panhead-Parts-Catalog
No ratings yet
1948-65-HD-FL-Panhead-Parts-Catalog
86 pages
Hadoop Hdfs Commands
No ratings yet
Hadoop Hdfs Commands
2 pages
Data Storage Data Processing: Hadoop Distributed File System (HDFS) Mapreduce
No ratings yet
Data Storage Data Processing: Hadoop Distributed File System (HDFS) Mapreduce
35 pages
HDFS Commands Updated
No ratings yet
HDFS Commands Updated
87 pages
Lista de Comandos HDFS
No ratings yet
Lista de Comandos HDFS
8 pages
Hadoop Linux Hdfs Commands
No ratings yet
Hadoop Linux Hdfs Commands
2 pages
Dsa Practical File
No ratings yet
Dsa Practical File
16 pages
HDFS (Hadoop Distributed File System) : HDFS Architecture Components of The Architecture
No ratings yet
HDFS (Hadoop Distributed File System) : HDFS Architecture Components of The Architecture
10 pages
Bermocoll EHM 300 PDS
No ratings yet
Bermocoll EHM 300 PDS
3 pages
HDFS Tutorial
No ratings yet
HDFS Tutorial
5 pages
Formal Method in SE
No ratings yet
Formal Method in SE
17 pages
Hadoop File System: CSC 369 Distributed Computing Alexander Dekhtyar
No ratings yet
Hadoop File System: CSC 369 Distributed Computing Alexander Dekhtyar
5 pages
HDFS
No ratings yet
HDFS
6 pages
Paper - II Linguistics
No ratings yet
Paper - II Linguistics
16 pages
Electron JS
No ratings yet
Electron JS
21 pages
Hadoop Linux Commands
No ratings yet
Hadoop Linux Commands
8 pages
Because with only a high
No ratings yet
Because with only a high
2 pages
HDFS Basic Commands
No ratings yet
HDFS Basic Commands
2 pages
Hadoop Commands
100% (1)
Hadoop Commands
6 pages
HDFS File System Shell Guide
No ratings yet
HDFS File System Shell Guide
10 pages
Deepshikha Agrawal Pushp B.Sc. (IT), MBA (IT) Certification-Hadoop, Spark, Scala, Python, Tableau, ML (Assistant Professor JLBS)
No ratings yet
Deepshikha Agrawal Pushp B.Sc. (IT), MBA (IT) Certification-Hadoop, Spark, Scala, Python, Tableau, ML (Assistant Professor JLBS)
74 pages
History of Aerospace
No ratings yet
History of Aerospace
81 pages
Steel Girder
No ratings yet
Steel Girder
42 pages
GA& WIRING DRAWING OF MCC PANEL
No ratings yet
GA& WIRING DRAWING OF MCC PANEL
6 pages
Hadoop Commands
No ratings yet
Hadoop Commands
2 pages
Purchasing Monthly Report - November 2023
No ratings yet
Purchasing Monthly Report - November 2023
105 pages
Chapter 1
No ratings yet
Chapter 1
16 pages
Bruker: Technical Manual
No ratings yet
Bruker: Technical Manual
27 pages
HDFS Commands
No ratings yet
HDFS Commands
15 pages
Evopact SF Sf1000000x1fx
No ratings yet
Evopact SF Sf1000000x1fx
2 pages
CAI Vibration Training
No ratings yet
CAI Vibration Training
119 pages
2 HDFS Commands
No ratings yet
2 HDFS Commands
7 pages
06 1021 Introduction To Turtle Graphics s2019 9spp BW
No ratings yet
06 1021 Introduction To Turtle Graphics s2019 9spp BW
5 pages
B. Jayant Baliga Silicon RF Power MOSFETS
No ratings yet
B. Jayant Baliga Silicon RF Power MOSFETS
320 pages
Soal Bahasa Inggris SMP
No ratings yet
Soal Bahasa Inggris SMP
7 pages
Group 15 EMD332 Machine Design Report - 2D Camera Slider
100% (1)
Group 15 EMD332 Machine Design Report - 2D Camera Slider
33 pages
Lecture 2. Measuring Tools-Rules and Calipers
No ratings yet
Lecture 2. Measuring Tools-Rules and Calipers
45 pages
How To Set Up A Hadoop Cluster in Docker
No ratings yet
How To Set Up A Hadoop Cluster in Docker
13 pages
ProMatura Brochure
No ratings yet
ProMatura Brochure
16 pages
KV DV
No ratings yet
KV DV
2 pages
En Ladycomp Instructions 2008
No ratings yet
En Ladycomp Instructions 2008
21 pages
General Manager
No ratings yet
General Manager
2 pages
Hot Sauce Experiment
No ratings yet
Hot Sauce Experiment
3 pages
CD Player & FM Tuner PDF
No ratings yet
CD Player & FM Tuner PDF
8 pages
Linux Commands By Example
From Everand
Linux Commands By Example
Khaled Jamal
4.5/5 (3)
Time Table IMO Model Course 1.08
100% (1)
Time Table IMO Model Course 1.08
2 pages
Properties of Equality and Congruence
No ratings yet
Properties of Equality and Congruence
18 pages
Determination of Viscosity Through Brookfield Viscometer.
No ratings yet
Determination of Viscosity Through Brookfield Viscometer.
6 pages
Quick Configuration of Openldap and Kerberos In Linux and Authenicating Linux to Active Directory
From Everand
Quick Configuration of Openldap and Kerberos In Linux and Authenicating Linux to Active Directory
Dr. Hidaia Mahmood Alassouli
No ratings yet

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.

hadoop

Uploaded by

hadoop

Uploaded by

What is Hadoop?

creating home directory:

hdfs/bin -mkdir /user

touchz: It creates an empty file.

bin/hdfs dfs -put ../Desktop/AI.txt /geeks

bin/hdfs dfs -get /geeks/myfile.txt ../Desktop/hero

7. du: It will give the size of each file in directory.

1. dus:: This command will give the total size of directory/file.

You might also like

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.