Exercise 2 Configuring Flume For Data Loading: IBM Software
Exercise 2
Configuring Flume for Data Loading
© Copyright IBM Corporation, 2013
US Government Users Restricted Rights - Use, duplication or disclosure restricted by GSA ADP Schedule Contract with IBM Corp.
Contents
Lab 2 Configuring Flume for Data Loading
1.1 Getting Started
1.2 Getting PuTTY Setup
1.3 Install the Flume Service
1.4 Creating a Configuration File for Basic Testing
1.5 Testing Your First Agent
1.6 A More Complicated Configuration Overview
1.7 Setup Your Agents
1.8 Test Your Configuration
1.9 Transfer Data
This version of the lab was updated and tested on the InfoSphere BigInsights 4.1 Quick Start Edition.
Throughout this lab you will be using the following account login information. If your passwords are
different, please note the difference.
Username Password
Flume is a distributed service for efficiently moving large amounts of data. The original use
case of Flume was to gather a set of log files on various machines in a cluster and aggregate
them to a centralized persistent store such as Hadoop Distributed File System (HDFS). It has
since been re-architected and expanded to cover a much wider variety of data.
Probably the most important ingredient for this exercise is your imagination. You are going to
have to draw upon the inner child within you. Your exercises are running on a BigInsights cluster
that has a whopping one node. So it should be obvious that you are not going to be able to
move data from one node to another. For this exercise to be somewhat meaningful, you are
going to have to imagine that, when you start multiple agents, you have multiple nodes and that
Flume is running on each of those nodes.
Note:
Be aware that the initialization of the Flume agents will take several minutes. Please be patient.
1.1 Getting Started

__1. Open a Web browser and navigate to http://rvm.svl.ibm.com:8080, and sign in using the
Ambari user id and password specified at the beginning of this document.
Notice that most of the BigInsights components listed at the left are in a Stopped state as
indicated by the red, triangular warning icon.
__4. Click Actions -> Selected Hosts (1) -> Hosts -> Start All Components
Note: Be sure to allow ample time for all the components to start. The first time you start the components
and services, it may take approximately 30 minutes or even longer, depending on the physical resources
on your machine.
__6. Periodically examine the background operations indicator at the top of the screen. It should show
that 1 operation is running. When it updates to 0 ops, the Start All script is complete and the
components should all be running, as indicated by the green check mark icons on the left.
Note: If your icons still show red warning signs after the startup, it may be that the Ambari interface did
not refresh properly, even though the details in the background operations show 100% and display a
successful message. Feel free to click the Admin button at the top right of the window, and then click
Sign out. You will be presented with the Ambari login screen. Log back in using the credentials at the
beginning of this document and the component list should be updated with the correct, green check-mark
icons.
Now that the components are started, you may move on to the next section.
1.2 Getting PuTTY Setup

__3. Now for your Host Name: you should be able to find it easily on the page shown when you first
log into the BigInsights QuickStart 4.1 VM image.
Note: To get to this page when you're already logged in, just type “exit” until you get back to this
screen (instead of restarting the whole VM).
__4. Now just input the Host Name (IP address) into your PuTTY client under “Host Name” and
click “Open.” It should look like this: (Optional: click “Save” after you input your details to keep
them for next time!)
__5. Next you should be greeted with a command line requesting login details. Just enter the
VM image setup screen login (default username: root, password: password). Then you have
successfully set up PuTTY to connect to your VM!
1.3 Install the Flume Service

Keep clicking Next through the screens and finally click the Deploy button. You should receive
confirmation that the service has been deployed and started. After the install, you may be asked to
restart some of the Hadoop components. Restart the requested components and move on to the
next section of the lab.
1.4 Creating a Configuration File for Basic Testing

__ 1. Make sure you are logged into your lab image as root.
__ 2. If Hadoop is not running, start it using Ambari.
__ 3. Your Flume configuration file can reside anywhere as long as the agent can access it.
The convention is to place the configuration file in Flume’s conf directory. For this lab, we
will simply place our Flume configuration files in the /home/virtuser directory.
Start the vi editor, or use a text editor such as Notepad (transferring the file via a VM shared folder).
Important:
Did you notice that the binding definition for the source contains the keyword channels (plural)
while the binding definition for the sink contains the keyword channel (singular)? This is because
a source can write to multiple channels, whereas a sink can only read from a single channel.
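The intermediate steps that build the file are not shown in this copy of the lab. A minimal single-agent configuration consistent with the description (component names here are illustrative, not necessarily those used in the lab) might look like this:

```properties
# Hypothetical flume_agent1.properties for basic testing.
# A seq source generates an endless stream of numbered events,
# which a logger sink writes to the console.
agent1.sources = seqSource
agent1.sinks = logSink
agent1.channels = memChannel

agent1.sources.seqSource.type = seq

agent1.sinks.logSink.type = logger

agent1.channels.memChannel.type = memory
agent1.channels.memChannel.capacity = 100

# Sources bind with "channels" (plural); sinks bind with "channel" (singular)
agent1.sources.seqSource.channels = memChannel
agent1.sinks.logSink.channel = memChannel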
__ 10. Save your work into File System->home->virtuser and make sure the file is named
flume_agent1.properties.
1.5 Testing Your First Agent

Note:
To stop your running agent, press ctrl-z in the console window. This is true both for an agent
that did not initialize properly due to a configuration error and for one that is running just fine.
Note, however, that ctrl-z does not terminate the Java process.
If you have a configuration error, press ctrl-z, correct your problem, and restart your agent. You
do not have to worry about killing the process; the existing process is able to reload the
configuration file.
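Since ctrl-z only suspends the shell job while the Java process stays alive, you can remove the process entirely with ordinary job control if you ever need to. This is not a required lab step, and the job number %1 below assumes a single suspended job:

```shell
# List the shell's jobs; the suspended agent shows up as "Stopped"
jobs
# Send SIGTERM to job 1 (adjust the job number to match your jobs output)
kill %1
```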
Now start your Flume agent. Override the default logging configuration and write
informational records to the console. Your agent’s name is agent1. Note: a lot of data will
be written to the console; ctrl-z will stop the output.
bin/flume-ng agent --name agent1 --conf conf --conf-file /home/virtuser/flume_agent1.properties -Dflume.root.logger=INFO,console

or

bin/flume-ng agent -n agent1 --conf conf -f /home/virtuser/flume_agent1.properties -Dflume.root.logger=INFO,console
1.7 Setup Your Agents

agent13 is to do the initial read of the data from files dropped into a specified directory.
__ 1. First create your directory. From a command line
mkdir /home/virtuser/flumesourcedata
__ 2. Open a new file in your text editor.
__ 3. Define the source, sink, and channel to be used by agent13 as well as the bindings.
__ a. The source name is spoolDirSource. Its type is spooldir.
__ b. The sink name is avroSink. Its type is avro. (Remember that to pass events from one
agent to another requires avro sinks and sources.)
The hostname for binding is localhost and the port is 10013.
__ c. The channel name is memChannel. Its type is memory and it has a capacity of 100.
#These statements are for agent13
agent13.sources = spoolDirSource
agent13.sinks = avroSink
agent13.channels = memChannel
agent13.sources.spoolDirSource.type = spooldir
agent13.sources.spoolDirSource.spoolDir= /home/virtuser/flumesourcedata
agent13.sinks.avroSink.type = avro
agent13.sinks.avroSink.hostname = localhost
agent13.sinks.avroSink.port = 10013
agent13.channels.memChannel.type = memory
agent13.channels.memChannel.capacity = 100
agent13.sources.spoolDirSource.channels = memChannel
agent13.sinks.avroSink.channel = memChannel
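As an aside, the plural/singular binding rule can be checked mechanically. The small sketch below is illustrative only (it is not part of the lab or of Flume itself); it parses the agent13 properties above and verifies that every source and sink binding refers to a channel declared in the agent's channels list:

```python
# Illustrative sanity check for a Flume-style properties snippet:
# every sink "channel" entry and source "channels" entry must name
# a channel declared in "<agent>.channels".
CONFIG = """
agent13.sources = spoolDirSource
agent13.sinks = avroSink
agent13.channels = memChannel
agent13.sources.spoolDirSource.type = spooldir
agent13.sources.spoolDirSource.spoolDir = /home/virtuser/flumesourcedata
agent13.sinks.avroSink.type = avro
agent13.sinks.avroSink.hostname = localhost
agent13.sinks.avroSink.port = 10013
agent13.channels.memChannel.type = memory
agent13.channels.memChannel.capacity = 100
agent13.sources.spoolDirSource.channels = memChannel
agent13.sinks.avroSink.channel = memChannel
"""

def check_bindings(config: str) -> list:
    # Parse "key = value" lines into a dict, skipping blanks and comments
    props = {}
    for line in config.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue
        key, _, value = line.partition("=")
        props[key.strip()] = value.strip()
    errors = []
    for key, value in props.items():
        parts = key.split(".")
        # Match "<agent>.sinks.<name>.channel" and "<agent>.sources.<name>.channels"
        if len(parts) == 4 and parts[3] in ("channel", "channels"):
            declared = set(props.get(parts[0] + ".channels", "").split())
            for ch in value.split():
                if ch not in declared:
                    errors.append(f"{key} refers to undeclared channel {ch}")
    return errors

print(check_bindings(CONFIG))  # -> [] (all bindings resolve)
```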
__ 4. agent99 is to get its events from agent13. Since the avro sink for agent13 was bound to
localhost at port 10013, that implies that the avro source for agent99 will also be bound to
localhost at port 10013.
Also, you are going to enhance your events by adding a timestamp to the header for
each event.
Define the source, sink, and channel to be used by agent99 as well as the bindings.
#These statements are for agent99
agent99.sources = avroSource
agent99.sinks = avroSink
agent99.channels = memChannel
agent99.sources.avroSource.type = avro
agent99.sources.avroSource.bind = localhost
agent99.sources.avroSource.port = 10013
agent99.sources.avroSource.interceptors = ts
agent99.sources.avroSource.interceptors.ts.type = timestamp
agent99.sinks.avroSink.type = avro
agent99.sinks.avroSink.hostname = localhost
agent99.sinks.avroSink.port = 10099
agent99.channels.memChannel.type = memory
agent99.channels.memChannel.capacity = 100
agent99.sources.avroSource.channels = memChannel
agent99.sinks.avroSink.channel = memChannel
__ 5. agent86 is to get its events from agent99. So there must be an avro source to receive the
data and the data is to be passed to an hdfs sink.
Define the source, sink, and channel to be used by agent86 as well as the bindings.
__ a. The source name is avroSource. Its type is avro.
The bind parameter is localhost
The port is 10099
__ b. The sink name is hdfsSink. Its type is hdfs.
A portion of the hdfs path is created by extracting date and time information from the
header of each event: hdfs://rvm.svl.ibm.com:8020/user/virtuser/flume/%y-%m-%d/%H%M
The filePrefix is Log.
The writeFormat is Text
#These statements are for agent86
agent86.sources = avroSource
agent86.sinks = hdfsSink
agent86.channels = memChannel
agent86.sources.avroSource.type = avro
agent86.sources.avroSource.bind = localhost
agent86.sources.avroSource.port = 10099
agent86.sinks.hdfsSink.type = hdfs
agent86.sinks.hdfsSink.hdfs.path = hdfs://rvm.svl.ibm.com:8020/user/virtuser/flume/%y-%m-%d/%H%M
agent86.sinks.hdfsSink.hdfs.filePrefix = Log
agent86.sinks.hdfsSink.hdfs.writeFormat = Text
agent86.sinks.hdfsSink.hdfs.fileType = DataStream
agent86.channels.memChannel.type = memory
agent86.channels.memChannel.capacity = 100
agent86.sources.avroSource.channels = memChannel
agent86.sinks.hdfsSink.channel = memChannel
__ 6. Save your work into File System->home->virtuser and call the file
flume_agents.properties.
1.8 Test Your Configuration

agent13
__ 1. Open a PuTTY window, connect and change to the flume directory.
cd /usr/iop/4.1.0.0/flume/
__ 2. When you start agent13, even though you coded your configuration statements correctly,
you will see what looks like a Java exception when you start the agent. That is because
the avro sink is not able to connect to the source yet. Once agent99 starts and the avro
source does its bind, you should see a statement something like the following:
INFO sink.AvroSink: Avro sink avroSink: Building RpcClient with hostname: rvm, port: 10013
Execute the following:
bin/flume-ng agent -n agent13 --conf conf -f /home/virtuser/flume_agents.properties -Dflume.root.logger=INFO,console
agent99
__ 3. Open PuTTY window, connect and change to the flume directory.
cd /usr/iop/4.1.0.0/flume/
__ 4. When you start agent99, even though you coded your configuration statements correctly,
you will see what looks like a Java exception when you start the agent. That is because
the avro sink is not able to connect to the source yet. Once agent86 starts and the avro
source does its bind, you should see a statement something like the following:
Rpc sink avroSink: Building RpcClient with hostname: localhost, port: 10099
Execute the following:
bin/flume-ng agent -n agent99 --conf conf -f /home/virtuser/flume_agents.properties -Dflume.root.logger=INFO,console
agent86
__ 5. Open a PuTTY window, connect and change to the flume directory.
cd /usr/iop/4.1.0.0/flume/
__ 6. Start agent86. You should see a statement as follows:
INFO source.AvroSource: Avro source avroSource started.
Execute the following:
bin/flume-ng agent -n agent86 --conf conf -f /home/virtuser/flume_agents.properties -Dflume.root.logger=INFO,console
You should have 3 windows that look similar to this after they are all running:
agent13:
agent99:
agent86:
1.9 Transfer Data

__ 6. Next check to see that the data was moved to HDFS. Return to the terminal window
where you started agent86. You should see some statements indicating that a file was
created as a temporary file and then renamed. Notice that the directory structure is
made up of date and time information.
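The first steps of this section are missing from this copy of the lab; they have you create a file named text.txt (referenced in step 7) and drop it into agent13's spooling directory to set the whole flow in motion, roughly as follows. The file contents here are illustrative; the original lab text is not shown in this copy:

```shell
# Drop a finished file into the spooldir source's watched directory;
# agent13 will pick it up and forward its lines as events.
# (Contents are illustrative.)
echo "hello flume" > /home/virtuser/flumesourcedata/text.txt
```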
__ 7. View the contents of the newly created file. Return to the terminal window where you
created the text.txt file. Execute the following replacing the file name with your file name.
(You can do a copy and paste of the file name from the terminal window for agent86.)
Note: You must switch to the hdfs user to run hadoop commands unless you’ve changed permissions.
su hdfs
hadoop fs -cat /user/virtuser/flume/16-01-27/1342/Log.1453930980072
__ 8. Execute ctrl-z in each of the three windows where the agents are running in order to
terminate them.
__ 9. You can close your open terminal windows.
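For reference, the date-and-time directory structure seen in step 6 comes from the %y-%m-%d/%H%M escapes in agent86's hdfs.path. Those escapes happen to coincide with strftime codes, so their expansion can be sketched in Python. This is a rough illustration, not Flume's actual implementation; Flume formats the event's timestamp header in the agent's local time zone by default, while UTC is used here for a deterministic result:

```python
from datetime import datetime, timezone

def expand_hdfs_path(pattern: str, timestamp_ms: int) -> str:
    """Roughly mimic Flume's date escapes, which line up with strftime codes.

    Flume reads the timestamp (in milliseconds) from each event's header,
    which is where the interceptor added by agent99 comes in.
    """
    dt = datetime.fromtimestamp(timestamp_ms / 1000, tz=timezone.utc)
    return dt.strftime(pattern)

# 1453930980072 is the file-name suffix seen in this lab's example output
print(expand_hdfs_path("/user/virtuser/flume/%y-%m-%d/%H%M", 1453930980072))
# -> /user/virtuser/flume/16-01-27/2143
```

The lab's own directory (16-01-27/1342) differs in the time portion because it was produced in the VM's local time zone, at the minute the file was actually rolled.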
End of exercise