HBase
HBase | RDBMS
HBase is schema-less; it has no concept of a fixed column schema and defines only column families. | An RDBMS is governed by its schema, which describes the whole structure of its tables.
It is built for wide tables. HBase is horizontally scalable. | It is thin and built for small tables. It is hard to scale.
It is good for semi-structured as well as structured data. | It is good for structured data.
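
The column-family-only schema described above can be sketched with a few HBase shell commands. This is a minimal sketch, not a definitive setup: the table name emp, the families personal and professional, the row keys, and the values are all hypothetical. The commands are executed only if an hbase launcher is on the PATH, and that step assumes HBase is already running.

```shell
# Write a small HBase shell script to a scratch file.
# Table, family, row, and value names are hypothetical.
script="$(mktemp)"
cat > "$script" <<'EOF'
# Only column families are declared when the table is created.
create 'emp', 'personal', 'professional'
# Column qualifiers are chosen per row at write time.
put 'emp', 'row1', 'personal:name', 'raju'
put 'emp', 'row2', 'professional:designation', 'manager'
scan 'emp'
exit
EOF

# Run the script only if the hbase launcher is available (assumes HBase is running).
if command -v hbase >/dev/null 2>&1; then
  hbase shell "$script"
else
  echo "hbase not found on PATH; commands are in $script"
fi
```

Note that the two rows use different columns without any schema change, which is the contrast with an RDBMS drawn in the comparison.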
HDFS | HBase
HDFS is a distributed file system suitable for storing large files. | HBase is a database built on top of HDFS.
HDFS does not support fast individual record lookups. | HBase provides fast lookups for larger tables.
It provides high-latency batch processing. | It provides low-latency access to single rows from billions of records (random access).
It provides only sequential access to data. | HBase internally uses hash tables and provides random access, and it stores the data in indexed HDFS files for faster lookups.
A store contains a memstore and HFiles. The memstore works like a cache:
anything written to HBase is stored there first. Later, the data is written
out to HFiles as blocks and the memstore is flushed.
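
The write path described above can be illustrated with a toy model. This is only a sketch under stated assumptions, not how HBase is implemented: the names put, flush_memstore, and flush_threshold are hypothetical, the threshold counts rows (real HBase flushes based on memstore size), and plain sorted text files stand in for HFiles.

```shell
# Toy model of a store: writes go to an in-memory buffer (the memstore) and
# are flushed to a new sorted file (standing in for an HFile) at a threshold.
# All names and the row-count threshold are hypothetical.
store_dir="$(mktemp -d)"
memstore=""            # newline-separated "rowkey=value" entries
memstore_count=0
flush_threshold=3
hfile_count=0

flush_memstore() {
  if [ "$memstore_count" -eq 0 ]; then
    return 0
  fi
  hfile_count=$((hfile_count + 1))
  # Persist the buffered entries in row-key order, then empty the memstore.
  printf '%s' "$memstore" | sort > "$store_dir/hfile_$hfile_count"
  memstore=""
  memstore_count=0
}

put() {
  memstore="${memstore}$1=$2
"
  memstore_count=$((memstore_count + 1))
  if [ "$memstore_count" -ge "$flush_threshold" ]; then
    flush_memstore
  fi
}

put row3 c
put row1 a
put row2 b    # third write reaches the threshold and triggers a flush
put row4 d    # stays in the memstore until the next flush

echo "HFiles written: $hfile_count, entries still in memstore: $memstore_count"
```

With these four writes the sketch prints one flushed file and one entry still buffered, mirroring the idea that data lands in the memstore first and reaches HFiles later.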
wget http://www.interior-dsgn.com/apache/hbase/stable/hbase-0.98.8-hadoop2-bin.tar.gz
tar -zxvf hbase-0.98.8-hadoop2-bin.tar.gz
mv hbase-0.98.8-hadoop2/* Hbase/
Export JAVA_HOME in hbase-env.sh
Set “hbase.rootdir” and “hbase.cluster.distributed” in the hbase-site.xml file
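
The two configuration steps above might look like the following. This is a sketch under stated assumptions: the JAVA_HOME path and the file:// root directory are hypothetical placeholders, and the files are written to a temporary directory rather than the real conf directory so that nothing existing is overwritten.

```shell
# Generate example config files in a scratch directory (hypothetical paths).
conf_dir="$(mktemp -d)"

# hbase-env.sh: point HBase at a JDK. The path below is a placeholder.
cat > "$conf_dir/hbase-env.sh" <<'EOF'
export JAVA_HOME=/usr/lib/jvm/java-7-openjdk
EOF

# hbase-site.xml: where HBase stores its data and whether it runs distributed.
cat > "$conf_dir/hbase-site.xml" <<'EOF'
<?xml version="1.0"?>
<configuration>
  <property>
    <name>hbase.rootdir</name>
    <value>file:///tmp/hbase-demo/data</value>
  </property>
  <property>
    <name>hbase.cluster.distributed</name>
    <value>false</value>
  </property>
</configuration>
EOF

echo "Example config written to $conf_dir"
```

For an HDFS-backed setup, hbase.rootdir would instead be an hdfs:// URL and hbase.cluster.distributed would be set to true.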
cd /usr/local/HBase/bin
./start-hbase.sh
./local-regionservers.sh start
./hbase shell
status - Provides the status of HBase, for example, the number of servers.