Which Hadoop version recommended for HBase 0.90.6? - hadoop

I have no other option than to install HBase 0.90.6 as it is only recommended stable version for Nutch (web crawler) other than 0.90.4.
My question, which Hadoop version is recommended for HBase 0.90.6 to work on pseudo distributed mode?

I figured out Hadoop 0.20.205.0 is the compatible version.
I tried Hadoop 1.2.1 but it doesn't seem to work well with HBase 0.90.6

Related

Which hbase version to be used with hadoop-2.7.1

I am confused which hbase version to used with hadoop- 2.7.1 or hadoop 2.6.0?
HBase V1.2.x is now supporting Hadoop 2.7.1+ version.
I found this link, there is a chart of Hadoop and HBase compatibility.
Have a look of it.
https://www.quora.com/Which-version-of-hbase-should-I-use-with-Hadoop-2-7-1

Disk o_direct on Hadoop 1.2 and Hadoop 2.7

Does anyone know if there is a setting for o_direct for drives in Hadoop 1.2.1 or hadoop 2.7, specifically for hadoop 1.2.1. I need it for hadoop 1.2.1 to use the kernelĀ“s "O_DIRECT" flag.
Any information would be great.
I found it in the xml to utilize the 0_direct kernel flag which you need to enable in the kernel from 3.x and up with.

Compatability of Hive, Hbase and Hadoop 2.5.1

I have Hadoop 2.5.1 installed on three nodes (1 master, 2 slave nodes) and I want to know the version compatibility of HBase and Hive?
Also, are any alternatives for this Hadoop+Hbase+Hive integration or any guides explaining the installation of Hadoop 2.5.1 with compatible HBase and Hive ?
Currently I am trying with Apache Ambari for the above integration and its still ongoing.
Environment:
Jdk version: 1.7.0_67
RHEL 5
64 bit architecture
Any leads will be much appreciated!
With hadoop 2.5.1 supported versions are:
HBase-0.98.x (Support for Hadoop 1.1+ is deprecated.)
HBase-1.0.x (Hadoop 1.x is NOT supported)
HBase-1.1.x
HBase-1.2.x
Here is the link : http://hbase.apache.org/book.html#configuration
Warning: only hive 1.2.1 can work with Hbase 2.x.

Nutch in Hadoop 2.x

I have a three-node cluster running Hadoop 2.2.0 and HBase 0.98.1 and I need to use a Nutch 2.2.1 crawler on top of that. But it only supports Hadoop versions from 1.x branch. By now I am able to submit a Nutch job to my cluster, but it fails with java.lang.NumberFormatException.
So my question is pretty simple: how do I make Nutch work in my environment?
At the moment it's impossible to integrate Nutch 2.2.1 (Gora 0.3) with HBase 0.98.x.
See: https://issues.apache.org/jira/browse/GORA-304
Official Nutch tutorial recommends only 0.90.x HBase branch:
http://wiki.apache.org/nutch/Nutch2Tutorial
Also you can download HBase 0.94.24-hadoop-2.5.0 version which I created and tested today:
https://github.com/dobromyslov/hbase/releases/tag/0.94.24-hadoop-2.5.0
Take a note that Nutch 2.2.1 does not support HBase 0.94.x and you have to get the latest Nutch 2.x from Git branch: https://github.com/apache/nutch/tree/2.x

hbase 0.94.11 and hadoop version

I have a Hadoop cluster with version 1.2.1 and recently i also downloaded hbase 0.94.11 to try out. I able to setup hbase t run in distributed mode but when i checked the web gui status, it stated that the Hadoop version is 1.0.4. I noticed that this is because hbase use the hadoop-core-1.0.4.jar file comes together with hbase. So my question is should i replace this jar file with the hadoop-core-1.2.1.jar so that hbase can use the latest hadoop-core jar file? And does it matter?
Cw
You don't have to do that if 1.0.4 works for you. Because the newest version may bring you any other problems and just replace hadoop-core.jar is unsafe. If you want to upgrade the HBase, please follow the official guide.
Hope it helps.

Resources