Hadoop installation on asymmetric OS? - hadoop

I have CentOS on two nodes and Ubuntu on the other two. Can I install Cloudera 4.5 or later on these servers?
I have searched the internet but could not find any relevant information.
How can I install Cloudera on these 4 servers?

No, we cannot install Hadoop on a cluster with heterogeneous operating systems.
This is one of the limitations of Hadoop.

Related

Master and Slave system OS version

I'm trying to create my own Hadoop cluster. All my data nodes have Ubuntu 18 installed and the name node has Ubuntu 14.
Is it mandatory that the name node and data nodes have the same OS version?
It is recommended to have at least the same major version to avoid kernel vulnerabilities. If you come across these low-level issues, they are very difficult to debug.
As @piyush-p said, it's not recommended, but as long as you are running the same Java version across all the hosts you should be okay. You probably won't want to do this if you are using a commercial distribution of Hadoop (HDP, Cloudera), as their respective setup tools (Ambari, Cloudera Manager) will probably disallow this.
See "HDP Support for mix of OS Releases within a cluster" for more details.
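As a rough illustration of the advice above, here is a minimal Python sketch that flags hosts whose OS major version or Java version differs from the rest of the cluster. The inventory dictionary, its host names, and its values are all hypothetical. In practice you would collect them yourself (for example by running `java -version` and reading `/etc/os-release` on each host), and this is not something Hadoop, Ambari, or Cloudera Manager provides.

```python
from collections import defaultdict


def group_hosts(inventory, key):
    """Group host names by the value of one attribute (e.g. 'java')."""
    groups = defaultdict(list)
    for host, attrs in inventory.items():
        groups[attrs[key]].append(host)
    return dict(groups)


def find_mismatches(inventory, keys=("os_major", "java")):
    """Return {attribute: groups} for each attribute with more than one distinct value."""
    mismatches = {}
    for key in keys:
        groups = group_hosts(inventory, key)
        if len(groups) > 1:
            mismatches[key] = groups
    return mismatches


# Hypothetical inventory: host names and versions are made up for illustration.
inventory = {
    "namenode": {"os_major": "ubuntu-14", "java": "1.8.0_181"},
    "datanode1": {"os_major": "ubuntu-18", "java": "1.8.0_181"},
    "datanode2": {"os_major": "ubuntu-18", "java": "1.8.0_181"},
}

mismatches = find_mismatches(inventory)
for key, groups in mismatches.items():
    print(f"{key} differs across hosts: {groups}")
```

With this sample inventory only the OS major version is reported as differing, which matches the situation in the question: mixed Ubuntu releases but one Java version everywhere.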

Suitable hadoop framework for ubuntu

I want to start working with Hadoop and big data. I need an easy graphical interface to start with. I tried Hue but couldn't get it configured.
Please help me choose a suitable Hadoop distribution.
I use Ubuntu 14.04.
I think Cloudera or the Sandbox (by Hortonworks) is the easy way. The hard way is installing directly on Ubuntu. I also have Ubuntu 14.04, with Hadoop (Hive, Pig) and Apache Spark installed, and I don't need to open a virtual machine.
There are 3 major Hadoop distributions that you can start with.
Cloudera
Hortonworks
MapR
Each one of them has a UI installer and manager. I think the best option for you, though, would be to use the virtual environments that these vendors provide.
The Hortonworks Developer Sandbox is an image including Hue as UI to get started. However, the downloadable sandbox image is based on CentOS.
If you want to install a Hortonworks distribution on Ubuntu, you need to run an Ambari installation (Downloads - Hortonworks Hadoop). Be aware that Hue is not included in the default Ambari installation, but it can easily be installed separately. To run properly, Hue on Hortonworks still needs Python 2.6.x.
There are some distributions like Cloudera or Hortonworks, but their packages need a high-spec machine, for example 16 GB of RAM or more, and that is not always possible for the user. In addition, they include some Hadoop-related projects that the user doesn't need at all. If you want to enter this field seriously, I strongly recommend installing Hadoop on your own. Doing that, you will do some configuration yourself and get familiar with many Hadoop concepts.
You can start using this install tutorial.

Installing Windows on 100+ cluster nodes?

I have a cluster which consists of 100 nodes, currently with CentOS installed. For some reason (however bad it is :<), I need to install Windows 8.1 on them.
I am going to install Windows 8.1 on one node and set up the software needed.
Can I make a ghost image from it and distribute it to the other nodes to install it? Does anyone have similar experience?
thanks :D

How to install cloudera manager without cdh installation?

I have a Hadoop environment installed from the tarball which I downloaded at http://hadoop.apache.org/releases.html#Download.
Now I must use Cloudera Manager to monitor my MapReduce application.
Is it possible to use Cloudera Manager without a CDH installation?
Cloudera Manager is useless without CDH. Any reason why you would not just use that? Usually the question is the other way around ("I have CDH installed, do I need to use Cloudera Manager?")

How to Start working with Hadoop

Hi, I want to learn Hadoop. I have a basic idea of how Hadoop works with the MapReduce framework.
Now I want to practice on my local PC, so I want to know how to install Hadoop on a single node.
I installed VMware Workstation 10 and tried to install a Linux operating system on which to install Hadoop, but I am not able to load Ubuntu into VMware Workstation. I am getting an error: Exiting intel ..., Operating Not found.
Can anyone please give me steps on how to start with a Hadoop installation?
Should I go for one of the distributions (Cloudera, Hortonworks, MapR)? If that is simpler, please tell me how to install those distributions. (I even tried importing the Cloudera VMware file into VMware Workstation, but it did not work for me.)
You can use the VM given by Udacity for its course on Hadoop. I found it really easy to set up.
