Pivotal HAWQ - Centos 7 Installation - greenplum

We have small Hadoop cluster. Hadoop HDP version Installed in it.
Env:
VM consist of os : Centos 7
Facing compatibility issue : HAWQ compatibility issue as it is not supported for Centos 7 Yet.
Constraint: We already installed Hadoop cluster on Centos 7.
Any help on it would be much appreciated.

HAWQ is not, as of yet, supported on 7. It is in the backlog of items and should hopefully be done quickly - but if you're looking to test it's capabilities in the near term, I suggest you reinstall with < 7.

Related

Install specific version of hadoop using Ambari

I am new to HDP installation using Ambari. I want to install Hadoop 2.9.0 using Ambari web installation. My Ambari version is 2.7.0.0 and I am using HDP 3.0 which has Hadoop 3.1.0. But I need to install Hadoop 2.9.0. Can someone please let me know if this can be done? And how can this be achieved?
I have not started the cluster installation yet and I'm done with Ambari installation.
Ambari uses pre-defined software stacks.
HDP does not offer any stack with Hadoop 2.9.0
You would therefore need to manually install that version of Hadoop yourself, although you can still manage the servers (but not the Hadoop configuration) using Ambari
In any case, there's little benefit to installing a lower version of the software, plus you won't get Hortonworks support if you do that

CDH components version numbers

I have an installed CDH cluster and used hadoop version, but it returns only with Hadoop version. Is there any way to get maybe all installed components version number on a graphical interface? Which command can get for example Spark version number?
Open CM (hostname:portnumber) -> Hosts tab -> Host Inspector to find what version of
CM and CDH is installed across all hosts in the cluster, as well as installed cdh components list with version details
Spark version can checked in using
spark-submit --version
Spark was developed separately from Hadoop-hdfs and Hadoop-mapreduce as a standalone tool which can be be used along with Hadoop, as such most of its interfaces are are different from hadoop.

Install HDP 2.2 on CentOS 7

As I can see HDP 2.2 needs Centos 6.5 as an operation system, probably because Ambari needs Centos 6.5. My question is if anyone has installed it on Centos 7. Is there any hard dependencies that will not allow me to complete the installation successfully?
Ambari 2.2+ can be installed successfully and works fine on CentOS 7. Then you can install HDP 2.0+.
As far as I'm aware there are no hard dependencies, per se. However Ambari itself looks at the operating system version, and if its CentOS 7, it'll stop the install.
In order to work around that you'd need to edit Ambari's source code.
Just consult the official Installation Guide for a relevant Ambari version
https://cwiki.apache.org/confluence/display/AMBARI/Install+Ambari+2.2.0+from+Public+Repositories
It's an up-to-date source of OS compatibility information.
Here you can see that Centos 7 is officially supported.

Ubuntu img file with hadoop installed in?

Are there any hadoop in ubuntu img files avalible? I mean a ubuntu system which has hadoop configured in it.
If you want a distribution install, Cloudera has a painless installation process for Ubuntu using Cloudera Manager. Though it only officially supports 12.04, it should also install on 13.x
You can download it here: Cloudera Manager
Cloudera Manager supports the following operating systems:
Red Hat-compatible systems
Red Hat Enterprise Linux and CentOS 5.7, 64-bit
Red Hat Enterprise Linux and CentOS 6.4, 64-bit
Oracle Enterprise Linux 6.4, 64-bit
SLES systems - SUSE Linux Enterprise Server 11, 64-bit. Service Pack 3 or later is required. Also, the SUSE Linux Enterprise Software Development Kit 11 SP1 is required on cluster hosts running the Cloudera Manager Agents (not required on the Cloudera Manager Server host); you can download the SDK here.
Debian systems - Debian 7.0, 6.0 (deprecated), 64-bit
Ubuntu systems - Ubuntu 12.04, 10.04 (deprecated), 64-bit
AFAIK there are neither no "official" ubuntu packages for hadoop nor customized ditributions, but there is a PPA you could use. See https://launchpad.net/~hadoop-ubuntu/+archive/stable for instructions. So at least you can install "semioffical" packages (including updates).

accumulo, zookeeper hadoop Installation instructions, downloads and versions for CENTOS 6

I would appreciate guidance on accumulo, zookeeper hadoop Installation instructions, downloads and versions for CENTOS 6.
Thanks,
Chris
You can do the installation via cloudera manager version 5. I recently installed accumulo using the same.
Here is the link for Cloudera manager 5.
You can use this Youtube video as reference.

Resources