Hadoop setup and configuration - hadoop

C:\hadoop-2.3.0\bin>hadoop
The system cannot find the path specified.
Error: JAVA_HOME is incorrectly set.
Please update C:\hadoop-2.3.0\conf\hadoop-env.c
Usage: hadoop [--config confdir] COMMAND
Facing above error in Hadoop configuration. Can anyone please help to resolve the issue.

If this is for learning purpose to setup hadoop on windows you will find enough blog link
If your primary objective is to learn Hadoop then i will suggest you to download VMware Player and setup hadoop on ubantu or you can download CDH version from cloudera website to start your learning.

Related

any idea about “YarnChild” class not found in hadoop 3.1.1?

I am a hadoop beginner .
I have inastelled Hadoop 3.1.1 on a cluster. As my OS is CentOS 6.9 (64bit), I recompiled Hadoop Native library and replaced it in HADOOP_HOME/lib.
When I run wordcount example which is in "HADOOP_HOME/share/hadoop/mapreduce", I get: "Error: Could not find or load main class org.apache.hadoop.mapred.YarnChild".
I found no answer except changing the version of Hadoop on the internet.
Any idea about how to solve it?
Thanks in advance
Thanks, it is solved.
based on my exprience, any error about class not found is because of mapred-site.xml wrong configuration.
a good config is here:
hadoop pagerank error when running

How to know the default location of hadoop homepath

I have installed cloudera hadoop 2.0.0 CDH4 while doing i am not mentioning any
home path.It is working fine now. But when i ran JPS command.It was showing jps
process only.
So i tried to start the hadoop but I am unable to find the location of hadoop,
where It is actually stored. So can any one please help how to find the default
location.
Is there any commands are there to find the exact location of hadoop in my
system?
please help me on this issue.
Thanks,
Anbu k
Yes, you could try:
find / -name hadoop

Cannot start standalone instance of HBase

I was unable to configure the HBase standalone instance. Following are the steps I followed:
Downloaded hbase-0.98.9-hadoop2 and extracted it.
Set my JAVA_HOMEin the environment variables.
Edited conf/hbase-site.xml and changed the configuration as mentioned in the Apache HBase quick start guide.
Ran the bin/start-hbase.sh and this error came up.
Can anyone tell me what I'm missing or doing wrong? Thanks
Here are the steps:
http://hbase.apache.org/cygwin.html
Hbase cannot be installed without cygwin tooling.

Setting up Hadoop Client on Mac OS X

Currently, I have 3-node cluster running CDH 5.0 using MRv1. I am trying to figure out how to setup Hadoop on my Mac. So, I can submit jobs to the cluster. According to the "Managing Hadoop API Dependencies in CDH 5", you just need the files in /usr/lib/hadoop/client-0.20/* Do I need the following files too? Does Cloudera has hadoop-client in tarball?
- core-site.xml
- hdfs-site.xml
- mapred-site.xml
Yes, I'nk you can make use of cloudera tarball for setting up hadoop client, the same can be downloaded from the following path, configuration files are availble under etc/hadoop/ directory under Hadoop, just need to modify those files according to your environment.
http://archive-primary.cloudera.com/cdh5/cdh/5/hadoop-2.2.0-cdh5.0.0-beta-2.tar.gz
If the above link doesn't match your version, use the following link for getting the available hadoop versions
http://archive-primary.cloudera.com/cdh5/cdh/5/

CDH4 installation using tarball

I have been struggling to install CDH via tarball, there is no document that describes the steps or guides through. I do have root access on the server & wish to install CDH4 via tarball in Pseudo mode. Can anyone help?. On the same server apache hadoop is also installed, i want to install this CDH, without effecting the existing apache hadoop.
It will not work..because in the end CDH4 will use the same ports which your existing apache hadoop is using..It will work ..if you shutdown your existing hadoop cluster and then start your CDH4 cluster. Or else change all the port numbers for namenode,secondary namenode,jobtracker, tasktracker and datanode and their respective web UI's port..which is kind of tedious.. It would be also helpful if you provide some error logs..So I can highlight what exactly is the problem.

Resources