How to mount HDFS on Ubuntu 14.04 - hadoop

So,I can't mount HDFS on Ubuntu 14.04. Mucommander not working, fuse not working, can somebody explain me with images or make me some tutorial. Thanks, best regards.

I successfuly did it today using instructions on Cloudera site
wget http://archive.cloudera.com/cdh5/one-click-install/trusty/amd64/cdh5-repository_1.0_all.deb
sudo dpkg -i cdh5-repository_1.0_all.deb
sudo apt-get update
sudo apt-get install hadoop-hdfs-fuse
sudo mkdir -p <mount_point>
hadoop-fuse-dfs dfs://<name_node_hostname>:<namenode_port> <mount_point>
What OS and version of HDFS do you use?

You have to add the CDH5 repository first (tested on 15.04):
wget http://archive.cloudera.com/cdh5/one-click-install/trusty/amd64/cdh5-repository_1.0_all.deb
sudo dpkg -i cdh5-repository_1.0_all.deb
sudo apt-get update
Then proceed with #AlexDvoretsky's answer.

Alternative instructions for CDH5 repository installation on Ubuntu 16.04: From Cloudera.
sudo wget 'https://archive.cloudera.com/cdh5/debian/wheezy/amd64/cdh/cloudera.list' -O /etc/apt/sources.list.d/cloudera.list
sudo apt-get update
sudo apt-get install hadoop-hdfs-fuse
Then back to #AlexDvoretsky's answer

I developed a python program that uses FUSE to mount HDFS as a standard filesystem, it also works with Kerberos and it works well: https://github.com/EDS-APHP/py-hdfs-mount

Related

cloudera director client installation

I am trying to install apache-spot and Cloudera on AWS following these tutorials:
http://blog.cloudera.com/blog/2018/02/apache-spot-incubating-and-cloudera-on-aws-in-60-minutes/
https://www.cloudera.com/documentation/director/latest/topics/director_get_started_aws_install_dir_server.html#concept_xnb_cwh_jx
But when I run:
sudo apt-get install cloudera-director-client
The Ubuntu terminal gives me:
E: unable to locate package cloudera-director-client
I already ran this code to fix:
$ wget http://archive.cloudera.com/director/ubuntu/precise/amd64/director/cloudera-director.list
$ wget http://archive.cloudera.com/director/ubuntu/precise/amd64/director/archive.key
$ sudo apt-key add archive.key
$ sudo mv cloudera-director.list /etc/apt/sources.list.d
$ sudo apt-get update
Anyone able to help me? Thanks
The cloudera-director.list file that you have installed is likely old. The one in the URL you have listed is for Ubuntu 12.04 (Precise Pangolin). Make sure that you pick the release that matches your OS version.
Cloudera Altus Director 2.x is available for Ubuntu 14.04 (Trusty Tahr) at http://archive.cloudera.com/director/ubuntu/trusty/amd64/director/
Cloudera Altus Director 6.0.0 is available for Ubuntu 16.04 and 14.04 at http://archive.cloudera.com/director6/6.0.0/ubuntu1604/ or http://archive.cloudera.com/director6/6.0.0/ubuntu1404/
Please try installing again with a new cloudera-director.list and archive.key

Linux server sudo command not found

I am having Linux server with
2.6.32-673.8.1.lve1.4.3.el6.x86_64 - GNU/Linux
and trying to run following commands :
sudo apt-get update
-bash: sudo: command not found
I have also tried - sudo su
-bash: sudo: command not found
And for lsb_release -a
-bash: lsb_release: command not found
I am trying to install ruby. Any help is highly appreciated.
sudo is in its own package. You probably don't have it installed, so you'll have to login as root and install it.
Sadly, just showing the kernel version doesn't help much, so see if this returns anything: cat /etc/*-release
If it does, you could be using something like CentOS or any other flavor of linux, so apt-get may not work anyway.
Hopefully, you are logging in with root if sudo isn't available. Running whoami should let you know your username if you are not sure.
If you are root, then try (for CentOS) yum install sudo to get sudo or just yum install ruby to install Ruby. If yum is not found, then try apt-get. If that still doesn't work, then you could be using a system with a different package manager (like emerge for Gentoo Linux).
EDIT: Does anything happen if you try /sbin/sudo (or /usr/sbin/sudo)?
Often sudo is not installed by default on distributions like Debian.
Tipe into the console:
su -
After that try to install sudo with this line:
apt-get install sudo -y
To use sudo properly you should give yourself Superuser permissions:
usermod -aG sudo USERNAME
After this step reboot your device and everything should work fine.

ejabberdctl not found in bash Ubuntu(installing ejabberd)

I was trying to install ejabberd with applying the tutorials in many site to my VM which is ubuntu but I am stuck in the beginning. After I wrote
sudo apt-get update
sudo apt-get -y install ejabberd
it installs ejabberd. But when I try to write the following
ejabberdctl register admin localhost mypassword
it says ejabberdctl not found. I also tried to restart it with but it is still same.
sudo service ejabberd restart
Note: I did not install erlang seperately. Can it be the problem?
Try sudo ejabberdctl,
if it didn't work, Do:
sudo updatedb
sudo locate ejabberdctl
check if the output is in your $PATH variable.

How to uninstall all versions of hadoop completely from the system?

I had installed CDH5 with Mvr1 in ubuntu 14.04 LTS (single node) in pseudo-distributed mode using this tutorial
http://www.cloudera.com/content/www/en-us/documentation/enterprise/latest/topics/cdh_qs_mrv1_pseudo.html
I used the command
sudo apt-get install hadoop-0.20-conf-pseudo
to install the package in pseduo distributed mode.
I then tried to uninstall it and migrate to YARN (MvR2). But in doing so, my datanode fails to start up every time. I removed Mvr1 and installed YARN using this tutorial:
http://www.cloudera.com/content/www/en-us/documentation/enterprise/latest/topics/cdh_qs_yarn_pseudo.html.
I used the command
sudo apt-get remove hadoop-0.20-conf-pseudo hadoop-0.20-mapreduce-*
to uninstall Mvr1
and
sudo apt-get install hadoop-conf-pseudo
to install YARN.
Can you suggest me how to remove all versions of hadoop completely from my system and verify that no file remains before I do a fresh installation?
Do a:
sudo dpkg -l | grep hadoop
to see what packages are installed and then go through the list running:
sudo apt-get remove
on anything that pops up. That should remove hadoop completely from your system.

Installing cloudera impala without cloudera manager

Kindly provide the link for installing the imapala in ubuntu without cloudera manager. Couldn't able to install with official link.
Unable to locate package impala using these queries :
sudo apt-get install impala # Binaries for daemons
sudo apt-get install impala-server # Service start/stop script
sudo apt-get install impala-state-store # Service start/stop script
First you need to get the list of packages and store it in /etc/apt/sources.list.d/, then update the packages, then you fire the Impala queries.
At the terminal do the following:
cd /etc/apt/sources.list.d/
wget http://archive.cloudera.com/impala/ubuntu/precise/amd64/impala/cloudera.list
sudo apt-get update
sudo apt-get install impala # Binaries for daemons
sudo apt-get install impala-server # Service start/stop script
sudo apt-get install impala-state-store # Service start/stop script
Have you installed the correct list file?
See "Packages and Repositories" on page 14.
http://www.cloudera.com/content/cloudera-content/cloudera-docs/Impala/latest/PDF/Installing-and-Using-Impala.pdf

Resources