hadoop cluster on virtual machines [closed] - hadoop

Closed. This question is off-topic. It is not currently accepting answers.
Want to improve this question? Update the question so it's on-topic for Stack Overflow.
Closed 10 years ago.
Improve this question
I have to setup a cluster, on my computer using 5 virtual machines with hadoop. The configuration requires a port number. Can someone enlighten me on this.I am a beginner in it

If your primary objective is to learn Hadoop then it does not matter you learn it on Windows or Linux, because everything is exactly same on both platforms. I have extensively used Hadoop on both platform and found all the commands and processing are identical on Windows and Linux. So here are my suggestions:
Download VMware VMPlayer on your Windows/Linux Machine
Download CDH Virtual Machine for VMware
https://ccp.cloudera.com/display/SUPPORT/Downloads
Access virtual machine in your Windows/Linux box and follow the tutorials exactly they are on Linux.
Same info is shared here:
Hadoop on Windows

Its upto you to choose the port. Normally people use the default ports provided by hadoop. For the default ports, see this. There will be absolutely no harm if you use those ports (unless you have something else running on any of those ports).

Related

Fastest way to setup Hadoop cluster using Cloudera [closed]

Closed. This question does not meet Stack Overflow guidelines. It is not currently accepting answers.
This question does not appear to be about a specific programming problem, a software algorithm, or software tools primarily used by programmers. If you believe the question would be on-topic on another Stack Exchange site, you can leave a comment to explain where the question may be able to be answered.
Closed 7 years ago.
Improve this question
Need some help here guys. I am new to Hadoop and I need to setup a Hadoop cluster fast using windows machines.
I am aware that I can use Cloudera for this but I was just wondering that instead of downloading a virtual box first, configuring it with Ubuntu and then installing CDH4 on it, can I not just download a pre-configured VM that Cloudera provides on the different machines and then network them?
Is there any step by step tutorial available to do this using the VMs provided by Cloudera?
Any help would be very appreciated.
Thanks,
Kumar
EDIT : I have VMPlayer, isos of Ubuntu 12.04 LTS, CentOS 6.2, VirtualBox and fast internet. Now can someone tell me what's the fastest way of setting up a cluster using CDH4 on 4-5 laptops I have in a LAN with windows on them?
The fastest way to setup Cloudera Hadoop cluster is to install Cloudera Manager and leave all jobs to it.
First, install Cloudera manager server in one node, start the server service.
Second, install Cloudera manager agent on other nodes, set the hostname of server to /etc/cloudera-scm-agent/config.ini, then start all the agents.
Third, use a browser to visit the http://cloudera-scm-server:7180, then follow the wizard and Cloudera Manager will take care of all left jobs.

Setup Teradata database on windows 7 32-bit [closed]

Closed. This question does not meet Stack Overflow guidelines. It is not currently accepting answers.
This question does not appear to be about a specific programming problem, a software algorithm, or software tools primarily used by programmers. If you believe the question would be on-topic on another Stack Exchange site, you can leave a comment to explain where the question may be able to be answered.
Closed 5 years ago.
Improve this question
I want to start working with Teradata database and for that I need to setup it on my system. After searching a lot I didn't find any setup which I can use to install it on windows machine. The only link I found was http://www.teradata.com/teradata-express-13-0-windows/ this one but there is no download link on this page. I have also found the VMware version to use Teradata on 64-bit windows on this link http://downloads.teradata.com/download/database/teradata-express/vmware but I am not sure how to install this using VMware after downloading the setup.
Please provide some help for installing Teradata on 32 windows or 64 windows using VmWare.
Have you read this article on Teradata's Developer Exchange? It should cover the basics of getting the VMware environment up and running.
http://developer.teradata.com/database/articles/introduction-to-teradata-express-for-vmware-player
You may wish to change the runlevel of SLES to boot to the command line instead of the Gnome desktop to reduce the memory footprint of the VM. You will want to dedicate 4GB of RAM to the VM as well.

What is virtualization (for servers) and how do I set it up [closed]

Closed. This question is off-topic. It is not currently accepting answers.
Want to improve this question? Update the question so it's on-topic for Stack Overflow.
Closed 10 years ago.
Improve this question
I just purchased myself a IBM System x3650 and was wondering what the best way to set it up is. I'm going to be running 5 Drupal (php) websites from it. I have read numerous articles on virtualization and was wondering how I would go about doing this. Is virtualization better on a Windows machine VS Linux? Can I use Oracle VirtualBox. Any kind of help would be greatly appreciated!
If you're just going to run a bunch of websites, you don't need virtualization.
Virtualization takes your physical hardware and allows you to logically allocate it to virtual machines. You would install a hypervisor (such as Hyper-V or VMware ESX) rather than an Operating System. Then, you could create virtual machines and install Operating Systems on those (you can install any OS that the hypervisor support). Most hypervisors support Windows and Linux.
However to run 5 websites, use a web server that allows you to run multiple web sites on a single server. Both apache (httpd) and IIS (Windows Web Server) allow this. Virtualization would be overkill to accomplish this task.

Using a "local" S3 as a replacement for HDFS? [closed]

Closed. This question is off-topic. It is not currently accepting answers.
Want to improve this question? Update the question so it's on-topic for Stack Overflow.
Closed 10 years ago.
Improve this question
I have been testing out the most recent Cloudera CDH4 hadoop-conf-pseudo (i.e. MRv2 or YARN) on a notebook, which has 4 cores, 8GB RAM, and an Intel X25MG2 SSD. The OS is Ubuntu 12.04LTS 64bit. So far so good.
Looking at Setting up hadoop to use S3 as a replacement for HDFS, I would like to do it on my notebook - on this notebook, there is a S3 emulator that my colleagues and I implemented.
Nevertheless, I can't find where I can set the jets3t.properties to change the end point to localhost. I downloaded the hadoop-2.0.1-alpha.tar.gz and searched the source without finding out a clue. There is a similar Q on SO Using s3 as fs.default.name or HDFS?, but I want to use our own lightweight and fast S3 emulation layer, instead of AWS S3, for our experiments.
I would appreciate a hint as to how I can change the end point to a different hostname.
Regards,
--Zack

Running Windows inside Ubuntu [closed]

Closed. This question is off-topic. It is not currently accepting answers.
Want to improve this question? Update the question so it's on-topic for Stack Overflow.
Closed 11 years ago.
Improve this question
How can I launch the entire Windows operating system inside Ubuntu. Something similar to Parralel Desktops. Free, please.
The keyword you're looking for is virtualization. There are MANY tools for hosting virtual machines. Ububtu has information specific to it, so you might start there. Other common options are virtual box, vmware, and xen, among many others. All of these have free options.
I'm doing the opposite (an entire Ubuntu box inside of Windows) via http://www.virtualbox.org/
The same thing will work for your case (Windows inside Ubuntu).

Resources