GIS Tools for Hadoop for Beginners
#Try out GIS Tools for Hadoop with a Virtual Machine

This tutorial walks you through setting up a virtual machine (VM) and running GIS Tools for Hadoop without a cluster. The example is oriented towards Windows users.
##Requirements
- A virtual machine with a Hadoop environment. We used the Hortonworks Sandbox with VirtualBox.
  - At the time of writing, we had errors installing HDP v2.2, while HDP v2.1 worked well.
- Once you have downloaded a VM and Hadoop environment, complete the set-up by following the steps in the installation guide.
###Optional
Cygwin, with the openssh package installed (it is not selected by default in a standard install).
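
One way to confirm that openssh made it into a Cygwin install is to look for `ssh` on the PATH. The snippet below is only a sketch; the helper name `have_command` is our own and is not part of Cygwin or GIS Tools for Hadoop.

```shell
# Return success if the named command can be found on the PATH.
# (have_command is a hypothetical helper name used for illustration.)
have_command() {
    command -v "$1" >/dev/null 2>&1
}

if have_command ssh; then
    echo "ssh found: $(command -v ssh)"
else
    echo "ssh not found; re-run the Cygwin installer and select the openssh package"
fi
```

Run it from a Cygwin terminal. If `ssh` is reported as missing, the Cygwin steps later in this tutorial will not work until the package is added.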
##Instructions
1. Open the VM and click the green `Show` arrow. If the Hortonworks Sandbox is not listed on the left of the VirtualBox window, you will need to add it by following the installation guide instructions.
2. Make note of the IP address; you will use it later to access the Sandbox over ssh:
   `ssh root@<IP address> -p 2222`
3. Press `Alt + F5` and log in with username `root` and password `hadoop`.
4. Type `ls` and press Enter; you will see the files listed in the folder `root`.
5. Make a folder named `esri-git` to hold the GitHub project: type `mkdir esri-git` and press Enter.
6. Type `ls` and press Enter again; you will see the newly created folder.
7. Type `cd esri-git` to enter the newly created directory.
8. You are now going to clone the GitHub repository. Since VirtualBox does not recognise the web address "www.github.com", you need to find the IP address of github.com. In either Windows Command Prompt or Cygwin, type:
   `ping github.com`
   Once a ping has been returned, press `Ctrl + C` to stop the responses and make note of the IP address. In this example it is 190.30.252.131.
9. In VirtualBox, type:
   `git clone git@xxx.xx.xxx.xxx:Esri/gis-tools-for-hadoop.git`
   where `xxx.xx.xxx.xxx` is the IP address from step 8.
10. You have now cloned the GIS Tools for Hadoop toolkit. If you would like to work in Cygwin (which is much easier to use), continue on; if not, skip to step 14.
11. In Cygwin, type `ssh root@<IP address> -p 2222` (from step 2).
12. Enter the password: `hadoop`
13. Change directories if needed (`cd esri-git`).
14. You are now able to complete the sample. Completing the sample successfully means that everything is installed correctly, and it will give you a good introduction to using the framework.
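
Steps 8 and 9 can be sketched as two small shell helpers: one pulls the first IPv4 address out of ping output, the other prints the matching clone command. This is an illustration only. The function names `first_ipv4` and `clone_command` are hypothetical, the sample ping line imitates the Windows output format, and the `git@` user is an assumption based on the usual GitHub SSH clone form.

```shell
# Print the first IPv4-looking address found in the text read from stdin.
# Relies on grep's -o option (available in GNU and BSD grep).
first_ipv4() {
    grep -Eo '([0-9]{1,3}\.){3}[0-9]{1,3}' | head -n 1
}

# Print the clone command for a given IP address.
# Assumption: the "git" SSH user, as in GitHub's standard SSH clone URLs.
clone_command() {
    printf 'git clone git@%s:Esri/gis-tools-for-hadoop.git\n' "$1"
}

# Example using the address from step 8; the line mimics Windows ping output.
sample='Pinging github.com [190.30.252.131] with 32 bytes of data:'
ip=$(printf '%s\n' "$sample" | first_ipv4)
clone_command "$ip"
```

In practice you would pipe real output into the helper on the Windows side, for example `ping github.com | first_ipv4` in Cygwin, and then type the printed command in the VM.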