Big Data (Hadoop / Udemy) #1
Background
BIG, page 12m-16b + 21-28b
https://www.udemy.com/hadoopstarterkit/ Section 1+2, lesson 1-3
https://www.udemy.com/hadoopstarterkit/ Section 3, lesson 4+6
https://www.udemy.com/hadoopstarterkit/ Section 4, lesson 7+8
https://www.udemy.com/hadoopstarterkit/ Section 3, lesson 5
Assignment 1 Big Data Quiz
After viewing course content section 1 (lesson 1) + 2 (lesson 2+3):
Assignment 2 HDFS
After viewing course content section 3 (lesson 4 + 5 + 6):
-
-
Steps to get cluster access:
- Click on link in Udemy course section 3, lesson 5
- On the web page shown, click on the big, yellow box ”Give me access to Hadoop Cluster”
- Fill in the form, and you will get an email with further instructions
- In the received email, click on the link cluster-key.zip (download). This downloads a zip file; unpack it
-
Initial setup: get PuTTY to access the Udemy Hadoop sandbox (cluster)
- Download and install PuTTY (exe file) from http://www.chiark.greenend.org.uk/~sgtatham/putty/latest.html
- Start PuTTY and change the following settings:
- Category ”Session”: Host name / IP address: type in the IP address received in your email
- Category ”Connection” > ”SSH” > ”Auth”: click on the Browse button and select the .ppk file from the unzipped cluster-key.zip
- Category ”Session”: type a name (e.g. Hadoop Cluster) in the field called ”Saved Sessions” and click the Save button
- Start the PuTTY client by clicking on the Open button
- In the terminal window that opens, enter the user name from the received email (copy it, then right-click in the PuTTY window to paste it) and press Return
- You now have access to the AWS cluster
-
After the initial setup: normal start of PuTTY
- Start PuTTY
- Category ”Session”: select the saved session (”Hadoop Cluster”) and click the Load button
- Click the Open button
- In the terminal window that opens, enter the user name from the received email (copy it, then right-click in the PuTTY window to paste it) and press Return
- You now have access to the AWS cluster
-
Work with HDFS
- Try the different file commands from section 3, lesson 5 "Working With HDFS"
After viewing course content section 4 (lesson 7 + 8):
-
Extra 3 MapReduce advanced (includes Java programming)
After viewing course content section 4 (lesson 9 + 10):
-
Use your Hadoop cluster access
- Max Close Price Reducer
- Max Close Price Mapper
- Max Close Price
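
Before running the three Max Close Price classes on the cluster, it can help to see the map, shuffle and reduce steps in plain Java. The sketch below is not the course code and does not use the Hadoop API. The class name MaxClosePriceSketch, the sample records and the assumed CSV layout (exchange, symbol, date, open, high, low, close, volume, adjusted close) are illustrative assumptions; check the column positions against the dataset used in the lessons.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

// Local, Hadoop-free illustration of the "max close price per stock symbol" job.
class MaxClosePriceSketch {

    // Map step: parse one CSV record and emit (symbol, close price).
    // Assumed layout: exchange,symbol,date,open,high,low,close,volume,adj_close
    static Map.Entry<String, Double> map(String line) {
        String[] fields = line.split(",");
        return Map.entry(fields[1], Double.parseDouble(fields[6]));
    }

    // Reduce step: keep the largest close price seen for one symbol.
    static double reduce(List<Double> closes) {
        double max = Double.NEGATIVE_INFINITY;
        for (double close : closes) {
            max = Math.max(max, close);
        }
        return max;
    }

    // Driver: map every line, group values by key (the "shuffle"), then reduce each group.
    static Map<String, Double> run(List<String> lines) {
        Map<String, List<Double>> grouped = new TreeMap<>();
        for (String line : lines) {
            Map.Entry<String, Double> kv = map(line);
            grouped.computeIfAbsent(kv.getKey(), k -> new ArrayList<>()).add(kv.getValue());
        }
        Map<String, Double> result = new TreeMap<>();
        for (Map.Entry<String, List<Double>> e : grouped.entrySet()) {
            result.put(e.getKey(), reduce(e.getValue()));
        }
        return result;
    }

    public static void main(String[] args) {
        // Made-up sample records, only for illustration.
        List<String> sample = List.of(
            "NYSE,AAA,2010-02-08,4.42,4.42,4.21,4.24,205500,4.24",
            "NYSE,AAA,2010-02-09,4.30,4.60,4.30,4.55,180000,4.55",
            "NYSE,BBB,2010-02-08,10.00,10.20,9.90,10.10,1000,10.10");
        run(sample).forEach((symbol, max) -> System.out.println(symbol + "\t" + max));
    }
}
```

On the cluster, the grouping done here in run() is handled by the Hadoop framework between the mapper and the reducer, so the job's own classes only need to supply the map and reduce logic.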