Hadoop – Architecture, HDFS, EcoSystem, NoSQL Data Modeling & MapReduce Framework
Understanding the architecture of Hadoop and its related ecosystem is the first step anyone aspiring to be a Big Data professional should take. With this core understanding, they can start working professionally and implementing Hadoop-based Big Data analytics solutions.
This course is designed to provide exactly that, and more, by covering the following areas:
- Architecture
- Security Model
- Multi-tenancy
- Load Strategies
- Change Management
- Admin Intro
- Ecosystem
- NoSQL Data Modeling
In this course, we will transition the student from the current world of RDBMS-based structured data management to the file-based, unstructured data world of Hadoop.
We will get into the specifics of key parts of the Hadoop architecture, Hive, Pig and MapReduce programming with explicit use cases.
The hands-on labs will cover exploring various Hadoop components such as the NameNode and JobTracker; working with HDFS; creating tables in Hive and loading data into them; performing ETL using Pig; and the fundamentals of MapReduce.
We will be using real Hadoop clusters provisioned by ClustersToGo.com.
We always use the latest Hadoop distributions. By default, we use Cloudera’s latest Hadoop distribution.
However, based on demand, we can also use Hortonworks, MapR, and Hadoop on Windows Azure.
Prerequisites
Basic Linux command-line skills and database knowledge. Knowledge of MPP architecture is a plus.
Recommended Next Class
Hive Basics and Advanced; Part of Hadoop BI Developer’s Track
Audience
Developers, IT Administrators, Managers, Analysts, Data Scientists
What to bring to your class
Your computer and an SSH client such as putty.exe
Recommended Readings
- O’Reilly’s ‘Hadoop’ book by Tom White
Course Dates & Duration:
Saturdays: Every Saturday starting 01/05/2013
Duration: 8 am to noon PST
Course Location: Onsite
Third Eye’s Offices
3200 Coronado Dr, Santa Clara, CA 95054, 408 306 8462
Course Location: Online
Access information will be provided 12 hours before the class.
Price & Registration:
Payments must be received by Third Eye CSS 24 hours before class start time.
Cancellations must be communicated at least 12 hours before the class start time; otherwise, no refund will be issued.
Option 1:
Remit payment using PayPal to djdas@thirdeyecss.com at least 24 hours before the class.
Option 2:
Mail a check payable to “Third Eye CSS LLC.” to our mailing address: 5201 Great America Parkway, Suite 320, Santa Clara, CA 95054.
The check must be received 24 hours before the class start time.
Option 3:
Contact Information:
For additional information,
please email jeetadas@thirdeyecss.com
or call or text (408) 306-8462




