Course name:
Hadoop workshop for Administrators (Live Virtual)
Description:
In this course, attendees will learn about the business benefits and use cases for Hadoop and its ecosystem, how to plan cluster deployment and growth, how to install, maintain, monitor, troubleshoot and optimize Hadoop. They will also practice cluster bulk data load, get familiar with various Hadoop distributions, and practice installing and managing Hadoop ecosystem tools. The course finishes off with discussion of securing cluster with Kerberos.
Duration:
3 Days
Audience:
Administrator, Architect, Developer
Prerequisites:
Basic Linux system administration and scripting
Course outline:
<p><span style="font-family: Times New Roman; font-size: medium;"> Introduction <br />Hadoop history, concepts, Ecosystem, Distributions, High level architecture, Hadoop myths, Hadoop challenges, (hardware / software)<br /></span></p> <p><span style="font-family: Times New Roman; font-size: medium;">Planning and installation <br />Selecting software, Hadoop distributions, Sizing the cluster, planning for growth, Selecting hardware and network, Rack topology, Installation, Multi-tenancy, Directory structure, logs, Benchmarking<br /></span></p> <p><span style="font-family: Times New Roman; font-size: medium;">HDFS operations <br />Concepts (horizontal scaling, replication, data locality, rack awareness), Nodes and daemons (NameNode, Secondary NameNode, HA Standby NameNode, DataNode), Health monitoring, Command-line and browser-based administration<br />Adding storage, replacing defective drives<br /></span></p> <p><span style="font-family: Times New Roman; font-size: medium;">MapReduce operations <br />Parallel computing before mapreduce: compare HPC vs Hadoop administration, MapReduce cluster loads, Nodes and Daemons (JobTracker, TaskTracker), MapReduce UI walk through, Mapreduce configuration, Job config, Job schedulers, Administrator view of MapReduce best practices, Optimizing MapReduce, Fool proofing, YARN architecture and use<br /></span></p> <p><span style="font-family: Times New Roman; font-size: medium;">Advanced topics <br />Hardware monitoring, System software monitoring, Hadoop cluster monitoring, Adding and removing servers, upgrading Hadoop, Backup, recovery and business continuity planning, Cluster configuration tweaks, Hardware maintenance schedule, Oozie scheduling for administrators, Securing your cluster with Kerberos.</span></p> <p>&nbsp;</p> <p>&nbsp;</p>

Copyright © 2011-2014 Aziksa Inc, All rights reserved.