Cloudera Administrator Training for Apache Hadoop
Course Summary
This four-day hands-on training course is for system administrators and others responsible for managing Apache Hadoop clusters in production or development environments.
You Will Learn
- How the Hadoop Distributed File System and MapReduce work
- What hardware configurations are optimal for Hadoop clusters
- What network considerations to take into account when building out your cluster
- How to configure Hadoop's options for best cluster performance
- How to configure NameNode High Availability
- How to configure NameNode Federation
- How to configure the FairScheduler to provide service-level agreements for multiple users of a cluster
- How to install and implement Kerberos-based security for your cluster
- How to maintain and monitor your cluster
- How to load data into the cluster from dynamically generated files using Flume and from relational database management systems using Sqoop
- What system administration issues exist with other Hadoop projects such as Hive, Pig, and HBase
Prerequisites
This course is appropriate for system administrators who will be setting up or maintaining a Hadoop cluster. Basic Linux system administration experience is a prerequisite for this training session. Prior knowledge of Hadoop is not required.
Hands-On Exercises
Throughout the course, hands-on exercises help students build their knowledge and apply the concepts being discussed.
Certification Exam
After successfully completing the training class, attendees receive a Cloudera Certified Administrator for Apache Hadoop (CCAH) practice test. The training and the practice test together are intended to prepare attendees for the certification exam.
Outline
- Introduction
- The Case for Apache Hadoop
- HDFS
- Getting Data into HDFS
- MapReduce
- Planning Your Hadoop Cluster
- Hadoop Installation and Initial Configuration
- Installing and Configuring Hive, Impala, and Pig
- Hadoop Clients
- Cloudera Manager
- Advanced Cluster Configuration
- Hadoop Security
- Managing and Scheduling Jobs
- Cluster Maintenance
- Cluster Monitoring and Troubleshooting
- Conclusion
