Cloudera Developer Training for Apache Hadoop
Course Summary
This four-day training course is for developers who want to learn to use Apache Hadoop to build powerful data processing applications.
You Will Learn
- The core technologies of Hadoop
- How HDFS and MapReduce work
- How to develop MapReduce applications
- How to unit test MapReduce applications
- How to use MapReduce combiners, partitioners and the distributed cache
- Best practices for developing and debugging MapReduce applications
- How to implement data input and output in MapReduce applications
- Algorithms for common MapReduce tasks
- How to join data sets in MapReduce
- How Hadoop integrates into the data center
- How to use Mahout’s machine learning algorithms
- How Hive and P…
There are no frequently asked questions yet. If you have any more questions or need help, contact our customer service.
Course Summary
This four-day training course is for developers who want to learn to use Apache Hadoop to build powerful data processing applications.
You Will Learn
- The core technologies of Hadoop
- How HDFS and MapReduce work
- How to develop MapReduce applications
- How to unit test MapReduce applications
- How to use MapReduce combiners, partitioners and the distributed cache
- Best practices for developing and debugging MapReduce applications
- How to implement data input and output in MapReduce applications
- Algorithms for common MapReduce tasks
- How to join data sets in MapReduce
- How Hadoop integrates into the data center
- How to use Mahout’s machine learning algorithms
- How Hive and Pig can be used for rapid application development
- How to create large workflows using Oozie
Prerequisites
This course is appropriate for developers who will be writing, maintaining and/or optimizing Hadoop jobs. Participants should have programming experience; knowledge of Java is highly recommended. Understanding of common computer science concepts is a plus. Prior knowledge of Hadoop is not required.
Hands-On Exercises
Throughout the course, students write Hadoop code and perform other hands-on exercises to solidify their understanding of the concepts being presented.
Certification Exam
Following successful completion of the training class, attendees receive a Cloudera Certified Developer for Apache HBase (CCDH) practice test. Cloudera training and the practice test together provide the best resources to prepare for the certification exam.
Outline
- Introduction
- The Motivation for Hadoop
- Hadoop: Basic Concepts
- Writing a MapReduce Program
- Unit Testing MapReduce Programs
- Delving Deeper into the Hadoop API
- Practical Development Tips and Techniques
- Data Input and Output
- Common MapReduce Algorithms
- Joining Data Sets in MapReduce Jobs
- Integrating Hadoop into the Enterprise Workflow
- Machine Learning and Mahout
- An Introduction to Hive and Pig
- An Introduction to Oozie
- Conclusion
- Appendix: Graph Processing in Map Reduce
There are no frequently asked questions yet. If you have any more questions or need help, contact our customer service.
