Skip to content

Configuring a Cloudera Distribution for Hadoop (CDH) cluster on AWS Elastic Compute Cloud (EC2) Red Hat Enterprise Linux (RHEL) images.

License

Notifications You must be signed in to change notification settings

Bruno-Jander/CDH-on-AWS-EC2-RHEL

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

28 Commits
 
 
 
 
 
 

Repository files navigation

CDH on AWS EC2 RHEL

Overview 🙇‍♂️

A simple tutorial of how to deploy a Cloudera Distribution for Hadoop (CDH) cluster on AWS Elastic Compute Cloud (EC2) instances running Red Hat Enterprise Linux (RHEL). It was written while I was watching this excellent video tutorial.

Reading order 📖

  1. Configuring_EC2_instances_on_AWS (PART 1)
  2. Installing_Cloudera_CDH_on_AWS_RHEL (PART 2)
  3. Creating_a_bare_metal_cluster (PART 3)
  4. Adding_services_to_your_bare_metal_cluster (PART 4)

The basic deployment architecture consists of the following components 🧰

  • EC2 Instances: Virtual machines on AWS running RHEL, configured to host CDH services.
  • Cloudera Manager: A centralized management tool for CDH deployments, installed on one of the EC2 instances (MASTER).
  • CDH Services: Hadoop components such as HDFS, YARN, Hive, and Impala, distributed across the EC2 instances (MASTER/WORKERs).
  • SSH client: A program that allows establishing secure and authenticated SSH connections to SSH servers (PowerShell/Bash).

GitHub Repository: @CDH-on-AWS-EC2-RHEL 🦫

About

Configuring a Cloudera Distribution for Hadoop (CDH) cluster on AWS Elastic Compute Cloud (EC2) Red Hat Enterprise Linux (RHEL) images.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published