Start and connect to master¶
Start an EMR Cluster¶
Create an EMR cluster, use ‘advanced option’ to create a cluster with the required spec.
- Specify at least 30GB space for Root device EBS volume.
- To login into cluster, choose a EC2 key pair.
Connect to Master Node¶
Log into master and all worker nodes as “hadoop” user.
ssh <key> hadoop@master-public-ip and install required libs.
Setup and check passwordless SSH between cluster machines
ssh hadoop@localhost
ssh hadoop@worker-public-ip