E-MapReduce (EMR) is a cloud-native open-source big data platform that provides easy-to-integrate open-source big data computing and storage engines such as Hadoop, Hive, Spark, Flink, Presto, and ClickHouse. EMR allows you to adjust computing resources based on your business needs and deploy the resources on Alibaba Cloud Elastic Search Service (ECS), Alibaba Cloud Container Service for Kubernetes (ACK), and Apsara Stack. In this blog, we are going to see how to create an Alibaba Cloud EMR cluster.
Step-1: Selecting Alibaba Cloud EMR from the Alibaba Cloud console
Step-2: Selecting the Creating Cluster option present in the Alibaba Cloud EMR cluster console as shown below diagram
Step-3: Providing Basic Configuration for Alibaba Cloud EMR Cluster to be created as shown in the below diagram.
Step-4: Providing Hardware Configuration for both master node and core node of the Alibaba Cloud EMR cluster to be created.
Step-5: Assign public IP for Master Node in the created Alibaba Cloud EMR Cluster
Step-6: Provide the configuration for the Core Node of the created Alibaba Cloud EMR Cluster.
Step-7: Provide Other Basic Configurations such as Cluster Name and Password for the Alibaba Cloud EMR Cluster to be created.
Step-8: Finally, Confirm the configuration by clicking the confirm button present in Alibaba Cloud EMR Console.
Step-9: Once the Cluster Creation Process is completed, we can view the created cluster with status as Running as depicted in the below diagram.
Step-10: After the Cluster Creation process is completed we can view the master Node and core node as shown in the below diagram.
Step-11: We can get the Public IP address of the Master Node from the basic configuration panel of Alibaba Cloud EMR Console.
Step-12: Now we can connect to the Master Node from the windows client through SSH as shown below.
Step-13: We can run the Hadoop cluster by using the command Hadoop in the command prompt as shown below.
Step-14: Finally, we can run the spark job in the created EMR Cluster by using the command spark.
Alibaba Cloud E-MapReduce (EMR), a cloud-native open-source big data platform, provides easy-to-integrate open-source big data computing and storage engines such as Hadoop, Hive, Spark, Flink, Presto, and ClickHouse. The Alibaba Cloud EMR service can also be used to create an EMR cluster within minutes with just a few mouse clicks. In this blog post, we have provided an overview of the steps involved in creating an Alibaba Cloud EMR cluster.
Working with Resource Sharing in Resource Management of Alibaba Cloud
12 posts | 3 followers
FollowAlibaba Clouder - November 16, 2017
Alibaba Clouder - March 31, 2021
Alibaba Clouder - September 2, 2019
Alibaba Clouder - September 2, 2019
Alibaba Clouder - September 27, 2019
Alibaba Clouder - October 15, 2019
12 posts | 3 followers
FollowAlibaba Cloud provides big data consulting services to help enterprises leverage advanced data technology.
Learn MoreAlibaba Cloud experts provide retailers with a lightweight and customized big data consulting service to help you assess your big data maturity and plan your big data journey.
Learn MoreApsaraDB Dedicated Cluster provided by Alibaba Cloud is a dedicated service for managing databases on the cloud.
Learn MoreA Big Data service that uses Apache Hadoop and Spark to process and analyze data
Learn MoreMore Posts by GAVASKAR S