Let's share the best answer to this interview question.
Leone Costa
First, let's understand what actually happens in a Hadoop cluster. A Hadoop cluster follows a master/slave concept: the master machine processes all the data, while the slave machines store the data and act as data nodes. Since all the storage happens on the slaves, a higher-capacity hard disk is recommended for them, and since the master does all the processing, it needs more RAM and a much better CPU. You can therefore select the configuration of each machine depending on your workload. For example, in this case c4.8xlarge would be preferred for the master machine, whereas for the slave machines we can select an i2.large instance.

If you don't want to deal with configuring your instances and installing the Hadoop cluster manually, you can straight away launch an Amazon EMR (Elastic MapReduce) cluster, which automatically configures the servers for you. You dump the data to be processed into S3, EMR picks it up from there, processes it, and dumps it back into S3.
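As a rough illustration of the EMR route, the sketch below builds the kind of request dictionary that boto3's EMR client accepts in `run_job_flow`. It only constructs the dictionary and does not call AWS. The function name `build_emr_request`, the bucket paths, the cluster name, the release label, the worker count, and the `cat`/`wc` streaming step are all assumptions made for illustration; the instance types are the ones named in the answer, and the sketch does not check whether EMR offers them in your region.

```python
def build_emr_request(input_uri, output_uri,
                      master_type="c4.8xlarge",  # compute-heavy type from the answer
                      worker_type="i2.large",    # storage-heavy type from the answer
                      worker_count=2):           # assumed count, not from the answer
    """Build a request dict shaped for boto3's EMR run_job_flow call.

    Nothing is sent to AWS here; the caller decides whether to submit it.
    """
    return {
        "Name": "hadoop-interview-example",      # hypothetical cluster name
        "ReleaseLabel": "emr-6.15.0",            # assumed EMR release
        "Instances": {
            "InstanceGroups": [
                {
                    "Name": "Master",
                    "InstanceRole": "MASTER",
                    "InstanceType": master_type,
                    "InstanceCount": 1,
                },
                {
                    "Name": "Workers",
                    "InstanceRole": "CORE",      # core nodes store data in HDFS
                    "InstanceType": worker_type,
                    "InstanceCount": worker_count,
                },
            ],
            # Shut the cluster down once the step has finished.
            "KeepJobFlowAliveWhenNoSteps": False,
        },
        "Steps": [
            {
                "Name": "process-s3-data",
                "ActionOnFailure": "TERMINATE_CLUSTER",
                "HadoopJarStep": {
                    "Jar": "command-runner.jar",
                    # Placeholder streaming job: read from S3, write back to S3.
                    "Args": [
                        "hadoop-streaming",
                        "-input", input_uri,
                        "-output", output_uri,
                        "-mapper", "cat",
                        "-reducer", "wc",
                    ],
                },
            }
        ],
        "JobFlowRole": "EMR_EC2_DefaultRole",    # default role names, assumed to exist
        "ServiceRole": "EMR_DefaultRole",
    }


# Hypothetical bucket paths, for illustration only.
request = build_emr_request("s3://example-bucket/input/",
                            "s3://example-bucket/output/")
```

With credentials and the default EMR roles in place, the dictionary could be submitted with something like `boto3.client("emr").run_job_flow(**request)`. Keeping the instance types as parameters reflects the point of the answer: the master and the storage nodes are sized separately, according to the workload.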