Hello everyone, in this article we are going to learn how to set up Cloudera Quickstart VM on a Windows machine. When we start learning Hadoop, we need its installation on a server or standalone system to practice. Though setting up Hadoop on a single machine is not very difficult, there is an easier way to set up a Hadoop environment quickly. We can use Cloudera’s Quickstart VMs. These VMs have Hadoop preconfigured and they are free and quick to set up.

Prerequisite

For Cloudera Quickstart VM to run on a single system,  it should have at least 8 GB of RAM. We can run this on a system with 4GB ram, but performance will be abysmal.

After this, we will need to download hypervisor.  The hypervisor is a piece of software on which wraps our VM and runs on a host machine.  We can set up Cloudera Quickstart VM using three hypervisors.

  1. VMWare (need to pay for a license after free trial)
  2. Virtualbox (opensource)
  3. KVM (opensource)

You can choose any of these hypervisors. In this article, we are going to use Virtualbox. You can download Virtualbox from here. Choose windows host and download Virtualbox. Once the download is complete, install Virtualbox on your machine.

Cloudera Quickstart VM

After that, we need to download VM from Cloudera. You can get that from this link. On this page, in platform choose Virtualbox and click on GET IT NOW. Then it will ask you to fill up a small form. Once you fill up these details and accept terms and conditions, VM download will begin automatically. After the download is complete, we need to extract VM. We can use 7zip for this.

Setting up VM

Now start Virtualbox and click on the Settings icon and then System and choose motherboard tab. There you can set how much RAM Virtualbox can use. Set this value to 4GB or more.

Cloudera quickstart VM setup on windows

After that, click on File-> Import Appliance. From a new window, browse to location to where you have extracted Quickstart VM in an earlier step. Choose *.ovf file. Then click on Next then on import.

It starts importing Cloudera VM on Virtualbox. You can run this VM by clicking on the Start icon in Virtualbox.

Validating VM Set up

Once VM starts running, we can validate it by logging to hue.  The default username and password for this all Cloudera VMs is “cloudera”, “cloudera” respectively.  You can check different services like Hive, Pig from hue.

login in to hue on Cloudera VM

Conclusion

In this article, we have set up Cloudera Quickstart VM on windows. We will use the same for our future tutorials and Hadoop practice. If you have faced any issues with this setup, then please ask me in the comment section below. See you in the next article.

Set up Cloudera in windows
Mahesh Mogal

Mahesh Mogal

I am passionate about Cloud, Data Analytics, Machine Learning, and Artificial Intelligence. I like to learn and try out new things. I have started blogging about my experience while learning these exciting technologies.

Table of Contents
    Add a header to begin generating the table of contents

    Stay updated with latest blogs

    Posts you may be interested in

    Manage S3 Bucket Polices
    S3

    Set, Get and Delete AWS S3 bucket policies

    In this blog, we are going to learn how to get, put and delete S3 bucket policies suing S3 Console as well as programmatically using AWS CLI & Python

    Manage S3 Bucket Polices
    Read More →
    iam policy vs s3 policy vs s3 acls
    S3

    IAM Policies VS S3 Policies VS S3 Bucket ACLs – What should be used?

    You can manage S3 permission using IAM policy or S3 Policy or S3 ACLs. We will understand the difference between them and use cases for each way.

    iam policy vs s3 policy vs s3 acls
    Read More →
    Create S3 bucket
    S3

    Create S3 bucket using AWS CLI and Python Boto3

    In this blog, we are going to learn how to create an S3 bucket using AWS CLI, Python Boto3 and S3 management console.

    Create S3 bucket
    Read More →

    Leave a Comment

    Your email address will not be published. Required fields are marked *

    Share via
    Copy link