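HDFS (Hadoop Distributed File System) is the distributed file system used by Hadoop. It was developed as part of the Apache Hadoop project, and its design is modeled on the Google File System (GFS). It provides a reliable, highly available store for data processing. Let us take a look at HDFS and its architecture.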
We need a Hadoop environment for practice, and setting one up on Linux is no fun. A better alternative is to use the Cloudera virtual machine. Let me show you how we can set up and use the Cloudera QuickStart VM to get hands-on practice with Hadoop.