Big Data: Hadoop

Now we are about to start on an open source framework, i.e. Hadoop.

What is Hadoop?

Ans: Hadoop is basically a combination of HDFS and MapReduce. Now the question arises: what are HDFS and MapReduce? To understand this, let's dig into the history of Hadoop.

Hadoop was created by Doug Cutting.
It all started with Google, which analysed the problems of Big Data and, to overcome them, proposed the Google File System (GFS). GFS is a distributed file system designed to provide efficient, reliable access to data using large clusters of commodity hardware.
MapReduce was introduced after GFS. It is a programming model rather than a file system: the data (for example, log files) is stored in the distributed file system, and MapReduce takes care of processing and generating large data sets with a parallel, distributed algorithm on a cluster.
In 2003, Google published its paper on GFS (Google File System).
In 2004, Google published its paper on MapReduce.
In 2006-07, HDFS (Hadoop Distributed File System) was launched.
In 2007-08, Hadoop's MapReduce, the technique for processing the files stored in HDFS, was launched.
HDFS (Hadoop Distributed File System) is the technique for storing data across clusters of commodity hardware.
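To make the MapReduce idea a little more concrete, here is a minimal sketch of a word count in plain Python. It is only an illustration of the map, group and reduce steps running in a single process. It is not Hadoop's actual API, and the function names (map_phase, shuffle_phase, reduce_phase, word_count) are made up for this example. In a real cluster the input lines would come from files in HDFS and the phases would run in parallel on many machines.

```python
from collections import defaultdict


def map_phase(line):
    """Map step: emit a (word, 1) pair for every word in one input line."""
    return [(word.lower(), 1) for word in line.split()]


def shuffle_phase(pairs):
    """Group step: collect all values that share the same key."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(key, values):
    """Reduce step: combine all values for one key into a single count."""
    return key, sum(values)


def word_count(lines):
    """Run map, group and reduce over a list of text lines (single process)."""
    pairs = []
    for line in lines:
        pairs.extend(map_phase(line))
    grouped = shuffle_phase(pairs)
    return dict(reduce_phase(key, values) for key, values in grouped.items())


if __name__ == "__main__":
    # Hypothetical sample input standing in for lines of a log file.
    sample = ["big data big cluster", "data on a cluster"]
    print(word_count(sample))
```

The point of splitting the work this way is that each map call only looks at one line and each reduce call only looks at one key, so the framework is free to spread those calls over many machines.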

Why is Hadoop called 'Hadoop', and what is the reason behind its elephant symbol?

Ans: The answer sounds crazy, but it is true. Doug Cutting's child was playing with a toy elephant whose name was 'Hadoop'. Cutting took that name for the project, and the elephant became its symbol.

Tuesday, 19 May 2015
