The Hadoop framework includes many parts. Research more about the following topics and describe them briefly. Explain the use of Hadoop.
Before exploring how Hadoop can be used and how it helps today's world, consider an example.
Example...
Consider Google (the giant company), most popularly used as a search engine. It has to store a lot of information in its databases, but most of that data is in unstructured formats such as videos, images, audio, and web logs, which results in a humongous amount of data. For every query we search on Google, the data has to be managed quickly so that results can be provided. Not only Google, but also Facebook, WhatsApp, and other social networking platforms hold a lot of information/data in unstructured formats. Managing this data is therefore very crucial, and the rapid increase in data is of significant importance. Thus the concept of Big Data came into the picture.
When we have a large amount of data, we need high-end servers with a huge amount of storage and memory. If the amount of data increases, the storage capacity needs to increase, or the data has to be moved to a machine with higher capacity, more memory, and massive processing power. However, such servers (rack servers) are very expensive, and keeping everything on a single server is not feasible because it would lead to frequent failures while processing the data.
If we place multiple servers that have high storage and memory, we still cannot process the data on all of them at the same time, and even where we can, we cannot process it as fast as required, so this approach has drawbacks; this is how the concept of "Distributed Computing" came into existence. Moreover, if there is huge data (petabytes), transferring it over the network would be a time-consuming process. Considering all these problems, the concept of commodity hardware was introduced.
In the concept of Big Data, processing is directed at huge amounts of data. To manage that data, commodity hardware (economical, low-performance systems) is used, and distributed computing is applied on top of it. So, if there is a failure in one system, data replication makes it possible for the work to be taken over and executed on some other node.
Thus the Hadoop project came into use for distributed computing, and it is currently maintained by the Apache Software Foundation. Basically, Hadoop is a Java-based open source framework that is used as one way of storing enormous data sets across a distributed cluster of servers.
The data processing framework is the tool used to work with the data itself; in Hadoop, that framework is MapReduce.
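To make the processing framework concrete, here is a minimal sketch of the classic MapReduce word count in Java. The class and job names are only illustrative, and it assumes the standard Hadoop MapReduce client libraries are available on the classpath.

import java.io.IOException;
import java.util.StringTokenizer;

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.io.IntWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Job;
import org.apache.hadoop.mapreduce.Mapper;
import org.apache.hadoop.mapreduce.Reducer;
import org.apache.hadoop.mapreduce.lib.input.FileInputFormat;
import org.apache.hadoop.mapreduce.lib.output.FileOutputFormat;

public class WordCount {

  // Mapper: emits (word, 1) for every word in its input split
  public static class TokenizerMapper extends Mapper<Object, Text, Text, IntWritable> {
    private final static IntWritable one = new IntWritable(1);
    private final Text word = new Text();

    public void map(Object key, Text value, Context context)
        throws IOException, InterruptedException {
      StringTokenizer itr = new StringTokenizer(value.toString());
      while (itr.hasMoreTokens()) {
        word.set(itr.nextToken());
        context.write(word, one);
      }
    }
  }

  // Reducer: sums the counts produced for each word across all mappers
  public static class IntSumReducer extends Reducer<Text, IntWritable, Text, IntWritable> {
    private final IntWritable result = new IntWritable();

    public void reduce(Text key, Iterable<IntWritable> values, Context context)
        throws IOException, InterruptedException {
      int sum = 0;
      for (IntWritable val : values) {
        sum += val.get();
      }
      result.set(sum);
      context.write(key, result);
    }
  }

  public static void main(String[] args) throws Exception {
    Job job = Job.getInstance(new Configuration(), "word count");
    job.setJarByClass(WordCount.class);
    job.setMapperClass(TokenizerMapper.class);
    job.setCombinerClass(IntSumReducer.class);
    job.setReducerClass(IntSumReducer.class);
    job.setOutputKeyClass(Text.class);
    job.setOutputValueClass(IntWritable.class);
    FileInputFormat.addInputPath(job, new Path(args[0]));   // input directory in HDFS
    FileOutputFormat.setOutputPath(job, new Path(args[1])); // output directory in HDFS
    System.exit(job.waitForCompletion(true) ? 0 : 1);
  }
}

Each map task runs on the node where its block of input data is stored, which is exactly the "move the computation to the data" idea described above.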
Hadoop clusters are majorly classified into two categories.
HADOOP 1.X cluster:
It contains the NameNode, DataNode, JobTracker, and TaskTracker daemons.
HADOOP 2.X cluster:
The 1.X cluster has its limitations in terms of scalability and availability. To overcome these drawbacks, the Apache Hadoop community released Hadoop 2.X.
Hadoop provides three layers as its backbone. They are:
HDFS (Hadoop Distributed File System) - the storage layer
YARN (Yet Another Resource Negotiator) - the cluster resource management layer
MapReduce - the data processing layer
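As a small illustration of the storage layer, the sketch below uses the HDFS Java API to write a file and read it back. The NameNode address (namenode-host) and the file path are placeholders, and a running HDFS cluster is assumed.

import java.nio.charset.StandardCharsets;

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.FSDataInputStream;
import org.apache.hadoop.fs.FSDataOutputStream;
import org.apache.hadoop.fs.FileSystem;
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.io.IOUtils;

public class HdfsExample {
  public static void main(String[] args) throws Exception {
    Configuration conf = new Configuration();
    // "namenode-host" is a placeholder for the actual NameNode address
    conf.set("fs.defaultFS", "hdfs://namenode-host:8020");

    FileSystem fs = FileSystem.get(conf);
    Path file = new Path("/user/demo/sample.txt");

    // Write a small file; HDFS splits large files into blocks and
    // replicates each block across several DataNodes for fault tolerance
    try (FSDataOutputStream out = fs.create(file, true)) {
      out.write("hello hadoop".getBytes(StandardCharsets.UTF_8));
    }

    // Read the file back and print it to the console
    try (FSDataInputStream in = fs.open(file)) {
      IOUtils.copyBytes(in, System.out, 4096, false);
    }
    fs.close();
  }
}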
In addition to these layers, there are other important components:
Resource Manager:
It is responsible only for scheduling the required resources to applications.
Node Manager:
It is responsible for managing the life cycle of the containers running on its node.
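As a rough sketch of how these two components fit together, the snippet below uses the YARN client API in Java to ask the ResourceManager for the NodeManagers it is currently tracking. It assumes a running Hadoop 2.X cluster whose ResourceManager address is available from yarn-site.xml on the classpath; the class name is only illustrative.

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.yarn.api.records.NodeReport;
import org.apache.hadoop.yarn.api.records.NodeState;
import org.apache.hadoop.yarn.client.api.YarnClient;

public class ListClusterNodes {
  public static void main(String[] args) throws Exception {
    // Picks up the ResourceManager address from yarn-site.xml on the classpath
    Configuration conf = new Configuration();

    YarnClient yarnClient = YarnClient.createYarnClient();
    yarnClient.init(conf);
    yarnClient.start();

    // The ResourceManager tracks every NodeManager in the cluster;
    // each NodeManager reports the containers it is currently running
    for (NodeReport node : yarnClient.getNodeReports(NodeState.RUNNING)) {
      System.out.println(node.getNodeId()
          + "  running containers = " + node.getNumContainers());
    }

    yarnClient.stop();
  }
}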