Question

In: Computer Science

explain in short for the following a) hadoop is not fit for lot of small files...

explain in short for the following
a) hadoop is not fit for lot of small files why?
b) what is the good upper limit for input split? why?
c) what happens if the input splits are too small for a small file ?
d) The amount of data is less so why to prevent map reduce ?

Solutions

Expert Solution

a)hadoop is not fit for lot of small files why?

Answer: Hadoop is used for big data and is not suited fro small data.Hadoop has some limitations with small files or data because of its high capacity design.Small files are significantly smaller than the Hadoop File System which is default 128 MB.

b) what is the good upper limit for input split? why?

Answer: By default the split size is approximately equal to block size of HDFS. Input split is user defined and the user can control split size based on the size of data in MapReduce program.The reason for this is to minimize the cost of seek and reduce the meta data information generated per block.

c) what happens if the input splits are too small for a small file ?

Answer: The amount of processing time per file will be huge.So we need to reduce the split size so that we can utilize more nodes.

d) The amount of data is less so why to prevent map reduce ?

Answer: Hadoop is highly scalable.This is largely beacause of its ability to distrubute large data sets.Map reduce allows the storage and processing the data in very affordable way.That it can also be used for later times.Mapreduce provide security for the data storage.


Related Solutions

. The Hadoop framework includes many parts. Research more about following topics and describe briefly. Explain...
. The Hadoop framework includes many parts. Research more about following topics and describe briefly. Explain the use of Hadoop.
Write a short paragraph on the following topic: Using social media such as Facebook consumes lot...
Write a short paragraph on the following topic: Using social media such as Facebook consumes lot of valuable time that results in decreased productivity both personally and professionally
Topic: Data Models Explain what Hadoop is in detail, and what are its basic components?
Topic: Data Models Explain what Hadoop is in detail, and what are its basic components?
How do long and short term objectives fit in with operational and investment plans?
How do long and short term objectives fit in with operational and investment plans?
An Oxxo has a small parking lot with three spaces reserved for customers. If the store...
An Oxxo has a small parking lot with three spaces reserved for customers. If the store is open cars arrive and use a space with an average rate of 2 per hour. For n = 0, 1, 2, 3, the probability P n of that there are exactly n spaces occupied is P 0 = 0.1, P 1 = 0.2, P 2 = 0.4, P 3 = 0.3. a) Describe the interpretation of this parking lot as a queuing system,...
I know it's a lot but they are all small questions related to the same case...
I know it's a lot but they are all small questions related to the same case so I didn't know how to split it into multiple questions. I would really appreciate the help, as I need a way to compare my answers. Dallas & Associates Financial Statement Preparation & Analysis You have been hired as a senior financial analyst for Dallas and Associates and you are in charge of preparing the financial statements and presenting an annual analysis at the...
I know it's a lot but they are all small questions related to the same case...
I know it's a lot but they are all small questions related to the same case so I didn't know how to split it into multiple questions. I would really appreciate the help, as I need a way to compare my answers. Dallas & Associates Financial Statement Preparation & Analysis You have been hired as a senior financial analyst for Dallas and Associates and you are in charge of preparing the financial statements and presenting an annual analysis at the...
Which of the following statements are true regarding AMF/3MF files, in comparison to STL files? Select...
Which of the following statements are true regarding AMF/3MF files, in comparison to STL files? Select all that apply. a) AMF/3MF files can represent overhangs of greater than 45 degrees, while STL files cannot b) AMF/3MF files do not require the model to be sliced before printing c) AMF files can specify multiple materials within the same part d) The 3MF file format can enumerate the structure of a lattice as a periodic unit, making the file size more compact
Answer the following questions for Small Open Economy in the short run with floating exchange rates...
Answer the following questions for Small Open Economy in the short run with floating exchange rates (SOE in the SR) a) In the Mundell–Fleming model (SOE in the SR) with floating exchange rates, explain what happens to aggregate income, the exchange rate, and the trade balance when taxes are decreased. (8 points) b) In the Mundell–Fleming model (SOE in the SR) with floating exchange rates, explain what happens to aggregate income, the exchange rate, and the trade balance when the...
Each of the following files in the Chapter15 folder of your downloadable student files has syntax and/or logic errors.
Each of the following files in the Chapter15 folder of your downloadable student files has syntax and/or logic errors. In each case, determine the problem and fix the program. After you correct the errors, save each file using the same filename preceded with Fix. For example, DebugFifteen1.java will become FixDebugFifteen1.java. a. DebugFifteen1.java b. DebugFifteen2.java c. DebugFifteen3.java d. DebugFifteen4.java    
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT