Consider three Hadoop distributions: Cloudera CDH, Hortonworks HDP, and the MapR M-series. If your managers asked you to select one of these distributions for your organization, which 5-6 criteria would you consider when choosing a vendor? Rate each criterion from 1 to 5, with 5 being very important. Also, which vendor would you select? Please note that in this class we will be vendor-agnostic and use the open-source distribution.
The three Hadoop distributions are Cloudera, MapR, and Hortonworks.
Choosing the right Hadoop distribution for your business needs leads to faster data-driven solutions and helps your organization benefit from the expertise of the best people in the industry.
All three vendors offer a free version of their distribution for download.
MapR and Cloudera, however, also offer premium Hadoop distributions to their paying customers.
Cloudera provides a flexible, scalable, integrated platform that makes it easier to manage the growing volume and variety of data in an enterprise.
Cloudera's products let you manipulate and analyze data while keeping it secure and protected.
Its CDH product provides security and integrates with hardware and software solutions.
CDH also supports multi-cluster management.
Organizations choose Cloudera for these reasons.
The MapR Hadoop distribution aims to meet market needs faster.
Unlike Cloudera and Hortonworks, MapR takes a more distributed approach and stores metadata on the processing nodes, because it relies on its own MapR file system.
MapR provides data protection, has no single point of failure, and is marketed as the fastest of the three.
Hortonworks is an open platform and is free to use.
Hortonworks HDP can be easily downloaded and integrated.
HDP makes Hive faster and avoids vendor lock-in.
So, by weighing the features of each vendor against your organization's needs, you can choose one of them.
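One way to make that comparison concrete is a weighted scoring matrix, which also answers the "rate each criterion from 1 to 5" part of the question. The sketch below is only an illustration: the criteria names are taken from the features discussed above, but every weight and every vendor score is a hypothetical placeholder, not a measurement or a recommendation. Replace them with your organization's own ratings before drawing any conclusion.

```python
# Weighted scoring sketch for comparing Hadoop distributions.
# Weights use the 1-5 scale from the question (5 = very important).
# ASSUMPTION: all weights and scores are illustrative placeholders.
CRITERIA_WEIGHTS = {
    "cost_and_licensing": 4,
    "security": 5,
    "cluster_management": 4,
    "performance": 3,
    "high_availability": 5,
    "openness_no_lock_in": 3,
}

# Hypothetical 1-5 scores per vendor for each criterion.
VENDOR_SCORES = {
    "Cloudera CDH": {
        "cost_and_licensing": 3, "security": 5, "cluster_management": 5,
        "performance": 3, "high_availability": 3, "openness_no_lock_in": 3,
    },
    "MapR M-series": {
        "cost_and_licensing": 3, "security": 3, "cluster_management": 3,
        "performance": 5, "high_availability": 5, "openness_no_lock_in": 3,
    },
    "Hortonworks HDP": {
        "cost_and_licensing": 5, "security": 3, "cluster_management": 3,
        "performance": 3, "high_availability": 3, "openness_no_lock_in": 5,
    },
}


def weighted_score(scores, weights):
    """Sum of weight * score over every criterion in `weights`."""
    return sum(weights[name] * scores[name] for name in weights)


def rank_vendors(vendor_scores, weights):
    """Return (vendor, total) pairs sorted from highest to lowest total."""
    totals = {
        vendor: weighted_score(scores, weights)
        for vendor, scores in vendor_scores.items()
    }
    return sorted(totals.items(), key=lambda item: item[1], reverse=True)


if __name__ == "__main__":
    for vendor, total in rank_vendors(VENDOR_SCORES, CRITERIA_WEIGHTS):
        print(f"{vendor}: {total}")
```

The ranking depends entirely on the numbers you enter, so changing a single weight (for example, raising openness_no_lock_in to 5) can change which vendor comes out on top.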