What is the difference between Hadoop and Cloud Computing?

I knew what Hadoop was and what it did. Basically, it is a structure that allows distributed processing of large data sets through computer clusters using simple programming models. It is designed to scale from individual servers to thousands of computers, each of which offers local computing and storage.

I know little about cloud computing. Just started to figure it out. While reading, I learned that cloud computing or the cloud is a colloquial expression used to describe many different types of computing concepts, which include a large number of computers connected via a real-time communication network.

Both concepts seem to me almost the same. What is the exact difference between the two? Could you explain to me?

Thanks.

+6
source share
7 answers

In its most basic form, you can define cloud computing as the process of delivering computing resources on demand via the Internet on a pay-as-you-go basis. This includes everything from applications to data centers.

Hadoop, as you would read, is just a distributed platform. Once upon a time, people use Hadoop as a means of providing some cloud computing services. Take Amazon EMR , which uses the hosted Hadoop infrastructure running on the Amazon Elastic Compute Cloud (Amazon EC2) web infrastructure infrastructure and Amazon Simple Storage Service (Amazon S3) so we can process huge amounts of data easily and cost-effectively.

You can consider cloud computing a concept or methodology that Hadoop is not. Hadoop itself is based on a distributed computing methodology. It's like asking what the difference is between programming and Java. Programming is something that can be done using Java, but definitely Java, by itself, is not programming. The same is true for Hadoop and cloud computing.

In short, Hadoop is a platform that helps you provide cloud computing services to your customers.

Hope this answers your inquiries.

+12
source

Cloud computing is a model that enables ubiquitous, convenient, and on-demand network access for a variety of configured computing resources on the Internet or intranet. It provides services such as PaaS, SaaS and IaaS. Hadoop, on the other hand, is an open source Apache Software Foundation project that enables distributed processing of large datasets across node clusters. This is mainly based on the concepts of HDSF and MapReduce. So you can think of Hadoop as a cloud computing platform to provide us with distributed data mining because of the speed with which data is growing these days. All the best!

+3
source

As @Tariq suggested, cloud computing is a service (in a business sense) and the concept of providing computing services on a remote computer, while big data is a distributed structure for a large data service that a cloud service provider can use.

0
source

In one line : Big data gets all kinds of data and cloud computing that you give the end user from this collected data. For example, there may be different sensor data. A retailer may not need sensor data collected from the environment and vice versa. And, as a cloud provider, I have all the sensor data; however, based on my end user / client, I will give sensor data related to retail, or I will give sensor data related to the environment.

0
source

Hadoop stores data using HDFS, and MapReduce can be accessed efficiently, so cloud computing uses hadoop to store data and provide flexible end-user access to the cloud.

0
source

“Hadoop” refers to an ecosystem of open source projects that enable cost-effective mass distributed computing on standard hardware.

“Cloud computing” refers to a computer model in which storage and processing resources (or applications built on them) are accessible from another location via the Internet. It also implies a “useful” consumption model in which users pay for the use of these resources.

So, Hadoop can be deployed in the cloud (or not). They are not mutually exclusive.

0
source

Hadoop cannot be used for real-time processing, or we can say that online processing. For example, Hive, which is a hadoop component and has mostly SQL-like commands and an interface, cannot be used for online and real-time databases, unlike MySQL. Cloud is a generalized computing environment, but hasoop is a specialized technology that uses the Map-Reduce paradigm.

0
source

Source: https://habr.com/ru/post/954750/


All Articles