0 votes
1 view
in Big Data Hadoop & Spark by (50.5k points)

What is the difference between Apache Hive and Apache Spark?

1 Answer

0 votes
by (107k points)

One of the major differences between the two is that Hive is a system built on top of Hadoop and essentially is an open-source data warehouse system whereas Apache Spark is simply a Big Data framework.

Apache Hive provides data aggregation, analysis, and query. Hive provides a SQL-like interface for querying data stored in several file systems built into Hadoop. Spark and Hadoop, both are based on the same HDFS file storage classification, so if you have invested heavily in Hadoop and infrastructure, you can use Spark with MapReduce.

Here is a video tutorial which you can watch to learn more about spark:-

Welcome to Intellipaat Community. Get your technical queries answered by top developers !