Explore Courses Blog Tutorials Interview Questions
0 votes
in Big Data Hadoop & Spark by (50.2k points)

What is the difference between Apache Hive and Apache Spark?

1 Answer

0 votes
by (106k points)

One of the major differences between the two is that Hive is a system built on top of Hadoop and essentially is an open-source data warehouse system whereas Apache Spark is simply a Big Data framework.

Apache Hive provides data aggregation, analysis, and query. Hive provides a SQL-like interface for querying data stored in several file systems built into Hadoop. Spark and Hadoop, both are based on the same HDFS file storage classification, so if you have invested heavily in Hadoop and infrastructure, you can use Spark with MapReduce.

Here is a video tutorial which you can watch to learn more about spark:-

Browse Categories