Many of the important features of Apache Spark include:
• Easy integration with Hadoop and HDFS files.
• Interactive language shell because it has a standalone Scala interpreter (Spark language).
• It comprises of RDDs that can be cached by compute nodes in the cluster.
• It backs multiple analysis tools for graph processing, interactive query analysis, and real-time analysis