Spark is an open-source cluster-computing framework for processing big data on computer clusters. Spark usually sits on top of a distributed file system such as HDFS. While its core functionality is processing big data on computer clusters, Spark is increasingly...