Apache Spark is an open-source, scalable, in-memory computation framework. It is widely used across the industry and supports a variety of Big Data Analytics use cases. In this talk we describe Spark's main components, its rich set of libraries, and its operation modes. The talk will be accompanied by examples and insights from our experience with Spark in the development of Big Data Security applications at IBM.
The speaker can adjust the talk content to the course subject: for an IoT course, the talk can cover more of the architectural aspects of our Spark solution in an IoT setup; for a machine learning course, it can focus more on Spark's Machine Learning libraries.