Date: 13 February 2018 @ 08:00 - 16:30

Apache Spark is one of the most popular computing frameworks for large-scale data processing. It also includes a machine learning library (MLlib) with distributed versions of many machine learning algorithms.

In this workshop we give an introduction to Apache Spark and explain how to use it for distributed machine learning. For the hands-on we will be using PySpark, Sparks Python API, from a Jupyter notebook environment.

https://events.prace-ri.eu/event/686/

Event types:

  • Workshops and courses


Activity log