[ONLINE] High Performance Data Analytics in Python @ENCCS
Date: 18 May 2022 @ 07:00 - 10:30
Overview
Python is an industry-standard programming language for working with data on all levels of the data analytics pipeline, thanks to the rich ecosystem of libraries ranging from generic numerical libraries to special-purpose and/or domain-specific packages which are often supported by large developer communities and stable funding sources.
This online workshop is meant to give an overview of working with research data in Python using general libraries for storing, processing, analysing and sharing data. The focus is on improving performance.
After covering tools for performant processing (netcdf, numpy, pandas, scipy) on single workstations the focus shifts to parallel, distributed and GPU computing (snakemake, numba, dask, multiprocessing, mpi4py).
Agenda
For further information and updated agenda you can visit
https://events.prace-ri.eu/event/1380/
Event types:
- Workshops and courses
Activity log